Date of Award
9-1992
Document Type
Thesis
Degree Name
Master of Science
Department
Department of Electrical and Computer Engineering
First Advisor
Matthew Kabrisky, PhD
Abstract
A system has been developed that tracks lip motion using infrared (IR) or visible detectors. The purpose of this study was to determine if the additional information obtained from the IR or visible detectors can be used to increase the recognition rate of audio Automatic Speech Recognition (ASR) systems. To accomplish this goal, several hardware analog prototypes had to be designed, built and tested. Different detectors (IR and visible) and modes of operation (active and passive) were tried before a reliable and useful signal was found. An analog-to-digital (A/D) board was then designed and built that digitized both the microphone and photo signals. Software algorithms, executed from a desktop PC, were used to interface with the A/D board, process the digitized data, and perform certain optical and audio ASR experiments. The results showed that isolated ASR audio recognition rates increased after using additional information gained from the photo speech signals. However, the results for the continuous case were inconclusive since not all of the available photo information was utilized to perform ASR experiments.
AFIT Designator
AFIT-GE-ENG-93M-01
DTIC Accession Number
ADA262490
Recommended Citation
Marshall, Patrick T., "Speech Recognition Using Visible and Infrared Detectors" (1992). Theses and Dissertations. 7589.
https://scholar.afit.edu/etd/7589
Comments
The author's Vita page is omitted.