Date of Award

3-2006

Document Type

Thesis

Degree Name

Master of Science

Department

Department of Electrical and Computer Engineering

First Advisor

Steven C. Gustafson, PhD

Abstract

The purpose of this research was to improve performance in speech recognition. Specifically, a new approach was investigating by applying an integral transform known as the Mellin transform (MT) on the output of an auditory model to improve the recognition rate of phonemes through the scale-invariance property of the Mellin transform. Scale-invariance means that as a time-domain signal is subjected to dilations, the distribution of the signal in the MT domain remains unaffected. An auditory model was used to transform speech waveforms into images representing how the brain "sees" a sound. The MT was applied and features were extracted. The features were used in a speech recognizer based on Hidden Markov Models. The results from speech recognition experiments showed an increase in recognition rates for some phonemes compared to traditional methods.

AFIT Designator

AFIT-GE-ENG-06-22

DTIC Accession Number

ADA451292

Recommended Citation

Hornback, Jesse R., "Speech Recognition Using the Mellin Transform" (2006). Theses and Dissertations. 3484.
https://scholar.afit.edu/etd/3484

Download

Included in

Computational Linguistics Commons, Language Interpretation and Translation Commons

COinS

Theses and Dissertations

Speech Recognition Using the Mellin Transform

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Search

Browse

Author Corner

Theses and Dissertations

Speech Recognition Using the Mellin Transform

Author

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Share

Search

Browse

Author Corner