Theses and Dissertations

Channel-Mismatch Compensation in Speaker Identification Feature Selection and Adaptation with Artificial Neural Networks

Edmund A. Fitzgerald

Date of Award

3-1998

Document Type

Thesis

Degree Name

Master of Science

Department

Department of Electrical and Computer Engineering

First Advisor

Martin DeSimio, PhD

Abstract

We develop and present results of an artificial neural network (ANN) based compensation technique for mismatched classifier training and testing conditions in speaker identification (SID). One ANN per feature per speaker is trained to perform a mapping of that feature from a corrupted condition to an undistorted condition. Therefore, a classifier trained under one condition may be used to classify data collected under a different condition. Speech utterances from 168 speakers, collected in a studio, and also re-recorded after transmission over telephone networks, are used for developing and testing the method. Peak formant resonant frequencies, their bandwidths, and pitch are used as features. These features from the studio speech are used to train Gaussian Mixture Model classifiers. Portions of the studio and telephone speech are used to train the compensation ANNs. In mismatched train and test conditions, features from telephone speech are modified by the trained ANNs and applied to the GMMs trained with features from studio speech. Without compensation, SID accuracy is 6%. The compensation method developed in this work provides mismatch SID accuracy of 58.3%. Previous research on the same data with the commonly used Mel Frequency Cepstral Coefficients as features and a typical compensation method of Cepstral Mean Subtraction with Band Limiting gives SID accuracy of 27.4% with the same type of classifiers.

AFIT Designator

AFIT-GE-ENG-98M-02

DTIC Accession Number

ADA342401

Recommended Citation

Fitzgerald, Edmund A., "Channel-Mismatch Compensation in Speaker Identification Feature Selection and Adaptation with Artificial Neural Networks" (1998). Theses and Dissertations. 5629.
https://scholar.afit.edu/etd/5629

Download

Included in

Signal Processing Commons

COinS

Theses and Dissertations

Channel-Mismatch Compensation in Speaker Identification Feature Selection and Adaptation with Artificial Neural Networks

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Search

Browse

Author Corner

Theses and Dissertations

Channel-Mismatch Compensation in Speaker Identification Feature Selection and Adaptation with Artificial Neural Networks

Author

Date of Award

Document Type

Degree Name

Department

First Advisor

Abstract

AFIT Designator

DTIC Accession Number

Recommended Citation

Included in

Share

Search

Browse

Author Corner