Date of Award

3-21-2013

Document Type

Thesis

Degree Name

Master of Science

Department

Department of Operational Sciences

First Advisor

Kenneth W. Bauer, PhD.

Abstract

A well-known multivariate data reduction method is principal components analysis (PCA). PCA transforms the variables under study into a set of components that are used to summarize the variation among the variables. The benefit is the dimension of the data may be reduced by the descriptive power of the components, permitting tractable analysis on large and messy datasets. Integral to successful PCA is determining when to stop extracting components - the matter is not a trivial one. A component extraction stopping rule that consistently produces reliable estimates of principal components is Horn's test. The drawback of the test is it requires a large amount of random data to evaluate the hidden component structure. Leveraging the flexibility and power of the MATLAB software package, a lookup table interpolates nearest neighbor searches of pre-processed mean eigenvalue data to provide real-time results for datasets up to 1,000 variables on 7,000 samples. The methodology is extended to a linear regression second-order model producing Horn's curve, significantly reducing the required size of the lookup table with no loss of resolution into the dimensionality estimate.

AFIT Designator

AFIT-ENS-13-M-01

DTIC Accession Number

ADA582666

Share

COinS