BEARdocs

Topics in dimension reduction and missing data in statistical discrimination.

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Tubbs, Jack Dale.
dc.contributor.author Young, Phil D.
dc.contributor.other Baylor University. Dept. of Statistical Sciences. en
dc.date.copyright 2009-12
dc.identifier.uri http://hdl.handle.net/2104/5543
dc.description.abstract This dissertation is comprised of four chapters. In the first chapter, we define the concept of linear dimension reduction, review some popular linear dimension reduction procedures, discuss background research that we use in chapters two and three, and give a brief outline of the dissertation contents. In chapter two, we derive a linear dimension reduction (LDR) procedure for statistical discriminant analysis for multiple multivariate skew-normal populations. First, we define the multivariate skew-normal distribution and give several applications of its use. We also provide marginal and conditional properties of the MSN random vector. Then, we state and prove several lemmas used in a series of theorems that present our LDR procedure for the multivariate skew-normal populations using parameter configurations. Lastly, we illustrate our LDR method for multiple multivariate skew-normal distributions with three examples. In the third chapter, we define and rigorously prove the existence of the multivariate singular skew-normal (MSSN) distribution. Next, we state and prove distributional properties for linear combinations, marginal, and conditional random variables from a MSSN distribution. Then, we state and prove several lemmas used in deriving our LDR transformation for the multiple MSSN distributions with assorted parameter combinations. We then state and prove several theorems concerning the formulation of our LDR technique. Finally, we illustrate the effectiveness of our LDR technique for multiple multivariate singular skew-normal classes with two examples. In chapter four, we compare two statistical linear discrimination procedures when monotone missing training data exists in the training data sets from two different multivariate normally distributed populations with unequal means but equal covariance matrices. We derive the maximum likelihood estimators (MLEs) for the partitioned population means and the common covariance matrix in an appendix. Additionally, we contrast two classifiers: a linear combination discriminant function derived from Chung and Han (C-H) (2000) and a linear classifier based on the MLE of two multivariate normal training samples with identical monotone missing training-data in one or more features. We then perform two Monte Carlo simulations with various parameter configurations to compare the effectiveness of the MLE and C-H classifiers as the correlation between features for the population covariance matrix increases. Moreover, we compare the two competing classifiers using parametric bootstrap estimated expected error rates for a subset of the well-known Iris data. en
dc.rights Baylor University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. Contact librarywebmaster@baylor.edu for inquiries about permission. en
dc.subject Dimension reduction. en
dc.subject Statistical discrimination. en
dc.title Topics in dimension reduction and missing data in statistical discrimination. en
dc.type Thesis en
dc.description.degree Ph.D. en
dc.rights.accessrights Worldwide access en
dc.rights.accessrights Access changed 7/16/12
dc.contributor.department Statistical Sciences. en


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BEARdocs


Advanced Search

Browse

My Account

Statistics