PG-means: learning the number of clusters in data.

DSpace/Manakin Repository

BEARdocs is currently undergoing a scheduled upgrade. We expect the upgrade to be completed no later than Monday, March 2nd, 2015. During this time you will be able to access existing documents, but will not be able to log in or submit new documents.

Show simple item record

dc.contributor.advisor Hamerly, Gregory James, 1977- Feng, Yu.
dc.contributor.other Baylor University. Dept. of Computer Science. en 2006-12
dc.description.abstract We present a novel algorithm called PG-means in this thesis. This algorithm is able to determine the number of clusters in a classical Gaussian mixture model automatically. PG-means uses efficient statistical hypothesis tests on one-dimensional projections of the data and model to determine if the examples are well represented by the model. In so doing, we apply a statistical test to the entire model at once, not just on a per-cluster basis. We show that this method works well in difficult cases such as overlapping clusters, eccentric clusters and high dimensional clusters. PG-means also works well on non-Gaussian clusters and many true clusters. Further, the new approach provides a much more stable estimate of the number of clusters than current methods. en
dc.rights Baylor University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. Contact for inquiries about permission. en
dc.subject Algorithms. en
dc.subject Computer network architecture. en
dc.title PG-means: learning the number of clusters in data. en
dc.type Thesis en M.S. en
dc.rights.accessrights Worldwide access en
dc.contributor.department Computer Science. en

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search BEARdocs

Advanced Search


My Account