DFKI-LT - Novel Properties and Well-Tried Performance of EM-Based Multivariate Clustering

Detlef Prescher
Novel Properties and Well-Tried Performance of EM-Based Multivariate Clustering
1 Proceedings of the EuroConference on Recent Advances in Natural Language Processing (RANLP-01), September 5-7, Pages 216-222, Tzigov Chark, Bulgaria, o.A., 2001
 
We present three novel properties for EM-based multivariate clustering: simplified re-estimation formulas, a simple pruning technique, and a novel invariance property preserving the characteristics of the given empirical distribution. Evaluation on two tasks shows: EM-based multivariate clustering models require only twice the storage space of the original sample, and these models yield reliable estimates for unknown data. Moreover we refer to selected experiments showing that EM-based multivariate clustering improves several real-world applications.
 
Files: BibTeX, Prescher:2001:NPW.pdf