Data-Pattern Discovery Methods for Detection
in Nongaussian High-dimensional Data Sets

Cécile Levasseur, Ken Kreutz-Delgado, Uwe F. Mayer and Gregory Gancarz

Abstract: Many important expert system applications depend on the ability to accurately detect or predict the occurrence of key events given a data set of observations. We concentrate on multidimensional data that are highly nongaussian (continuous and/or discrete), noisy and nonlinearly related. We investigate the feasibility of data-pattern discovery and event detection by applying generalized principal component analysis (GPCA) techniques for pattern extraction based on an exponential family probability distribution assumption. We develop theoretical extensions of the GPCA model by exploiting results from the theory of generalized linear models and nonparametric mixture density estimation.

Key words: Principal component analysis, exponential family probability distribution, generalized linear models.


You can download a copy of this paper (about 5 pages).

Mayer18.pdf This file is in Portable Document Format. (210 Kbytes)


[leftarrow]Back

mayer@math.utah.edu
First posted: Sun Nov 13 21:36:20 PST 2005
Last updated: Thu May 10 18:04:11 PDT 2007