A Computational Perspective on Projection Pursuit in High Dimensions: Feasible or Infeasible Feature Extraction
Authors: Chunming Zhang, Jimin Ye, Xiaomei Wang
Institution: 1. Department of Statistics, University of Wisconsin-Madison, Madison, WI, 53706 USA; 2. School of Mathematics and Statistics, Xidian University, Xi'an, Shaanxi, 710071 China; 3. School of Management, Northwestern Polytechnical University, Xi'an, Shaanxi, 710072 China
Abstract: Finding a suitable representation of multivariate data is fundamental in many scientific disciplines. Projection pursuit (PP) aims to extract interesting ‘non-Gaussian’ features from multivariate data, and tends to be computationally intensive even when applied to low-dimensional data. In high-dimensional settings, recent work on PP (Bickel et al., 2018) gives an asymptotic characterization of, and conjectures about, the feasible projections as the dimension grows with the sample size. To make PP practically useful and to gain theoretical insight into it in an integrated way, data-analytic tools for evaluating the behaviour of PP in high dimensions are increasingly desirable, yet remain little explored in the literature. This paper develops computationally fast and effective approaches for finite-sample studies, aimed at (i) visualizing the feasibility of PP in extracting features from high-dimensional data, as compared with alternative methods such as PCA and ICA, and (ii) assessing the plausibility of PP in cases where asymptotic studies are lacking or unavailable, with the goal of better understanding the practicality, limitations and challenges of PP in the analysis of large data sets.
Keywords: density estimation, empirical distribution function, exploratory data analysis, Gaussian mixture, ICA, PCA
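
To make the idea of searching for a ‘non-Gaussian’ projection concrete, the following is a minimal Python sketch. It is not the method developed in the paper. It assumes an absolute excess-kurtosis projection index and a plain random search over unit directions, and it whitens the data first, which requires the sample size to be well above the dimension so that the sample covariance is invertible. The function names `whiten` and `pp_random_search` are hypothetical.

```python
import numpy as np


def whiten(X):
    """Center X (n x p) and rescale it so its sample covariance is the identity.

    Assumes n is well above p, so the sample covariance is invertible.
    """
    Xc = X - X.mean(axis=0)
    cov = np.cov(Xc, rowvar=False)
    evals, evecs = np.linalg.eigh(cov)
    # Symmetric inverse square root of the covariance matrix.
    W = evecs @ np.diag(evals ** -0.5) @ evecs.T
    return Xc @ W


def pp_random_search(X, n_candidates=2000, seed=0):
    """Crude projection pursuit: pick, among random unit directions, the one
    whose projection of the whitened data has the largest |excess kurtosis|.

    The index and the search strategy are illustrative assumptions only.
    """
    rng = np.random.default_rng(seed)
    Z = whiten(X)
    dirs = rng.standard_normal((n_candidates, Z.shape[1]))
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    proj = Z @ dirs.T                                  # shape (n, n_candidates)
    proj = (proj - proj.mean(axis=0)) / proj.std(axis=0)
    index = np.abs(np.mean(proj ** 4, axis=0) - 3.0)   # 0 for a Gaussian
    best = int(np.argmax(index))
    return dirs[best], float(index[best])


if __name__ == "__main__":
    # Toy data: a balanced two-component Gaussian mixture along coordinate 0,
    # with pure Gaussian noise in the remaining coordinates.
    rng = np.random.default_rng(1)
    X = rng.standard_normal((500, 5))
    X[:, 0] += 3.0 * rng.choice([-1.0, 1.0], size=500)
    direction, value = pp_random_search(X)
    print("direction (in whitened coordinates):", np.round(direction, 2))
    print("absolute excess kurtosis:", round(value, 3))
```

With the dimension fixed and small, a random search of this kind is cheap. As the dimension grows relative to the sample size, the whitening step breaks down and the number of candidate directions needed grows quickly, which is the regime the abstract is concerned with.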