首页 | 本学科首页   官方微博 | 高级检索  
     


Multivariate Prediction with Nonlinear Principal Components Analysis: Theory
Authors:JOHN?C.?GOWER,J?RG?BLASIUS  author-information"  >  author-information__contact u-icon-before"  >  mailto:jblasius@uni-bonn.de"   title="  jblasius@uni-bonn.de"   itemprop="  email"   data-track="  click"   data-track-action="  Email author"   data-track-label="  "  >Email author
Affiliation:(1) Department of Statistics, The Open University, Walton Hall, Milton Keynes, MK7 6AA, U.K;(2) Seminar for Sociology, University of Bonn, Lennéstr. 27, 53113 Bonn, Germany
Abstract:We propose the notion of multivariate predictability as a measure of goodness-of-fit in data reduction techniques which are useful for visualizing and screening data. For quantitative variables this leads to the usual sums-of-squares and variance accounted for criteria. For categorical variables we show how to predict the category-levels of all variables associated with every point (case). The proportion of predictions which agree with the true categories gives the measure of fit. The ideas are very general; as an illustration we use nonlinear principal components analysis (NLPCA) in association with ordered categorical variables. A detailed example using data from the International Social Survey Program (ISSP) will be given in Blasius and Gower (quality and quantity, 39, to appear). It will be shown that the predictability criterion suggests that the fits are rather better than is indicated by “percentage of variance accounted for”.This article was written while John Gower was a visiting professor at the ZA-Eurolab, at the Zentralarchiv für Empirische Sozialforschung, University of Cologne, Germany. The ZA is a Large Scale Facility funded by the Training and Mobility of Researchers program of the European Union.
Keywords:biplot  large scale data analysis  nonlinear principal components analysis  prediction
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号