Show simple item record

Statistical and computational trade-offs in estimation of sparse principal components

dc.creatorWang, Tengyao
dc.creatorBerthet, Quentin
dc.creatorSamworth, Richard John
dc.date.accessioned2018-11-24T23:26:27Z
dc.date.available2015-08-11T10:39:21Z
dc.date.available2018-11-24T23:26:27Z
dc.date.issued2016
dc.identifierhttps://www.repository.cam.ac.uk/handle/1810/249255
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/3830
dc.description.abstractIn recent years, Sparse Principal Component Analysis has emerged as an extremely popular dimension reduction technique for highdimensional data. The theoretical challenge, in the simplest case, is to estimate the leading eigenvector of a population covariance matrix under the assumption that this eigenvector is sparse. An impressive range of estimators have been proposed; some of these are fast to compute, while others are known to achieve the minimax optimal rate over certain Gaussian or subgaussian classes. In this paper we show that, under a widely-believed assumption from computational complexity theory, there is a fundamental trade-off between statistical and computational performance in this problem. More precisely, working with new, larger classes satisfying a Restricted Covariance Concentration condition, we show that there is an effective sample size regime in which no randomised polynomial time algorithm can achieve the minimax optimal rate. We also study the theoretical performance of a (polynomial time) variant of the well-known semidefinite relaxation estimator, revealing a subtle interplay between statistical and computational efficiency.
dc.languageen
dc.publisherInstitute of Mathematical Statistics
dc.publisherAnnals of Statistics
dc.rightshttp://creativecommons.org/licenses/by-nc/2.0/uk/
dc.rightsAttribution-NonCommercial 2.0 UK: England & Wales
dc.subjectcomputational lower bounds
dc.subjectPlanted Clique problem
dc.subjectpolynomial time algorithm
dc.subjectSparse Principal Component Analysis
dc.titleStatistical and computational trade-offs in estimation of sparse principal components
dc.typeArticle


Files in this item

FilesSizeFormatView

This item appears in the following Collection(s)

Show simple item record