In statistics, principal component regression (PCR) is a regression analysis technique that is based on principal component analysis (PCA). More specifically, PCR is used for estimating the unknown regression coefficients in a standard linear regression model.
In PCR, instead of regressing the dependent variable on the explanatory variables directly, the principal components of the explanatory variables are used as regressors. One typically uses only a subset of all the principal components for regression, making PCR a kind of regularized procedure and also a type of shrinkage estimator.
Often the principal components with higher variances (the ones based on eigenvectors corresponding to the higher eigenvalues of the sample variance-covariance matrix of the explanatory variables) are selected as regressors. However, for the purpose of predicting the outcome, the principal components with low variances may also be important, in some cases even more important.[1]
One major use of PCR lies in overcoming the multicollinearity problem which arises when two or more of the explanatory variables are close to being collinear.[2] PCR can aptly deal with such situations by excluding some of the low-variance principal components in the regression step. In addition, by usually regressing on only a subset of all the principal components, PCR can result in dimension reduction through substantially lowering the effective number of parameters characterizing the underlying model. This can be particularly useful in settings with high-dimensional covariates. Also, through appropriate selection of the principal components to be used for regression, PCR can lead to efficient prediction of the outcome based on the assumed model.
^Jolliffe, Ian T. (1982). "A note on the Use of Principal Components in Regression". Journal of the Royal Statistical Society, Series C. 31 (3): 300–303. doi:10.2307/2348005. JSTOR 2348005.
^Dodge, Y. (2003) The Oxford Dictionary of Statistical Terms, OUP. ISBN 0-19-920613-9
and 29 Related for: Principal component regression information
In statistics, principalcomponentregression (PCR) is a regression analysis technique that is based on principalcomponent analysis (PCA). More specifically...
reduce them to a few principalcomponents and then run the regression against them, a method called principalcomponentregression. Dimensionality reduction...
Partial least squares regression (PLS regression) is a statistical method that bears some relation to principalcomponentsregression; instead of finding...
linear regression; for more than one, the process is called multiple linear regression. This term is distinct from multivariate linear regression, where...
In statistics, ordinal regression, also called ordinal classification, is a type of regression analysis used for predicting an ordinal variable, i.e....
regression and classification (e.g., functional linear regression). Scree plots and other methods can be used to determine the number of components included...
taken into account. It is a generalization of Deming regression and also of orthogonal regression, and can be applied to both linear and non-linear models...
Ridge regression is a method of estimating the coefficients of multiple-regression models in scenarios where the independent variables are highly correlated...
Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional...
Tikhonov regularization. Tikhonov regularization, along with principalcomponentregression and many other regularization schemes, fall under the umbrella...
applications of FPCA include the modes of variation and functional principalcomponentregression. Functional linear models can be viewed as an extension of the...
calibration techniques such as partial-least squares regression, or principalcomponentregression (and near countless other methods) are then used to...
In statistics, specifically regression analysis, a binary regression estimates a relationship between one or more explanatory variables and a single output...
In statistics, polynomial regression is a form of regression analysis in which the relationship between the independent variable x and the dependent variable...
Local regression or local polynomial regression, also known as moving regression, is a generalization of the moving average and polynomial regression. Its...
Robust PrincipalComponent Analysis (RPCA) is a modification of the widely used statistical procedure of principalcomponent analysis (PCA) which works...
(e.g., nonparametric regression). Regression analysis is primarily used for two conceptually distinct purposes. First, regression analysis is widely used...
In statistics, multinomial logistic regression is a classification method that generalizes logistic regression to multiclass problems, i.e. with more than...
Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes...
Segmented regression, also known as piecewise regression or broken-stick regression, is a method in regression analysis in which the independent variable...
case, the "regression" effect is statistically likely to occur, but in the second case, it may occur less strongly or not at all. Regression toward the...
(WLS), also known as weighted linear regression, is a generalization of ordinary least squares and linear regression in which knowledge of the unequal variance...
In statistics, nonlinear regression is a form of regression analysis in which observational data are modeled by a function which is a nonlinear combination...
In statistics, semiparametric regression includes regression models that combine parametric and nonparametric models. They are often used in situations...
especially in the case of a simple linear regression, in which there is a single regressor on the right side of the regression equation. The OLS estimator is consistent...
In statistics, binomial regression is a regression analysis technique in which the response (often referred to as Y) has a binomial distribution: it is...