The main idea of principal component analysis (PCA) is to reduce the dimensionality of a dataset consisting of many interrelated variables, whether heavily or lightly correlated, while retaining as much of the variation present in the dataset as possible. This is achieved by transforming the original variables into a new set of variables, the principal components (or simply, the PCs), which are orthogonal and ordered so that the amount of variation they retain from the original variables decreases as we move down the order. In this way, the first principal component retains the maximum variation that was present in the original variables.
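As a minimal sketch of this idea, assuming scikit-learn is available, the snippet below projects a correlated toy dataset onto its first two principal components; the data and variable names are purely illustrative.

```python
import numpy as np
from sklearn.decomposition import PCA

# Toy data: 100 samples, 5 features built from 2 underlying factors,
# so the features are strongly correlated (illustrative only)
rng = np.random.default_rng(0)
factors = rng.normal(size=(100, 2))
X = factors @ rng.normal(size=(2, 5)) + 0.1 * rng.normal(size=(100, 5))

# Keep only the first 2 principal components
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

# PCs are ordered by the variance they retain:
# the first entry is the largest
print(pca.explained_variance_ratio_)
```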
Correlation indicates that there is redundancy in the data. Because of this redundancy, PCA can reduce the original variables to a smaller number of new variables (the principal components) that explain most of the variance in the original variables. The eigenvectors and eigenvalues of a covariance (or correlation) matrix represent the “core” of a PCA: the eigenvectors (principal components) determine the directions of the new feature space, and the eigenvalues determine their magnitude. In other words, the eigenvalues give the variance of the data along the new feature axes.
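This eigendecomposition can be carried out directly with NumPy; the following sketch, using invented toy data, shows that the sorted eigenvalues are the variances along the new axes and the eigenvector columns are the principal directions.

```python
import numpy as np

# Correlated toy data: 200 samples, 3 features (illustrative only)
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3)) @ np.array([[2.0, 0.5, 0.0],
                                          [0.5, 1.0, 0.0],
                                          [0.0, 0.0, 0.2]])

# Center the data, then eigendecompose its covariance matrix
Xc = X - X.mean(axis=0)
cov = np.cov(Xc, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)   # eigh: covariance is symmetric

# Sort descending: the largest eigenvalue corresponds to the
# direction of greatest variance (the first principal component)
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

print(eigvals)   # variance of the data along each new feature axis
print(eigvecs)   # columns are the principal directions
```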
Covariance matrix

The classic approach to PCA is to perform an eigendecomposition of the covariance matrix, a d×d matrix in which each element represents the covariance between two of the d features.
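A short sketch of how that d×d matrix is built, assuming centered data: the manual computation Xcᵀ·Xc / (n − 1) should match NumPy's built-in np.cov, and the shape of the result is d×d regardless of the number of samples n.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 4))          # n = 50 samples, d = 4 features

# Manual covariance: (1 / (n - 1)) * Xc^T Xc on the centered data
Xc = X - X.mean(axis=0)
cov_manual = Xc.T @ Xc / (X.shape[0] - 1)

assert cov_manual.shape == (4, 4)     # d x d, as described above
assert np.allclose(cov_manual, np.cov(X, rowvar=False))
```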