This dataset is about various factors affecting obesity levels and use of machine learning algorithms for classification analysis and visualization of centers.
Below are briefly described important parts of this project.
- Principal Component Analysis (PCA) to identify features with most variance and visualize the distribution of those features.
- Unsupervised K-means algorithm to visualize the centers of features.
- Classification Analysis with Automatic Classifiers using Training and Testing sets (Supervised algorithms KNN, RandomForest and SVM).