The project includes building seven different machine learning classifiers (including Linear Regression, Decision Tree, Bagging, Random Forest, Gradient Boost, AdaBoost, and XGBoost) using Original, OverSampled, and UnderSampled data of ReneWind case study, tuning hyperparameters of the models, performance comparisons, and pipeline development for productionizing the final model.
Outline
-
Data Overview
-
Exploratory Data Analysis (EDA)
-
Data Preprocessing
-
Model Evaluation Criterion
-
Model Building with Original Data
-
Model Building with OverSampled Data
-
Model Building with Undersampled data
-
Model Selection for Tuning
-
Hyperparameter Tuning
-
Comparing all Models
-
The Final Model
-
Pipelines for Productionizing the Final Model