This repository contains various datasets and scripts for assessing the performance of classification and regression machine learning algorithms on tabular data problems. The goal of this project is to provide a comprehensive set of tools and examples for anyone interested in exploring the world of tabular machine learning.
All datasets used in this project can be found in the "data" folder, and all scripts are located in the "scripts" folder. The following machine learning algorithms and techniques are used in this project:
- Logistic Regression
- Decision Tree
- Random Forest
- XGBoost
- GridSearchCV
- SVM (Support Vector Machines)
- KFold Cross-Validation
- SMOTE (Synthetic Minority Over-sampling Technique)
- SimpleImputer (for missing value imputation)
- OneHotEncoder (for categorical data encoding)
- Linear Regression
- Random Forest Regression
- XGBoost Regression
- GridSearchCV
- KFold Cross-Validation
- SimpleImputer (for missing value imputation)
- StandardScaler (for feature scaling)
The following Python modules are used in this project:
- pandas (for data manipulation and analysis)
- math (for mathematical functions)
- statistics (for statistical calculations)
- numpy (for numerical computing)
- sklearn (for machine learning algorithms and utilities)
- PIL (for image processing)
- skimage (for image processing)
- matplotlib (for data visualization)
- seaborn (for data visualization)
To use the scripts in this project, you will need to have Python 3 installed on your machine. You can download the latest version of Python from the official Python website(https://www.python.org/downloads/).
Once you have Python installed, you can clone this repository to your local machine using the following command:
git clone https://github.com/your-username/tabular-machine-learning.git
You can then navigate to the project directory and run any of the scripts using the following command:
python script-name.py
If you find a bug or have a suggestion for how to improve this project, please feel free to submit an issue or pull request.
This project is licensed under the MIT License. Feel free to use, modify, and distribute this code as you see fit.