Skip to content

pyRocksy/Tabular_Data_Analysis_and_ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Tabular Data Analysis and Machine Learning Prediction

This repository contains various datasets and scripts for assessing the performance of classification and regression machine learning algorithms on tabular data problems. The goal of this project is to provide a comprehensive set of tools and examples for anyone interested in exploring the world of tabular machine learning.

Contents

All datasets used in this project can be found in the "data" folder, and all scripts are located in the "scripts" folder. The following machine learning algorithms and techniques are used in this project:

Classification Algorithms

  • Logistic Regression
  • Decision Tree
  • Random Forest
  • XGBoost
  • GridSearchCV
  • SVM (Support Vector Machines)
  • KFold Cross-Validation
  • SMOTE (Synthetic Minority Over-sampling Technique)
  • SimpleImputer (for missing value imputation)
  • OneHotEncoder (for categorical data encoding)

Regression Algorithms

  • Linear Regression
  • Random Forest Regression
  • XGBoost Regression
  • GridSearchCV
  • KFold Cross-Validation
  • SimpleImputer (for missing value imputation)
  • StandardScaler (for feature scaling)

Python Modules

The following Python modules are used in this project:

  • pandas (for data manipulation and analysis)
  • math (for mathematical functions)
  • statistics (for statistical calculations)
  • numpy (for numerical computing)
  • sklearn (for machine learning algorithms and utilities)
  • PIL (for image processing)
  • skimage (for image processing)
  • matplotlib (for data visualization)
  • seaborn (for data visualization)

Usage

To use the scripts in this project, you will need to have Python 3 installed on your machine. You can download the latest version of Python from the official Python website(https://www.python.org/downloads/).

Once you have Python installed, you can clone this repository to your local machine using the following command:

git clone https://github.com/your-username/tabular-machine-learning.git

You can then navigate to the project directory and run any of the scripts using the following command:

python script-name.py

Contributing

If you find a bug or have a suggestion for how to improve this project, please feel free to submit an issue or pull request.

License

This project is licensed under the MIT License. Feel free to use, modify, and distribute this code as you see fit.

About

Repo for tabular machine learning projects

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published