Applied Machine Learning

Disease Prediction Project

Overview:

This machine learning project comes from the Applied Machine Learning course I took in Fall 2020.

Project Goal:

The goal is to predict whether or not a patient has a certain unspecified disease. This is a binary classification problem.

Dataset:

Provided by the professor the course, the training dataset has 49,000 rows and 12 columns. Methodology:

This analysis and report of two jupyter nootbooks all has below steps.

Data Preparation

I discussed the potential data quality issues I identified about the dataset and how I applied various data preprocessing techniques to cope with those issues and performed Exploratory Data Analysis (EDA). Whenever appropriate, I enhanced my EDA with the effective data visualization.

Build, tune and evaluate various machine learning algorithms

I applied a list of machine learning algorithms covered in the course to the training data and construct disease diagnosis models. I also performed extensive model experiments with hyper-parameters’ tuning.

The first jupyter notebook has NBC, KNN, linear SVM, non-linear SVM, Random Forest and Gradient Boosting Machine. The second jupyter notebook has Logistic Regression, Artificial Neural Network/Deep Learning and Decision Tree.

Prediction and Interpretation

After building the classification models, I applied them to the test dataset (Disease Prediction Testing.csv) provided to predict if each person in the testing dataset has the disease.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Part1 Disease Prediction.ipynb		Part1 Disease Prediction.ipynb
Part2 Disease Prediction.ipynb		Part2 Disease Prediction.ipynb
README.md		README.md
Weather Forecast Testing.csv		Weather Forecast Testing.csv
Weather Forecast Training.csv		Weather Forecast Training.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Applied Machine Learning

Overview:

Project Goal:

Dataset:

This analysis and report of two jupyter nootbooks all has below steps.

Data Preparation

Build, tune and evaluate various machine learning algorithms

Prediction and Interpretation

About

Releases

Packages

Languages

Jieer334/Machine-Learning-Disease-Prediction

Folders and files

Latest commit

History

Repository files navigation

Applied Machine Learning

Overview:

Project Goal:

Dataset:

This analysis and report of two jupyter nootbooks all has below steps.

Data Preparation

Build, tune and evaluate various machine learning algorithms

Prediction and Interpretation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages