Skip to content

chayandatta/Heart_disease_prediction

Repository files navigation

Heart Disease Prediction

Binder

Final code

Welcome to the Heart Disease Prediction GitHub repository! This project is designed to help beginners learn the fundamentals of machine learning in a hands-on and interactive way. Whether you're completely new to machine learning or looking to refresh your knowledge, this repository has something for you.

Table of Contents

Introduction

Machine learning is a fascinating and rapidly growing field that involves building algorithms and models that can learn from data and make predictions or decisions. This repository is aimed at beginners who want to dive into the world of machine learning. Here, you'll find a collection of beginner-friendly tutorials, code examples, and resources to help you understand the core concepts of machine learning.

Getting Started

Before you dive into the tutorials and examples in this repository, make sure you have the following prerequisites:

  • Basic programming knowledge (preferably in Python).
  • Understanding of fundamental mathematical concepts (linear algebra, calculus, probability).
  • Python 3.x installed on your system.

Project Structure

This repository is organized into the following sections:

  1. Tutorials: Step-by-step tutorials that cover fundamental machine learning concepts. These tutorials include code examples and explanations to help you grasp the concepts.

  2. Code Examples: A collection of standalone code examples that demonstrate various machine learning techniques and algorithms. These examples serve as building blocks for your own projects.

  3. Datasets: Sample datasets that you can use to practice and experiment with machine learning algorithms.

  4. Resources: Additional resources, including books, courses, and articles, to further your understanding of machine learning.

How to Use This Repository

  1. Clone the Repository: Start by cloning this repository to your local machine using git clone.
git clone https://github.com/chayandatta/Heart_disease_prediction.git
  1. Explore Tutorials and Code Examples: Browse the Tutorials and Code Examples directories to learn and experiment with different machine learning concepts. Each folder contains its own README file with instructions.

  2. Practice with Datasets: If you want to practice on real data, check out the Datasets directory for sample datasets that you can use in your machine learning projects.

  3. Check Resources: Explore the Resources section for links to books, online courses, and articles to deepen your knowledge of machine learning.

  4. Contribute: If you have your own machine learning examples or resources that you'd like to share, feel free to contribute to this repository. See the Contributing section for guidelines.

Contributing

contributions from the community is welcomed! If you'd like to contribute to this project, please follow these guidelines:

  1. Fork this repository.
  2. Create a new branch for your contribution.
  3. Make your changes and improvements.
  4. Test your changes thoroughly.
  5. Create a pull request (PR) with a clear description of your contribution.

I appreciate your help in making machine learning more accessible to beginners.

License

This project is licensed under the MIT License. You can find the full license details in the LICENSE file.


I hope you find this repository helpful on your journey to learning machine learning. If you have any questions or encounter issues, please feel free to open an issue, and I'll be happy to assist you. Happy learning! 🚀

Heart Disease prediction using 5 algorithms

- Logistic regression,
- Random forest,
- Naive Bayes,
- KNN(K Nearest Neighbors),
- Decision Tree

then improved accuracy by adjusting different aspect of algorithms.

Final Decision tree


Dataset source (link)

Dataset Creators:

  1. Hungarian Institute of Cardiology. Budapest: Andras Janosi, M.D.
  2. University Hospital, Zurich, Switzerland: William Steinbrunn, M.D.
  3. University Hospital, Basel, Switzerland: Matthias Pfisterer, M.D.
  4. V.A. Medical Center, Long Beach and Cleveland Clinic Foundation: Robert Detrano, M.D., Ph.D.

Data Set Information:

This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML researchers to this date. The "goal" field refers to the presence of heart disease in the patient. It is integer valued from 0 (no presence) to 4. Experiments with the Cleveland database have concentrated on simply attempting to distinguish presence (values 1,2,3,4) from absence (value 0).


Star History

Star History Chart