This project is a machine learning approach to classify heartbeat sounds into four categories:

- Normal (0)
- Extrahls (1)
- Murmur (2)
- Extrastole (4)
The model utilizes the Dangerous Heartbeat Dataset (DHD) from Kaggle to learn and predict heartbeat sound types. Initial experiments and model iterations are detailed below.
To run the pre-trained model, follow the steps below:

- Edit `run_model.py` to specify the audio file path to be classified.
- Run `run_model.py` to load the trained model and classify the audio file.
- That's it! The script will output the predicted heartbeat sound type (0 = Normal, 1 = Extrahls, 2 = Murmur, 4 = Extrastole).
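The label codes above can be captured in a small helper for turning a predicted class id back into a readable name (a sketch mirroring the mapping listed above; the `decode_prediction` helper is illustrative and not part of `run_model.py`):

```python
# Map the model's integer output to a human-readable heartbeat class.
# Note the label codes skip 3: Extrastole is coded as 4.
LABELS = {0: "Normal", 1: "Extrahls", 2: "Murmur", 4: "Extrastole"}

def decode_prediction(class_id: int) -> str:
    """Return the heartbeat sound type for a predicted class id."""
    try:
        return LABELS[class_id]
    except KeyError:
        raise ValueError(f"Unknown class id: {class_id}")
```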
The goal of this project is to develop an efficient, lightweight machine learning model capable of classifying heartbeat sounds into distinct categories. This model, with its compact size, is intended to be deployable on mobile devices such as smartphones.
- Baseline Model:
  - Feature Extraction: Used audio frequency as the primary feature.
  - Classifier: Basic neural network model.
  - Result: Accuracy capped at around 30%.
- Gradient Boosting:
  - Algorithm: Implemented using "PerpetualBoosters" for gradient boosting.
  - Result: No significant improvement over the baseline.
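A gradient-boosting baseline of this kind looks roughly as follows (a minimal sketch using scikit-learn's `GradientBoostingClassifier` as a stand-in, since the exact PerpetualBoosters setup is not shown here; the feature matrix is synthetic toy data):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

# Toy stand-in features: one row of frequency statistics per recording.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))     # 200 recordings, 8 frequency features
y = rng.integers(0, 4, size=200)  # 4 heartbeat classes

clf = GradientBoostingClassifier(n_estimators=50, max_depth=3, random_state=0)
clf.fit(X, y)
preds = clf.predict(X)
```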
To improve model performance, custom preprocessing techniques were developed, as follows:
- Preprocessor 1 (`preprocessor1.py`):
  - Method: Divided audio into frames based on delta time and extracted frequency-amplitude pairs from each frame using `librosa`.
  - Padding: End of data was padded to ensure uniform length across samples.
  - Datasets: Generated three datasets: mini, small, and main.
  - Result: Achieved ~72% accuracy with the main dataset. However, the model size was still large (~770 MB).
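The framing and end-padding steps can be sketched as follows (a minimal numpy illustration of the idea; the actual `preprocessor1.py` uses `librosa`, and its details may differ):

```python
import numpy as np

def extract_frames(signal, sr, delta_t=0.1):
    """Split a 1-D audio signal into frames of delta_t seconds and
    return a (frequency, amplitude) pair for the dominant bin per frame."""
    frame_len = int(sr * delta_t)
    n_frames = len(signal) // frame_len
    pairs = []
    for i in range(n_frames):
        frame = signal[i * frame_len:(i + 1) * frame_len]
        spectrum = np.abs(np.fft.rfft(frame))
        freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)
        k = int(np.argmax(spectrum))  # dominant frequency bin
        pairs.append((freqs[k], spectrum[k]))
    return np.array(pairs)

def pad_to_length(features, target_rows):
    """End-pad the feature array with zeros so all samples share one shape."""
    pad = target_rows - features.shape[0]
    return np.pad(features, ((0, max(pad, 0)), (0, 0)))
```

For a pure 440 Hz sine wave, each frame's dominant-frequency entry comes out at 440 Hz, and `pad_to_length` then brings every sample's feature array to a common row count.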
- Preprocessor 2 (`preprocessor2.py`):
  - Method: Employed alignment padding, using Euclidean distance to align smaller samples with larger ones for minimal distance across padding. This ensures heartbeat samples are consistently aligned in the padded arrays.
  - Datasets: Due to processing time, only the mini and small datasets were generated.
  - Result: Achieved ~73% accuracy using the small dataset, with a drastically reduced model size of ~6 MB (over 99% smaller than previous models), making it feasible for mobile deployment.
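The alignment-padding idea can be sketched roughly as follows (an illustrative 1-D implementation, not the project's `preprocessor2.py`): slide the shorter sample across a reference, keep the offset with the smallest Euclidean distance, and zero-pad around it.

```python
import numpy as np

def align_pad(sample, reference):
    """Pad a shorter 1-D sample to the reference length, choosing the
    offset that minimizes Euclidean distance to the reference segment."""
    gap = len(reference) - len(sample)
    if gap < 0:
        raise ValueError("sample longer than reference")
    best_offset, best_dist = 0, np.inf
    for offset in range(gap + 1):
        segment = reference[offset:offset + len(sample)]
        dist = np.linalg.norm(segment - sample)
        if dist < best_dist:
            best_offset, best_dist = offset, dist
    # Zero-pad on both sides so the sample sits at the best offset.
    return np.pad(sample, (best_offset, gap - best_offset))
```

Compared with plain end-padding, this keeps the heartbeat peaks of different recordings in roughly the same array positions, which is what makes a much smaller classifier viable.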
The current model achieves ~73% accuracy on the small dataset, and its compact size (~6 MB) supports local execution on mobile devices.
- Data Preprocessing
  - `preprocessor1.py`: Initial preprocessing script that segments audio and pads data to uniform length.
  - `preprocessor2.py`: Advanced preprocessing script for alignment padding based on Euclidean distance.
- Model Training
  - `train.py`: Script used to train the model on the processed datasets.
- Model Inference
  - `run_model.py`: Single script to load the trained model and run inference on a given audio file; uses `alignment_reference.pkl`, `amp_scaler.pkl`, `freq_scaler.pkl`, and `final_model_fcnn_classifier_16_8_7368.pth`.
- Additional Feature Engineering: Exploring features beyond frequency and amplitude, such as Mel-frequency cepstral coefficients (MFCCs) or spectral contrast.
- Model Optimization: Experimenting with quantization and pruning techniques to further reduce model size without sacrificing accuracy.
- Other approaches: Trying other architectures such as CNNs or RNNs instead of a fully connected neural network.
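The quantization idea mentioned above can be illustrated with a minimal numpy sketch (not the project's code): storing weights as int8 plus a single float scale cuts storage roughly 4x versus float32, at the cost of a small rounding error.

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: map float weights into
    [-127, 127] using one shared scale factor."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float32 weights from the int8 representation."""
    return q.astype(np.float32) * scale

# Toy weight matrix standing in for one layer of the classifier.
w = np.random.default_rng(1).normal(size=(16, 8)).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
```

In a real deployment this would be done with a framework's quantization tooling (e.g. PyTorch's built-in quantization) rather than by hand, but the storage arithmetic is the same.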
Dataset used: Dangerous Heartbeat Dataset (DHD).