4.1. Motif Search
4.2. Classification
4.3. Clustering
4.5. RFClassifier
For the later tasks, the following data set was used. The database contains measures of gait from 93 patients with idiopathic PD, and 73 healthy controls. The dataset contains:
- Vertical ground reaction force records of subjects as they walked for approximately 2 minutes on level ground.
• The file contains the measures from 8 sensors for each foot.
• Each individual walks for 2 minutes, records are taken at 100 samples per second.
Thus, we have 12000 record for each 2 mins walk.
- Demographics file contains demographic information, measures of disease severity and other related measures.
The dataclass contains read the data, segment it, scale it and iterpolate it.
Applying random forest classifier on the statics from the raw signal data AFTER being filtered.
In the first file, tried to apply motif identification on different features from the time series dataset, it appeared that the shape of the dataset (pressure - no pressure) is what resulted as motif and that is not useful.
In the second file, Tried to filter the data, but still having the same problem.
Task | features | Accuracy |
---|---|---|
Severity Detection | Univariate classification (L2 sensor) | 0.39 |
Parkinson’s Classification | Univariate classification (L2 sensor) | 0.71 |
Severity Detection | Multivariate classification | 0.38 |
Parkinson’s Classification | Multivariate classification | 0.82 |
Clustering for fait time series dataset did not result in promising results. The first file tried to cluster the data from both lef and right feet. The second file applies the clustering on data from the left foot only. The third file applies the clustering on data from the right foot only.
Data Class + Applying FWHM on accumalated forces fro the right foot + applying models on the sequences.
- RFClassifier Class for training, predicting, scoring the results with Random Forest Classifier.
- RFC models In this code file, we use class data and RFC class to apply the previous different models on the data. (SUMMING UP)
- RFC basemodel using statics from raw data.
- RFC models on interpolated scaled stances with FWHM algorithm, and additional features
- RFC on scaled stances with 3 extra features and statics
- RFC on scaled stances with 6 extra features and statics
- RFC model on scaled stances from the right & left foot with 6 extra features each
- RFC model on scaled stances from the right & left foot with 6 extra features each and basemodels
- RFC model on scaled stances from 16 sensor from the right & left foot with 6 extra features each
- RFC model on scaled stances from 16 sensor from the right & left foot with the sum of all sensors and with 6 extra features each
- RFC model on scaled stances from 16 sensor from the right & left foot and the sum of all sensors and with 6 extra features each foot and the statics from base models
Model | Input data | Accuracy | Precision | Recall | F1 |
---|---|---|---|---|---|
RFC n_est=200 | Statics on filtered raw data from each sensor | 0.8329 | 0.8716 | 0.9084 | 0.8844 |
RFC n_est=200 | Right foot related: [ Interpolated Scaled stances, 3 features] | 0.7092 | 0.7708 | 0.8176 | 0.7897 |
RFC n_est=200 | Right foot related: [ Interpolated Scaled stances, 6 features] | 0.7341 | 0.7896 | 0.8370 | 0.8815 |
-
Trying hybrid models trying different hybrid models for right stances and 3 features.
-
Hybrid model Hybrid model class and a train it on for right stances and 3 features and on for right stances and 6 features. Two final notebooks for training hybrid model on all data with/without statics for binary classification and severity Detection.
First file: Finding what features are more important on models trained on statics and extracted features
Second file: Showing the most important features in different plots.