This repository contains all the files used in my master's thesis on comparison of machine learning and statistical classifiers.
You need to run each file inside the "Code" folder. After that you should have multiple csv files in each folders. This allows you to run the Statistical test file for each performance measure.
The full paper is the paper.pdf and if you want to change or update you can use the .tex file
The goal of this paper was to dive in the mathematics behind the machine learning algorithms.
pip install -r requirements.txt
Adult: https://archive.ics.uci.edu/ml/datasets/adult
bank: https://archive.ics.uci.edu/ml/datasets/bank+marketing
basket: https://data.world/exercises/logistic-regression-exercise-1
Breast: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)
Contraceptive: https://archive.ics.uci.edu/ml/datasets/Contraceptive+Method+Choice
CreditScoreAus: http://archive.ics.uci.edu/ml/datasets/statlog+(australian+credit+approval)
CreditScoreGer: https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data)
Iris: https://archive.ics.uci.edu/ml/datasets/iris
Sonar: http://archive.ics.uci.edu/ml/datasets/connectionist+bench+(sonar,+mines+vs.+rocks)
Wine: https://archive.ics.uci.edu/ml/datasets/wine
Alzheimer and Cell can be found in the package "Applied Predictive Modeling" in R which is accessed with the rpy2 library in python.