Skip to content

This repository contains all the files used in my master's thesis on comparison of machine learning algorithms performance.

Notifications You must be signed in to change notification settings

Kwirtz/Masters-thesis

Repository files navigation

Masters-thesis

This repository contains all the files used in my master's thesis on comparison of machine learning and statistical classifiers.

You need to run each file inside the "Code" folder. After that you should have multiple csv files in each folders. This allows you to run the Statistical test file for each performance measure.

The full paper is the paper.pdf and if you want to change or update you can use the .tex file

The goal of this paper was to dive in the mathematics behind the machine learning algorithms.

Requirements

pip install -r requirements.txt

Data

Adult: https://archive.ics.uci.edu/ml/datasets/adult
bank: https://archive.ics.uci.edu/ml/datasets/bank+marketing
basket: https://data.world/exercises/logistic-regression-exercise-1
Breast: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)
Contraceptive: https://archive.ics.uci.edu/ml/datasets/Contraceptive+Method+Choice
CreditScoreAus: http://archive.ics.uci.edu/ml/datasets/statlog+(australian+credit+approval)
CreditScoreGer: https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data)
Iris: https://archive.ics.uci.edu/ml/datasets/iris
Sonar: http://archive.ics.uci.edu/ml/datasets/connectionist+bench+(sonar,+mines+vs.+rocks)
Wine: https://archive.ics.uci.edu/ml/datasets/wine

Alzheimer and Cell can be found in the package "Applied Predictive Modeling" in R which is accessed with the rpy2 library in python.

Results

About

This repository contains all the files used in my master's thesis on comparison of machine learning algorithms performance.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published