A set of tasks as an introduction to data mining.
- Task 1: Exploratory Data Analysis
- Task 2: Rank the numerical features using ANOVA; PCA and KPCA
- Task 3: Linear Discriminant Functions and KNN
- Task 4: Model Evaluation
- Paper analysis: Replicate the steps followed on a published paper and added improvements on it
Note: Initial folder contains the first project just to initial exploration
- ubuntu 20.04
- python 3
-
Clone of the repository and
cd kdd
-
Create a virtual environment:
1.1
python -m venv venv
1.2source venv/bin/activate
-
Install the requirements
pip install -r requirements.txt
-
Run:
jupyter notebook
-
Open the following address on browser:
http://localhost:8888/tree
-
Run:
jupyter notebook list
-
Copy the token and paste on
http://localhost:8888/tree
box to password or token
- Catarina Silva - catarinaacsilva
This project is licensed under the MIT License - see the LICENSE file for details