- We will learn how to predict tags for posts from StackOverflow. To solve this task you will use multilabel classification approach.
This tutorial requires the following packages:
- Numpy — a package for scientific computing.
- Pandas — a library providing high-performance, easy-to-use data structures and data analysis tools for the Python
- scikit-learn — a tool for data mining and data analysis.
- NLTK — a platform to work with natural language.
- pip install nltk
- pip install sklearn
- pip install pandas
- pip install numpy
Got the dataset from Natural Language Processing course (Coursera) https://www.coursera.org/learn/language-processing