Lab assignments for the course ID2222-Data Mining at KTH
Updated Oct 27, 2022 · Roff
Information Retrieval, Information Extraction and Data Mining projects
Association Rules & Data Streams
Code for finding consistent topics in a tweets dataset
This repository implements frequent-itemset mining with the A-Priori and PCY algorithms on Apache Kafka. It uses a 15 GB .json file sampled from the 100+ GB Amazon_Reviews_Metadata dataset. It was developed as an assignment for the course Fundamentals of Big Data Analytics (DS2004).
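For readers unfamiliar with the two algorithms named above, the following is a minimal in-memory sketch of how A-Priori and PCY find frequent pairs. It is not the repository's code and leaves out Kafka entirely; the function names, the `n_buckets` parameter, the CRC32 bucket hash, and the toy baskets are all assumptions made for illustration.

```python
import zlib
from collections import Counter
from itertools import combinations


def apriori_pairs(baskets, min_support):
    """Two-pass A-Priori: return frequent pairs with their counts."""
    # Pass 1: count individual items (each item once per basket).
    item_counts = Counter(i for basket in baskets for i in set(basket))
    frequent = {i for i, c in item_counts.items() if c >= min_support}

    # Pass 2: count only pairs whose members are both frequent.
    pair_counts = Counter()
    for basket in baskets:
        kept = sorted(i for i in set(basket) if i in frequent)
        pair_counts.update(combinations(kept, 2))
    return {p: c for p, c in pair_counts.items() if c >= min_support}


def _bucket(pair, n_buckets):
    # Hypothetical hash choice: CRC32 of the pair's repr, for determinism.
    return zlib.crc32(repr(pair).encode("utf-8")) % n_buckets


def pcy_pairs(baskets, min_support, n_buckets=101):
    """PCY: like A-Priori, but pass 1 also hashes every pair to a bucket."""
    item_counts = Counter()
    bucket_counts = [0] * n_buckets
    for basket in baskets:
        items = sorted(set(basket))
        item_counts.update(items)
        for pair in combinations(items, 2):
            bucket_counts[_bucket(pair, n_buckets)] += 1

    frequent = {i for i, c in item_counts.items() if c >= min_support}
    # Between passes, the bucket counts shrink to one bit per bucket.
    bitmap = [c >= min_support for c in bucket_counts]

    # Pass 2: a pair is a candidate only if both items are frequent
    # AND it hashed to a frequent bucket.
    pair_counts = Counter()
    for basket in baskets:
        kept = sorted(i for i in set(basket) if i in frequent)
        for pair in combinations(kept, 2):
            if bitmap[_bucket(pair, n_buckets)]:
                pair_counts[pair] += 1
    return {p: c for p, c in pair_counts.items() if c >= min_support}


# Toy data, invented for this example.
baskets = [
    ["milk", "bread"],
    ["milk", "bread", "eggs"],
    ["milk", "eggs"],
    ["bread", "butter"],
]
print(apriori_pairs(baskets, min_support=2))
print(pcy_pairs(baskets, min_support=2))
```

Both functions return the same frequent pairs. PCY differs only in using the bucket bitmap to discard candidate pairs before counting them in the second pass, which matters when the candidate pairs would not otherwise fit in memory.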