This project can take in a list of URLS in filePaths.json and create an inverted index with TFIDF scores, cached page content, and metadata. Data will need to be stored in a database which can be linked in credentials.py (MongoDB recommended).
This was a collaborative project with equal contributions from Danielle, Gayatri, and Greta.