Data-Extraction-and-NLP

Data-Extraction-and-NLP (TASK 1): For each article listed in Input.xlsx, extract the article text and save it to a text file named after the article's URL_ID.
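
The extraction step could be sketched roughly as follows, using only the Python standard library. This is an illustration under assumptions, not the repository's actual code: the names `ParagraphExtractor`, `extract_article_text`, and `save_article`, the choice of `<h1>` and `<p>` as the article-bearing tags, and the `articles` output folder are all hypothetical. Downloading each page and reading the URL_ID and URL columns from Input.xlsx are left out, since they depend on the spreadsheet layout and the HTTP/Excel libraries in use.

```python
from html.parser import HTMLParser
from pathlib import Path


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside <h1> and <p> elements (assumed article tags)."""

    def __init__(self):
        super().__init__()
        self._depth = 0     # > 0 while inside an <h1> or <p> element
        self._chunks = []   # raw text pieces of the element being read
        self.blocks = []    # finished, whitespace-normalized text blocks

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "p"):
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in ("h1", "p") and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                # Join the pieces and collapse runs of whitespace.
                text = " ".join("".join(self._chunks).split())
                if text:
                    self.blocks.append(text)
                self._chunks = []

    def handle_data(self, data):
        if self._depth > 0:
            self._chunks.append(data)


def extract_article_text(html):
    """Return the title and paragraph text of an HTML page, one block per line."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.blocks)


def save_article(url_id, text, out_dir="articles"):
    """Write the text to <out_dir>/<url_id>.txt and return the file path."""
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    target = out_path / f"{url_id}.txt"
    target.write_text(text, encoding="utf-8")
    return target
```

For each row of the input file, the downloaded HTML would be passed to `extract_article_text` and the result to `save_article` together with that row's URL_ID. Real article pages often need a site-specific selector to exclude menus and footers, so the tag filter here is only a starting point.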

Data-Extraction-and-NLP (TASK 2): For each extracted article text, perform textual analysis and compute the variables defined in the output structure Excel file.
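
As a rough illustration of the analysis step, the sketch below computes a few simple text statistics. The variables required by the output structure file are not listed in this README, so the four metrics here (word count, sentence count, average sentence length, average word length) and the function name `analyze_text` are assumptions chosen for the example; the actual variable set should be taken from the output structure file.

```python
import re


def analyze_text(text):
    """Compute a few illustrative text statistics (assumed, not the official variable list)."""
    # Split on runs of sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # A word is a run of letters, optionally followed by an apostrophe suffix (e.g. "don't").
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)

    word_count = len(words)
    sentence_count = len(sentences)
    return {
        "word_count": word_count,
        "sentence_count": sentence_count,
        "avg_sentence_length": word_count / sentence_count if sentence_count else 0.0,
        "avg_word_length": sum(len(w) for w in words) / word_count if word_count else 0.0,
    }
```

Running `analyze_text` on the contents of each saved text file yields one dictionary per article, which can then be written out as one row per URL_ID in the layout the output structure file prescribes.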