Analysis of the AI publishing landscape. See the notebook in the repository.
To run the notebook and script:
Create a new anaconda environment via conda:
conda create -n env anaconda
Activate it (with conda 4.4 or later, conda activate env is the equivalent command):
source activate env
To run the notebook:
jupyter notebook
Extras:
Download arXiv data into data.jsonlines (takes about 1 hour):
python scrape.py > data.jsonlines
Download Guardian data into guardian.jsonlines (you will need to insert your API key into creds_guardian.txt):
python guardian.py > guardian.jsonlines
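Once the downloads finish, the output files can be inspected from Python. The following is a minimal sketch, assuming each non-empty line of a .jsonlines file holds one complete JSON value; the helper name read_jsonlines is hypothetical and not part of this repository, and the fields inside each record depend on what scrape.py and guardian.py actually emit.

```python
import json
from pathlib import Path


def read_jsonlines(path):
    """Return a list with one parsed JSON value per non-empty line of the file.

    Hypothetical helper: assumes the file is UTF-8 and line-delimited JSON.
    """
    records = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records


if __name__ == "__main__":
    # Report how many records each downloaded file contains, if it exists.
    for name in ("data.jsonlines", "guardian.jsonlines"):
        if Path(name).exists():
            print(name, len(read_jsonlines(name)), "records")
```

Reading line by line keeps a partially written or interrupted download usable up to its last complete line, although a truncated final line would still raise json.JSONDecodeError.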