Antisemitic Tracing on Reddit: An Intersection Approach Toward Dissecting Ideologies

The domain-specific communities thatform in subreddits may seem at first glanceto cohere around the topical label withwhich each subgroup is named. However,such a surface-level orientation may notbe as transparent in framing what is dis-cussed and how. Particularly in detectinghate speech and other potentially harmfuldiscussions, such analysis requires morerobust methods to detect and understandthe emergence of ideologies. In this study,we present an intersection method of de-termining likelihood of harmful discus-sions. Our methodology replicates suc-cessful detection methods from previousstudies and suggests opportunity for morediscrete analysis.

Data Source

The data we used are already extracted and saved in the data folder. The origianl data were the compressed files on the linux sever gh.luddy.indiana.edu. To replicate the extracting, login to the server and run Anti-Semitics_data_extract.py.

How-to

Environment

python3 -m venv <env_name>
source <env_name>/bin/activate
pip install -r requirements.txt

Execution

git clone https://github.com/dzcyb0rg68/reddit-mining.git
cd reddit-mining
python Anti-Semitics_data_extract.py

Please note that the original data files have 204.4GB and contain 793,326,999 reddit posts. The extracting may take a up to 24 hours to complete. To speed up, please contract the authors for parallel extracting support.

Questions?

Please reach out to the author Chang at cc93@iu.edu or Michalek at jasomich@iu.edu

Paper Preview

Your browser does not support PDFs. Please download the PDF to view it: Download PDF.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
dashboard		dashboard
data		data
image		image
report		report
Anti-Semitics_data_extract.py		Anti-Semitics_data_extract.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Antisemitic Tracing on Reddit: An Intersection Approach Toward Dissecting Ideologies

Data Source

Questions?

Paper Preview

About

Languages

dzcyb0rg68/reddit-mining

Folders and files

Latest commit

History

Repository files navigation

Antisemitic Tracing on Reddit: An Intersection Approach Toward Dissecting Ideologies

Data Source

Questions?

Paper Preview

About

Topics

Resources

Stars

Watchers

Forks

Languages