Twitter + Python: Tweet Analysis on Trump, Obama and Biden

Project Intro/Objective

The purpose of this project is to visualise observations made through analysing scraped tweet level data from 2008-2020 for Barack Obama, Joe Biden and Donald Trump. The 'Exploratory_analysis_twitter_obama_trump_biden.ipynb' details how I was able to generate a WordCloud fitted to the shape of Obama, Trump and Biden

Blog for detailed writeup

https://medium.com/@hitennaran

Methods Used

Data Visualization
WordCloud generator
Web Scraping

Technologies

Python
GetOldTweets3
PIL
json
wordcloud
pandas
numpy
matplotlib
seaborn
glob
csv
time
re
datetime

Project Description

The project is broken down into 7 key sections within the "Exploratory_analysis_twitter_obama_trump_biden.ipynb" workbook:

Scraping tweet level data from Obama, Trump and Biden's Twitter feeds via the "GetOldTweet3" library. Given the limitations of only being able to obtain up to 3,200 tweets via basic Twitter API access. Working with the 'GetOldTweets3' library is a useful hack for scraping an inifinite amount of tweets as we're able to obtain the neccessary tweet data through web scraping the twitter user feeds versus accessing through an API connection.
Assessing the data in order to identify what cleaning steps are required.
Cleaning the dataset in order to make it fit for conducting exploratory analysis.
Exploratory analysis into the dataset and uncover learnings. This is where I produce a WordCloud fitted to the shape of Obama, Trump and Biden
Export cleaned DataFrame to a GoogleSheet.
Export DataFrame to csv for DataStudio usage.
Export original cleaned Dataset.

Getting Started

Will need to pip install the following libraries in order to scrape tweets and generate a wordcloud.

Relevant files

Web scraping, cleaning and exploratory analysis: "Exploratory_analysis_twitter_obama_trump_biden.ipynb"
Explanatory analysis (Jupyter Notebook and HTML Slidedeck) : "Explanatory_analysis_twitter_obama _trump_biden.ipynb", "Explanatory_analysis_twitter_obama _trump_biden.slides.html"
Clean csv file for Jupyter Notebook usage: "biden_trump_obama_clean_2008_2020_original"
Clean csv file for Google Sheets usage: "biden_trump_obama_clean_2008_2020_gspread"
Clean csv file for DataStudio usage: "biden_trump_obama_clean_2008_2020_datastudio"

References

Following blog served extremely useful in providing an overview on how to extrapolate tweet level data working with the 'GetOldTweets3' library

https://medium.com/@AIY/getoldtweets3-830ebb8b2dab

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Explanatory_analysis_twitter_obama _trump_biden.ipynb		Explanatory_analysis_twitter_obama _trump_biden.ipynb
Explanatory_analysis_twitter_obama _trump_biden.slides.html		Explanatory_analysis_twitter_obama _trump_biden.slides.html
Exploratory_analysis_twitter_obama_trump_biden.ipynb		Exploratory_analysis_twitter_obama_trump_biden.ipynb
README.md		README.md
biden_trump_obama_clean_2008_2020_datastudio.csv		biden_trump_obama_clean_2008_2020_datastudio.csv
biden_trump_obama_clean_2008_2020_gspread.csv		biden_trump_obama_clean_2008_2020_gspread.csv
biden_trump_obama_clean_2008_2020_original.csv		biden_trump_obama_clean_2008_2020_original.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter + Python: Tweet Analysis on Trump, Obama and Biden

Project Intro/Objective

Blog for detailed writeup

Methods Used

Technologies

Project Description

Getting Started

Relevant files

References

About

Releases

Packages

Languages

hiten-naran/Twitter-Python-Tweet-Analysis-Trump-Biden-Obama-2008-2020

Folders and files

Latest commit

History

Repository files navigation

Twitter + Python: Tweet Analysis on Trump, Obama and Biden

Project Intro/Objective

Blog for detailed writeup

Methods Used

Technologies

Project Description

Getting Started

Relevant files

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages