This is my capstone project for the Data Science Immersive at General Assembly.
In this project, my goals are:
- Set up a real-time data collection process and data infrastructure
- Examine different natural language processing (NLP) tools on the collected tweets
- Create an A/B testing model for similarity comparison
- Use time series modeling to catch trends
- Tune hyperparameters to improve the models
To test my framework, I:
- Collected and cleaned over 1.5 million tweets using the Twitter Streaming API
  (`/lib/get_tweets.py`)
- Created scheduled and on-demand LSA processing for text vectorization
  (`/ipynb/01_Fit_pipeline_TfiDf_SVD.ipynb`)
- Performed event and trend detection using cosine similarity and ARIMA modeling:
  - Event extraction using TF-IDF and SVD (`/ipynb/03_Tweets_Modeling_CosineSim_AB_Test_SVD.ipynb`)
  - Hashtag time series modeling (`/ipynb/05_Hashtags_Modeling_WhatsTrending.ipynb`)
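Before vectorization, raw tweet text has to be normalized. The sketch below shows the kind of cleaning step involved; the function name and exact rules are illustrative assumptions, not the actual logic in `/lib/get_tweets.py`:

```python
import re

# Hypothetical cleaning rules: lowercase, strip URLs and @mentions,
# keep hashtags, drop other punctuation. The real pipeline may differ.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
NON_ALNUM_RE = re.compile(r"[^a-z0-9#\s]")

def clean_tweet(text: str) -> str:
    """Normalize one tweet's text for downstream vectorization."""
    text = text.lower()
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = NON_ALNUM_RE.sub(" ", text)
    return " ".join(text.split())

print(clean_tweet("Check this out @user https://t.co/abc #DataScience!!"))
# check this out #datascience
```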
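The LSA step (TF-IDF followed by SVD) can be sketched as a scikit-learn pipeline. The component count and vectorizer settings here are placeholders, not the values used in `01_Fit_pipeline_TfiDf_SVD.ipynb`:

```python
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD

# Minimal LSA sketch: TF-IDF term weighting, then truncated SVD to
# project tweets into a low-dimensional topic space.
lsa = Pipeline([
    ("tfidf", TfidfVectorizer(stop_words="english")),
    ("svd", TruncatedSVD(n_components=2, random_state=42)),
])

docs = [
    "nlp on streaming tweets",
    "tweets about machine learning",
    "time series of hashtag counts",
]
vectors = lsa.fit_transform(docs)
print(vectors.shape)  # (3, 2): one 2-d vector per document
```

Fitting the pipeline once on a reference corpus lets both scheduled and on-demand jobs transform new batches of tweets into the same vector space.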
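The similarity comparison behind the A/B test can be illustrated as follows: score a new batch of tweet vectors against a baseline batch, where a drop in similarity hints at a new event. The vectors below are toy LSA outputs, not real data:

```python
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

# Toy example: two baseline tweet vectors and two new ones.
baseline = np.array([[1.0, 0.1], [0.9, 0.2]])
new_batch = np.array([[0.1, 1.0], [0.95, 0.15]])

# Pairwise cosine similarities: rows = new vectors, cols = baseline.
sims = cosine_similarity(new_batch, baseline)

# Best baseline match per new vector; a low mean suggests novel content.
mean_sim = sims.max(axis=1).mean()
print(round(mean_sim, 3))
```

Here the first new vector is nearly orthogonal to the baseline (a candidate "event"), while the second closely matches it, so the mean best-match similarity lands in between.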
Tools and techniques used:
- Python
- Twitter Streaming API
- PostgreSQL
- Redis
- NLP (LSA): spaCy | TF-IDF | SVD | CountVectorizer
- Cosine similarity
- ARIMA modeling