Skip to content

Movie and TV show script database and web site for Norman Lear Center

License

Notifications You must be signed in to change notification settings

srikrishna98/nlc-script-database

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Norman Lear Center Script Database

Overview

The Norman Lear Center (NLC) is a multi-disciplinary research and public policy center exploring implications of the convergence of entertainment, commerce, and society and is based at the USC Annenberg School for Communication. One of NLC's current research projects (as of summer/fall 2019) involves analyzing the portrayal of philanthropy in TV shows and movies in the past decade. As part of this research, NLC wished to search through scripts for key words and phrases and explore the contexts in which they are used.

To aid in this research, a PostgreSQL database was created to hold script data scraped from Springfield, Springfield using the Beautiful Soup Python package. A Django web site was created to perform the searches and export information as CSV files, and the database model was defined in the project. TV/Movie titles, scripts, and release year information was scraped from the web and temporarily stored as text files, then migrated to the database with the Psycopg2 Python package.

The final product was deployed to Amazon Web Services as an Elastic Beanstalk application using the EB CLI utility.

Screenshots

Login

Login

Search Results

Results

About

Movie and TV show script database and web site for Norman Lear Center

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 49.3%
  • Python 46.8%
  • CSS 2.7%
  • PLpgSQL 1.2%