Skip to content

In the present day, the entertainment industry is constantly evolving toward making the most enjoyable and profitable sources of film entertainment. Through the use of movie rating sites, we can now decide whether or not it is worth the trip to the movie theatre to watch a partiuclar film. With this in mind, I wanted to explore what aspects of m…

Notifications You must be signed in to change notification settings

feng-jj/movie-script-IMDb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Movie Script Analyzer

A webscraping script that fetches all the movie title data for each of the 18 genres on IMDb. Specifically gets each title's rating and box office data, as well as performs sentiment analysis on the movie script using the NRC Lexicon in order to get the % of the script that expresses 10 different emotions and sentiments.

The data is then exported to a CSV file where a correlation analysis is then done manually with each metric and the rating/box office data.

How To Use

Just run main.py! The program will take a long time due to the sheer amount of data we are working with (200,000+ titles and each of their movie scripts), so it would be best to comment out and run one IMDb object at a time.

References

https://www.imsdb.com/

https://monkeylearn.com/sentiment-analysis/

http://saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm

http://www.nltk.org/book/

About

In the present day, the entertainment industry is constantly evolving toward making the most enjoyable and profitable sources of film entertainment. Through the use of movie rating sites, we can now decide whether or not it is worth the trip to the movie theatre to watch a partiuclar film. With this in mind, I wanted to explore what aspects of m…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages