forked from adinaspertus/imdb

blue-create/imdb

 
 

Data analysis project "Introduction to Data Science"

This data analysis proposes models for predicting film quality from selected metadata attributes and the text of each film's IMDb description. We built a topic model from the descriptions of films from past years, then used its topics together with other variables to develop models for predicting a film's rating. Finally, we tested the model's validity on a set of freshly scraped 2020 films.
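
The validation step described above (fit on films from past years, evaluate on the 2020 films) can be illustrated with a minimal sketch. This is not the project's code: the choice of Python, the field names (`year`, `topic_share`, `rating`), and all values are made up for illustration. The `topic_share` column is assumed to have been produced already by a topic model, and a single-predictor least-squares fit stands in for whatever fuller models the analysis used.

```python
from math import sqrt
from statistics import mean

# Toy records with invented values. "topic_share" stands for one topic
# proportion from an already fitted topic model (hypothetical field name).
films = [
    {"year": 2017, "topic_share": 0.10, "rating": 5.9},
    {"year": 2018, "topic_share": 0.40, "rating": 6.8},
    {"year": 2019, "topic_share": 0.75, "rating": 7.4},
    {"year": 2020, "topic_share": 0.30, "rating": 6.1},
    {"year": 2020, "topic_share": 0.60, "rating": 7.3},
]


def split_by_year(rows, test_year):
    """Train on films released before test_year, test on films from test_year."""
    train = [r for r in rows if r["year"] < test_year]
    test = [r for r in rows if r["year"] == test_year]
    return train, test


def fit_simple_regression(rows, feature, target="rating"):
    """Closed-form ordinary least squares with a single predictor."""
    xs = [r[feature] for r in rows]
    ys = [r[target] for r in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def rmse(rows, intercept, slope, feature, target="rating"):
    """Root mean squared error of the fitted line on the given rows."""
    squared = [(intercept + slope * r[feature] - r[target]) ** 2 for r in rows]
    return sqrt(mean(squared))


train, test = split_by_year(films, test_year=2020)
intercept, slope = fit_simple_regression(train, "topic_share")
print("held-out RMSE:", rmse(test, intercept, slope, "topic_share"))
```

Splitting by release year rather than at random keeps the 2020 films entirely out of the fitting step, which mirrors the idea of checking the model against newly scraped data.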
