Parse IMDB data and plot average movie runtime by country

ManavalanG/imdb_movie_runtime


imdb_movie_runtime

Task:

  • Determine average movie runtime by country from IMDB data and plot it on a map.
  • Do the same for Indian movies by language.

Steps

  1. Obtain IMDB data in list format (Files obtained: countries, language, movies and running-times)
  2. Use an in-house modified imdb2json.py together with convert_IMDB_list_to_json.py to convert the IMDB list format to JSON. Note: the modified imdb2json.py is not included here because I am not certain about its license terms
  3. Convert JSON data to TSV format using convert_IMDB_json_to_tsv.py
  4. Combine TSV data obtained from multiple IMDB files using analyze_imdb_data.ipynb and determine mean movie runtime by country
  5. Pair the data with geo-coordinates from developers.google.com and this GitHub repo
  6. Plot the result on a map using plot_imdb_runtime_on_map.ipynb
  7. Plot average movie runtime by language on a map of India using plot_india_by_language.ipynb. Map source: ArcGIS
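
Steps 4 and 5 above can be sketched roughly as follows. This is only an illustration using the Python standard library, not the code in analyze_imdb_data.ipynb. The column names (`title`, `country`, `runtime_min`, `latitude`, `longitude`), the sample rows, and the helper functions are all assumptions made for the example; the real TSV files may be laid out differently.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical TSV contents standing in for the converted IMDB files.
# Column names and rows are illustrative assumptions, not real data.
COUNTRIES_TSV = (
    "title\tcountry\n"
    "Movie A\tIndia\nMovie B\tIndia\nMovie C\tFrance\nMovie D\tFrance\n"
)
RUNTIMES_TSV = (
    "title\truntime_min\n"
    "Movie A\t150\nMovie B\t170\nMovie C\t95\nMovie E\t120\n"
)
# Approximate country centroids, included only to show the join.
COORDS_TSV = (
    "country\tlatitude\tlongitude\n"
    "India\t20.59\t78.96\nFrance\t46.23\t2.21\n"
)


def read_tsv(text):
    """Parse tab-separated text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def mean_runtime_by_country(country_rows, runtime_rows):
    """Join the two tables on title and average the runtime per country."""
    runtime_by_title = {
        row["title"]: float(row["runtime_min"]) for row in runtime_rows
    }
    runtimes = defaultdict(list)
    for row in country_rows:
        runtime = runtime_by_title.get(row["title"])
        if runtime is not None:  # skip movies with no known runtime
            runtimes[row["country"]].append(runtime)
    return {country: mean(values) for country, values in runtimes.items()}


def attach_coordinates(means, coord_rows):
    """Pair each country's mean runtime with its latitude and longitude."""
    coords = {
        row["country"]: (float(row["latitude"]), float(row["longitude"]))
        for row in coord_rows
    }
    return [
        {
            "country": country,
            "mean_runtime_min": value,
            "latitude": coords[country][0],
            "longitude": coords[country][1],
        }
        for country, value in sorted(means.items())
        if country in coords  # drop countries without coordinates
    ]


means = mean_runtime_by_country(read_tsv(COUNTRIES_TSV), read_tsv(RUNTIMES_TSV))
plot_rows = attach_coordinates(means, read_tsv(COORDS_TSV))
for row in plot_rows:
    print(row)
```

Movies without a runtime entry (Movie D here) are left out of the average, and runtime entries with no country (Movie E) are ignored. Whether the notebook treats missing values the same way is not stated in this README.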

Results

Figure: Average movie runtime by country

Interactive version of this chart is available at plot.ly

Figure: Average Indian movie runtime by language

Scatter plot version of the world movies chart (figure: Average movie runtime by country, scatter plot)

Interactive version of this chart is available at plot.ly

License

Code in this repository is distributed under the MIT license.
