Skip to content

Latest commit

 

History

History
33 lines (24 loc) · 2.16 KB

README.md

File metadata and controls

33 lines (24 loc) · 2.16 KB

imdb_movie_runtime

Task:

  • Determine average movie runtime by country from IMDB data and plot it on a map.
  • Do the same with for Indian movies by language.

Steps

  1. Obtain IMDB data in list format (Files obtained: countries, language, movies and running-times)
  2. Use in-house modified imdb2json.py with convert_IMDB_list_to_json.py to convert IMDB list format to JSON format. Note: This modified imdb2json.py is not made available here as I am not certain about its license terms
  3. Convert JSON data to TSV format using convert_IMDB_json_to_tsv.py
  4. Combine TSV data obtained from multiple IMDB files using analyze_imdb_data.ipynb and determine mean movie runtime by country
  5. Pair the data with geo-coordinates from developers.google.com and this github repo
  6. Plot it on map using plot_imdb_runtime_on_map.ipynb
  7. Plot average length of movies in Indian map by language using plot_india_by_language.ipynb. Map source: ArcGIS

Results

Average movie length by country Average movie runtime by country

Interactive version of this chart is available at plot.ly

Average Indian movie length by language Average Indian movie runtime by language

Scatter plot version of the world movies chart: Average movie runtime by country - Scatter plot

Interactive version of this chart is available at plot.ly

License

Code in this repository is distributed under MIT license.