My Spotify 🎵

Overview

I built a Python pipeline to collect my Spotify listening activity every day. To gain insights from the data, I made a web app with an HTML/CSS/JavaScript front end and d3.js visualizations.

Tags

Data collection, data pipeline, Spotify, interactive data visualizations, Python, JavaScript (d3.js), web app

Data collection

Spotify does not give access to your full listening history: its API only returns your last 50 tracks. There is a "download your data" option on Spotify's Privacy Settings page, but it may take up to 30 days to complete. Our solution was to link a Last.fm account to Spotify, since Last.fm tracks (and stores) Spotify streaming activity in real time.
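As a rough illustration of that workaround (not the actual get_recent_tracks.py; the API key and username below are placeholders), Last.fm's user.getRecentTracks endpoint returns each scrobble with its track, artist, album, and timestamp:

```python
import requests

API_KEY = "YOUR_LASTFM_API_KEY"    # placeholder
USERNAME = "your_lastfm_username"  # placeholder

# user.getRecentTracks returns scrobbles in reverse-chronological order.
resp = requests.get(
    "https://ws.audioscrobbler.com/2.0/",
    params={
        "method": "user.getrecenttracks",
        "user": USERNAME,
        "api_key": API_KEY,
        "format": "json",
        "limit": 200,  # maximum page size
    },
    timeout=30,
)
resp.raise_for_status()

for t in resp.json()["recenttracks"]["track"]:
    # A currently-playing track has no "date" field yet.
    played_at = t.get("date", {}).get("#text", "now playing")
    print(played_at, "|", t["artist"]["#text"], "-", t["name"])
```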

Our data collection pipeline is as follows (illustrative sketches of the main API calls follow the list):

  1. create_backup.py
  • Create a backup of the most recent data files.
  2. get_recent_tracks.py
  • Use the Last.fm API to get my recent Spotify listening history (track, artist, album, date).
  3. spotify_authentication.py
  • Use Selenium to automate the Spotify OAuth process (optional). Otherwise, authenticate manually and update config.py.
  4. get_spotify_track_IDs.py
  • Use the Spotify API to search each track + artist + album combination and get its track ID and the artist ID(s) of all featured artist(s). Merge with the data from step 2.
  5. get_spotify_genres.py
  • Use the Spotify API to get the genres associated with each artist, using the artist IDs from step 4. Merge with the data from step 4.
  6. get_spotify_audio_features.py
  • Use the Spotify API to get the audio features for each track ID from step 4. Merge with the data from step 5.
  7. create_app_data.py
  • Compute aggregate statistics from the merged data of step 6, such as the number of daily plays, and generate the data files for the visualizations.
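For concreteness, here is a minimal sketch of the Spotify API calls behind steps 4-6 using the spotipy client (the actual scripts may call the Web API directly and use a different auth flow; the credentials and search query below are placeholders):

```python
import spotipy
from spotipy.oauth2 import SpotifyClientCredentials

# Client-credentials auth is enough for search, artist, and audio-feature lookups.
sp = spotipy.Spotify(
    auth_manager=SpotifyClientCredentials(
        client_id="YOUR_CLIENT_ID",          # placeholder
        client_secret="YOUR_CLIENT_SECRET",  # placeholder
    )
)

# Step 4: search the track + artist + album combination for its Spotify IDs.
result = sp.search(q="track:Some Song artist:Some Artist album:Some Album",
                   type="track", limit=1)
track = result["tracks"]["items"][0]
track_id = track["id"]
artist_ids = [a["id"] for a in track["artists"]]

# Step 5: genres are attached to artists, not tracks.
genres = sorted({g for aid in artist_ids for g in sp.artist(aid)["genres"]})

# Step 6: audio features (danceability, energy, valence, ...) for the track.
features = sp.audio_features([track_id])[0]

print(track["name"], genres, features["danceability"])
```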

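The aggregation in step 7 can then be a simple pandas group-by; the file and column names here are assumptions rather than those used by create_app_data.py:

```python
import pandas as pd

# One row per play, with the columns accumulated in steps 2-6.
df = pd.read_csv("merged_history.csv", parse_dates=["played_at"])  # illustrative names

# Daily play counts, one of the aggregate statistics fed to the web app.
daily_plays = (
    df.assign(date=df["played_at"].dt.strftime("%Y-%m-%d"))
      .groupby("date")
      .size()
      .reset_index(name="plays")
)

daily_plays.to_json("daily_plays.json", orient="records")
```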
TO DO

  • Improve UI (grid layout)
  • Improve tooltip UI on heatmap (better out-of-bounds checking)
  • Idea: use a database (SQL) instead of working entirely with .csv files. This isn't really needed for the small amount of data we're collecting, but it's good practice. We may also schedule the pipeline with cron or Airflow and host it on AWS. A sketch of the CSV-to-SQLite idea follows this list.
    • I've started implementing this.
  • Idea: show recent tracks/genres/artists (last 5, 10) in addition to "top".
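A minimal sketch of what the CSV-to-SQLite move could look like (the table, file, and column names are placeholders, not the in-progress implementation):

```python
import sqlite3

import pandas as pd

# Append the merged CSV to a local SQLite database so downstream scripts
# can query plays instead of re-reading flat files.
df = pd.read_csv("merged_history.csv")  # illustrative file name
with sqlite3.connect("my_spotify.db") as conn:
    df.to_sql("plays", conn, if_exists="append", index=False)

    # Example query: plays per day, equivalent to the pandas aggregation above.
    daily = pd.read_sql_query(
        "SELECT date(played_at) AS date, COUNT(*) AS plays "
        "FROM plays GROUP BY date(played_at) ORDER BY date",
        conn,
    )

print(daily.head())
```

Scheduling would then amount to a single cron (or Airflow) job that runs the pipeline scripts in order.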
