Skip to content

asross/babynames

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

90 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Popular Baby Names in the US

This repo is for a data visualization of baby name popularity from the US Office of Social Security. It contains the code necessary to scrape/parse the baby name popularity data from the ssa.gov website, analyze and augment it with extra features like similar names, and then expose it as a webpage with charts using d3.

To Get The Data

The easiest way to get the data is to go to the visualization website and download it using the links at the very end of the page. That way, you will be able to get it as JSON in a hopefully convenient format. If you'd like to generate it all from scratch, though, you can clone this repository, run ruby download.rb (assuming you have Ruby and curl installed), and it will re-download the data by scraping the Office of Social Security's website. Then you can run ruby parse.rb, which will parse those HTML pages, compute the closest names, and then compress everything into a format optimized for sending to the browser. At that point, you can cd into visualization/ and run the visualization Middleman app to see it in your local browser.