Import public NYC taxi and for-hire vehicle (Uber, Lyft, etc.) trip data into PostgreSQL database

ldocao/nyc-taxi-data-postgres

 
 


nyc-taxi-data

Originally from https://github.com/toddwschneider/nyc-taxi-data

You can go directly to step 4 if you already have normalized and denormalized datasets.

  1. Import the raw trip data into PostgreSQL with run.sh.
  2. Delete the raw data, because it takes up a lot of disk space. Then create the normalized and denormalized datasets for other technologies with output*.sql.
  3. Upload all the files to GCS.
  4. Delete the VM and reinitialize PostgreSQL with only the normalized/denormalized datasets. This is useful if you want a smaller hard drive and want to be able to stop/start the VM without losing everything.
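
Steps 1 to 3 could be chained roughly as in the sketch below. It is only an illustration under several assumptions: the database name, the raw data directory, the export directory, and the bucket URL are hypothetical placeholders, run.sh is assumed to take no arguments, and the output*.sql scripts are assumed to write their results into the export directory. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of steps 1-3. Every name below is an assumption, not taken from the repository.
set -eu

DB_NAME="${DB_NAME:-nyc-taxi-data}"          # assumed database name
RAW_DIR="${RAW_DIR:-data}"                   # assumed location of the raw downloads
EXPORT_DIR="${EXPORT_DIR:-export}"           # hypothetical directory written by output*.sql
BUCKET="${BUCKET:-gs://my-bucket/nyc-taxi}"  # hypothetical GCS destination
DRY_RUN="${DRY_RUN:-1}"                      # 1 = print commands only, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

pipeline() {
  # Step 1: import the raw data into PostgreSQL.
  run ./run.sh

  # Step 2: free disk space, then build the normalized/denormalized datasets.
  run rm -rf "$RAW_DIR"
  for sql in output*.sql; do
    [ -e "$sql" ] || continue   # skip the literal pattern when nothing matches
    run psql -d "$DB_NAME" -f "$sql"
  done

  # Step 3: upload the exported files to GCS.
  run gsutil -m cp -r "$EXPORT_DIR" "$BUCKET"
}

pipeline
```

Run it once as is to review the printed commands, then rerun with DRY_RUN=0 to execute them. Because step 2 deletes the raw data, it is worth checking the RAW_DIR value first.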
