Skip to content

Product scraping from Walmart Canada website, with further cleaning and integration of data from a different store.

Notifications You must be signed in to change notification settings

MicahWallace/scrapy-walmart

 
 

Repository files navigation

Web Crawler + Data Ingestion

A python script to obtain product data and store availability from Walmart Canada website for specific categories and branches. It also digests and integrates additional data to the database produced by the scraper.

Packages

  • Scrapy
  • Pandas
  • SQLAlchemy

Execution Workflow

  • Run spider.py within product_scraping folder:
    spider crawl ca_walmart
    
  • Run ingestion.py within additional_data folder:
    python ingestion.py
    

About

Product scraping from Walmart Canada website, with further cleaning and integration of data from a different store.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%