This repository includes 5 major Europe leagues:
- English Premier League – https://datahub.io/core/english-premier-league
- Spanish La Liga – https://datahub.io/core/spanish-la-liga
- Italian Serie A – https://datahub.io/core/italian-serie-a
- German Bundesliga – https://datahub.io/core/german-bundesliga
- French Ligue 1 – https://datahub.io/core/french-ligue-1
Each league has data for the all the seasons. The data is updated on monthly basis via Github-Actions
The data is sourced from the https://www.football-data.co.uk/
website, datasets range starts from 1993 up to current year
You need to have Python version >=3.5:
- Install requirements using
pip install -r scripts/requirements.txt
- Run the script
python scripts/process.py
- Update datapackage
pyhton scripts/process.py
Up-to-date (auto-updates every month) football dataset could be found on the datahub.io: https://datahub.io/core/football-datasets
Each directory in datasets/
directory is a data package. It has a common schema.json
for all its resources. You need to run python package.py
from root directory to generate datapackage.json
for each data package.
This Data Package is made available under the Public Domain Dedication and License v1.0 whose full text can be found at: http://www.opendatacommons.org/licenses/pddl/1.0/