This project uses data from various sources that are openly licensed or in the public domain. Below are the sources and their respective information:
Description: A .txt
file provided by Timid Robot containing all legal
tool paths.
API documentation link:
google_custom_search/legal-tool-paths.txt
: a list of all current Creative Commons (CC) legal tool pathsdata/prioritized-tool-urls.txt
: a prioritized list of all current CC legal tool URLs
API information:
- No API key required
- No query limits
Description: With over 5 billion photos (many with valuable metadata such as tags, geolocation, and Exif data), the Flickr community creates wonderfully rich data. The Flickr API is how you can access that data. In fact, almost all the functionality that runs flickr.com is available through the API. (Flickr: The Flickr Developer Guide)
API documentation link:
API information:
- API key required
- Query limit: 3600 requests per hour
- Data available through CSV format
Description: A development platform for hosting and managing code.
API documentation link:
API information:
- API key not required but recommended by GitHub
- Query limit: 60 requests per hour if unauthenticated, 5000 requests per hour if authenticated
- Data available through JSON format
Description: The Custom Search JSON API allows user-defined detailed query and access towards related query data using a programmable search engine.
Admin links:
API documentation links:
- Custom Search JSON API Reference | Programmable Search Engine | Google Developers
- Google API Python Client Library
- Google API Client Library for Python Docs |
google-api-python-client
- Reference documentation for the core library
googleapiclient.
- See: googleapiclient.discovery > build
- Library reference documentation by API
- See Custom Search v1 cse()
- Reference documentation for the core library
googleapiclient.
- Google API Client Library for Python Docs |
google-api-python-client
- Method: cse.list | Custom Search JSON API | Google Developers
- XML API reference appendices
API information:
- API key required
- Query limit: 100 queries per day
- Data available through JSON format
Notes:
- The data from Google Custom Search will only cover 50+ general, most significant categories of CC License for data collection quota constraint. As an additional note, the order of precedence of license the collected data's first column is sorted due to intermediate data analysis progress.
Description: A python interface to archive.org to achieve API requests towards internet archive.
API documentation link:
API information:
- No API key required
- No query limits
Description: The MediaWiki Action API is a web service that allows access to some wiki features like authentication, page operations, and search. It can provide meta information about the wiki and the logged-in user. (API:Main page - MediaWiki)
API documentation link:
API information:
- No API key required
- Query limit: depends on user status and request type
- Data available through XML or JSON format
Description: The Met’s Open Access datasets are available through our API. The API (RESTful web service in JSON format) gives access to all of The Met’s Open Access data and to corresponding high resolution images (JPEG format) that are in the public domain. (The Metropolitan Museum of Art Collection API)
API documentation link:
API information:
- No API key required
- 80 queries per second
Description: The Vimeo API allows users to perform filtered, advanced search on Vimeo videos.
API documentation link:
API information:
- API key required
- Query limit: 5000 authenticated requests per day
- Data available through JSON format
Description: An API from YouTube for platform users to upload videos, adjust video parameters, and obtain search results.
API documentation link:
API information:
- API key required
- Query limit: depends on the type and number of requests
- Data available through JSON format