Change the repository type filter
All
Repositories list
133 repositories
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
- Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
- Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
- This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
- Apify SDK monorepo
fingerprint-suite
PublicBrowser fingerprinting tools for anonymizing your scrapers. Developed by Apify.apify-actor-docker
Public.github
Publichomebrew-tap
Publicactor-aws-costs-to-slack
PublicThis tool integrates with AWS to monitor service usage costs and posts a summary of these costs to a Slack channel. The summary includes costs for various AWS services along with a chart that provides a visual breakdown of the costs over time.proxy-chain
PublicNode.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.apify-zapier-integration
Publicsuper-scraper
PublicGeneric REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!