Repositories list
133 repositories
- Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
- Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
- apify-client-python
- Crawlee: a web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
- This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
- Apify SDK monorepo
- workflows
- fingerprint-suite: Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
- homebrew-tap
- actor-aws-costs-to-slack: This tool integrates with AWS to monitor service usage costs and posts a summary of these costs to a Slack channel. The summary includes costs for various AWS services along with a chart that provides a visual breakdown of the costs over time.
- push-actor-action
- apify-shared-python
- proxy-chain: Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
- super-scraper: Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!