Apify
Pinned Loading
Repositories
- crawlee Public
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee’s past year of commit activity - rag-web-browser Public
RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
apify/rag-web-browser’s past year of commit activity - crawlee-python Public
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee-python’s past year of commit activity - fingerprint-suite Public
Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
apify/fingerprint-suite’s past year of commit activity - apify-sdk-python Public
The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
apify/apify-sdk-python’s past year of commit activity