Skip to content
@apify

Apify

We're making the web more programmable.

Pinned Loading

  1. crawlee-python crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python 4.6k 318

  2. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 15.6k 666

  3. proxy-chain proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript 850 144

  4. apify-sdk-js apify-sdk-js Public

    Apify SDK monorepo

    TypeScript 123 35

  5. got-scraping got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript 555 44

  6. fingerprint-suite fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 979 103

Repositories

Showing 10 of 130 repositories
  • actor-vector-database-integrations Public

    Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)

    apify/actor-vector-database-integrations’s past year of commit activity
    Python 4 Apache-2.0 4 0 0 Updated Nov 14, 2024
  • workflows Public

    Apify's reusable github workflows

    apify/workflows’s past year of commit activity
    Python 7 4 4 4 Updated Nov 14, 2024
  • apify-docs Public

    This project is the home of Apify's documentation.

    apify/apify-docs’s past year of commit activity
    API Blueprint 29 Apache-2.0 76 69 23 Updated Nov 14, 2024
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee-python’s past year of commit activity
    Python 4,588 Apache-2.0 318 72 10 Updated Nov 14, 2024
  • apify-sdk-js Public

    Apify SDK monorepo

    apify/apify-sdk-js’s past year of commit activity
    TypeScript 123 Apache-2.0 35 11 8 Updated Nov 14, 2024
  • zapier-integrations-scraper Public

    Scrape list of Zapier integrations from Zapier website

    apify/zapier-integrations-scraper’s past year of commit activity
    TypeScript 0 0 0 1 Updated Nov 14, 2024
  • make-integrations-scraper Public

    Scrape list of available integrations from Make

    apify/make-integrations-scraper’s past year of commit activity
    TypeScript 0 0 0 1 Updated Nov 14, 2024
  • apify-client-js Public

    Apify API client for JavaScript / Node.js.

    apify/apify-client-js’s past year of commit activity
    JavaScript 68 Apache-2.0 27 16 3 Updated Nov 14, 2024
  • apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    apify/apify-cli’s past year of commit activity
    TypeScript 122 19 35 (1 issue needs help) 3 Updated Nov 14, 2024
  • apify-client-python Public

    Apify API client for Python

    apify/apify-client-python’s past year of commit activity
    Python 49 Apache-2.0 11 8 2 Updated Nov 14, 2024