Skip to content

Add project: Crawl4AIΒ #214

@AtharvaDomale

Description

@AtharvaDomale

Project details:

  • Project Name: Crawl4AI
  • Github URL: https://github.com/unclecode/crawl4ai
  • Category: Web Crawling & Scraping
  • License: Apache-2.0
  • Package Managers: pypi:crawl4ai dockerhub:unclecode/crawl4ai

Additional context:

Crawl4AI is an open-source, AI-friendly web crawler designed to extract clean Markdown or structured data for use in RAG pipelines, LLM agents, or custom automation. It supports:

  • Automatic crawl-depth detection
  • Stealth crawling via Playwright
  • Proxy rotation and headless browser control
  • Output in Markdown, JSON, or HTML
  • A simple CLI and Python SDK

It has active development, great documentation, and offers performance advantages over alternatives like Firecrawl. Perfect for scraping AI training data or building agent-ready corpora.

Metadata

Metadata

Assignees

No one assigned

    Labels

    add-projectAdd new project to best-of list

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions