SimpleCrawl
Coming Q2 2026

API Documentation

SimpleCrawl provides a simple REST API for web scraping, crawling, and data extraction. Everything you need to get clean data from any website.

Quick Start

Get started with a single API call. No SDKs required — works with any HTTP client.

curl -X POST https://api.simplecrawl.com/v1/scrape \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "format": "markdown"}'

API Endpoints

POST/v1/scrape

Scrape a single URL and get markdown, HTML, or structured data back.

POST/v1/crawl

Crawl an entire website. Follows links, respects robots.txt, returns all pages.

POST/v1/extract

AI-powered structured data extraction. Define a schema, get JSON back.

POST/v1/screenshot

Capture full-page or element screenshots of any URL.

GET/v1/status/:id

Check the status of an async crawl job.

Output Formats

Markdown

Clean, LLM-ready markdown. Strips boilerplate, preserves structure.

JSON

AI-extracted structured data matching your schema definition.

HTML

Raw cleaned HTML with scripts and styles removed.

Screenshot

Full-page or element screenshots as PNG or JPEG.

SDKs & Libraries

Official SDKs are coming at launch. In the meantime, use any HTTP client — the API is REST-based and works with curl, fetch, requests, or any language.

PythonJavaScript/TypeScriptGoRubyPHPcURL

Get your API key

Join the waitlist to get early access and 500 free credits at launch.

Get early access + 500 free credits