API Documentation
SimpleCrawl provides a simple REST API for web scraping, crawling, and data extraction. Everything you need to get clean data from any website.
Quick Start
Get started with a single API call. No SDKs required — works with any HTTP client.
curl -X POST https://api.simplecrawl.com/v1/scrape \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"url": "https://example.com", "format": "markdown"}'API Endpoints
/v1/scrapeScrape a single URL and get markdown, HTML, or structured data back.
/v1/crawlCrawl an entire website. Follows links, respects robots.txt, returns all pages.
/v1/extractAI-powered structured data extraction. Define a schema, get JSON back.
/v1/screenshotCapture full-page or element screenshots of any URL.
/v1/status/:idCheck the status of an async crawl job.
Output Formats
Markdown
Clean, LLM-ready markdown. Strips boilerplate, preserves structure.
JSON
AI-extracted structured data matching your schema definition.
HTML
Raw cleaned HTML with scripts and styles removed.
Screenshot
Full-page or element screenshots as PNG or JPEG.
SDKs & Libraries
Official SDKs are coming at launch. In the meantime, use any HTTP client — the API is REST-based and works with curl, fetch, requests, or any language.
Get your API key
Join the waitlist to get early access and 500 free credits at launch.