What it does
- Extracts clean content from any URL
- Supports multiple output formats (markdown, HTML, PDF, screenshot)
- Handles redirects automatically
Pricing
**1 per 1,000 requests)
Authentication
Include your API key in the Authorization header:Endpoint
POST /api/v1/scrape
SDK Usage
Request Parameters
The URL to scrapeExample:
"https://example.com/article"
Output format:
markdown
, html
, screenshot
, or pdf
Include images in markdown output (markdown format only)
Include hyperlinks in markdown output (markdown format only)
Response
Content in markdown format (when format is “markdown”)
Content in HTML format (when format is “html”)
Base64-encoded PDF data (when format is “pdf”)
Base64-encoded image data (when format is “screenshot”)
Final URL after redirects
HTTP status code (200 for success)
Cost in USD ($0.001)
Format Options
Format | Returns | Use Case |
---|---|---|
markdown | Clean text with optional images/links | Content extraction for AI processing |
html | Raw HTML content | Full page structure preservation |
screenshot | Base64-encoded PNG image | Visual capture of page |
pdf | Base64-encoded PDF document | Document archival |
Limitations
- 15-second timeout for all operations
- Cannot scrape pages requiring login
- JavaScript-heavy sites may not render completely
- Subject to rate limits based on your plan
Error Codes
401 - Authentication Error
401 - Authentication Error
Invalid or missing API key
400/500 - Scraping Error
400/500 - Scraping Error
Failed to scrape the provided URL
504 - Timeout Error
504 - Timeout Error
Request exceeded 15-second limit
Need Help? Join our Discord or email support@llmlayer.ai