Extract API - Multi-mode page extraction
Extract from a single web page in one call. Combine any modes: structured JSON ('json', $0.005), a Markdown summary ('summary', $0.005), an answer to a question ('qa', $0.005), the page's links ('links', $0.001), and a brand profile with logos/colors/socials ('brand', $0.002). The advanced proxy adds $0.004 once when a scrape runs. If the request fails before any AI cost is incurred (page fetch failure, empty content, brand fetch failure), you are fully refunded.
Authorizations
Bearer token authentication using your LLMLayer API key. Include in Authorization header as: Bearer YOUR_LLMLAYER_API_KEY
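As a minimal sketch of the header format described above, the helper below builds the Authorization header from an API key. The function name `build_auth_headers` is illustrative and not part of the LLMLayer documentation.

```python
def build_auth_headers(api_key: str) -> dict[str, str]:
    """Return the Authorization header in the documented Bearer format."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    # Documented format: "Bearer YOUR_LLMLAYER_API_KEY"
    return {"Authorization": f"Bearer {api_key}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.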
Body
The page URL to extract from. Must be http(s). PDF URLs are not supported — use /get_pdf_content instead.
"https://www.ycombinator.com/blog"
Extraction modes to run in one call. Any combination; duplicates are ignored. Pricing is summed per mode: json/summary/qa $0.005 each, links $0.001, brand $0.002.
json, summary, qa, links, brand
["json", "summary"]
Required when modes includes 'json'. Accepts a formal JSON schema, an example object (e.g. {"title": "string", "price": "number"}), or a plain-text description of the fields you want.
The question to answer from the page — required when modes includes 'qa'.
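The rules stated so far (http(s) URLs only, no PDFs, a schema when 'json' is requested, a question when 'qa' is requested, duplicates ignored) could be checked client-side before sending a request. The sketch below is one way to do that. The function name and the parameter names `url`, `schema`, and `question` are hypothetical stand-ins, since only `modes` is named in the text above, and the `.pdf` suffix test is a guess at how a PDF URL might be recognised.

```python
from urllib.parse import urlparse

ALLOWED_MODES = ("json", "summary", "qa", "links", "brand")


def validate_extract_request(url, modes, schema=None, question=None):
    """Check the documented request rules; return modes with duplicates dropped."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        raise ValueError("url must be http(s)")
    # Assumption: a ".pdf" path suffix marks a PDF URL; the docs do not say
    # how the API itself detects PDFs.
    if parsed.path.lower().endswith(".pdf"):
        raise ValueError("PDF URLs are not supported; use /get_pdf_content")

    # Duplicates are ignored by the API, so drop them while keeping order.
    unique = list(dict.fromkeys(modes))
    unknown = [m for m in unique if m not in ALLOWED_MODES]
    if unknown:
        raise ValueError(f"unknown modes: {unknown}")

    if "json" in unique and not schema:
        raise ValueError("a schema is required when modes includes 'json'")
    if "qa" in unique and not question:
        raise ValueError("a question is required when modes includes 'qa'")
    return unique
```

For example, `validate_extract_request("https://www.ycombinator.com/blog", ["json", "summary"], schema={"title": "string"})` passes, while requesting 'qa' without a question raises `ValueError`.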
Optional extra guidance applied to all AI modes (json, summary, qa). E.g. 'dates in DD/MM/YYYY format'.
Output language for summary/qa ('auto' matches the user/page language). E.g. 'en', 'fr', 'es'.
Use advanced proxy for sites with bot protection. Adds $0.004 once per request, only when a scrape runs (brand-only requests never scrape).
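To make the pricing arithmetic concrete, here is a small estimator built from the per-mode prices in the modes description and the advanced proxy fee. It is a sketch, not an official calculator: the names `estimate_cost` and `advanced_proxy` are illustrative, and it assumes a scrape runs whenever at least one mode other than 'brand' is selected, which is one reading of "brand-only requests never scrape".

```python
from decimal import Decimal

# Per-mode prices in USD, as listed in the modes description.
MODE_PRICES = {
    "json": Decimal("0.005"),
    "summary": Decimal("0.005"),
    "qa": Decimal("0.005"),
    "links": Decimal("0.001"),
    "brand": Decimal("0.002"),
}
ADVANCED_PROXY_FEE = Decimal("0.004")


def estimate_cost(modes, advanced_proxy=False) -> Decimal:
    """Sum the selected mode prices; add the proxy fee once if a scrape runs."""
    unique = set(modes)  # duplicates are ignored; unknown modes raise KeyError
    total = sum((MODE_PRICES[m] for m in unique), Decimal("0"))
    # Assumption: any non-brand mode triggers a scrape.
    scrape_runs = bool(unique - {"brand"})
    if advanced_proxy and scrape_runs:
        total += ADVANCED_PROXY_FEE
    return total
```

Under these assumptions, `["json", "summary"]` comes to $0.010, and a brand-only request with the advanced proxy enabled stays at $0.002 because no scrape runs. `Decimal` is used so the sums are exact rather than subject to floating-point rounding.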
Omit this field to let the API pick the best default per selection: links-only requests scrape the full page (nav/footer links matter), AI modes use main content only.
Response
Extraction results. All five result fields (structured_data, summary, answer, links, brand) are always present; modes you did not request are null. The json mode result is returned as structured_data.
Final URL after redirects.
Page title.
Page metadata (description, OpenGraph fields, language, ...) as found by the scraper.
Structured data matching your schema (when 'json' in modes).
Markdown summary of the page (when 'summary' in modes).
Markdown answer to your question (when 'qa' in modes).
All links found on the page, deduplicated, max 500 (when 'links' in modes).
Brand profile: domain, title, description, colors, logos, backdrops, socials, industries, key links and pages (when 'brand' in modes).
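Because every result field is always present and unrequested modes come back as null, client code usually needs to filter for the fields that carry data. The sketch below does this, assuming `data` is the already-decoded object that holds the five result fields; the helper name `present_results` is illustrative, and where that object sits inside the full response body is not specified here.

```python
RESULT_FIELDS = ("structured_data", "summary", "answer", "links", "brand")


def present_results(data: dict) -> dict:
    """Keep only the result fields whose value is not null (None in Python)."""
    return {key: data[key] for key in RESULT_FIELDS if data.get(key) is not None}
```

For a request with modes `["json", "summary"]`, this would return only `structured_data` and `summary`, leaving out the three null fields and any non-result keys such as the page title.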
Total cost in USD — sum of the selected modes (+$0.004 advanced proxy when a scrape runs).
0.01
Total processing time in seconds.
"3.42"
200
