Absortio

Email → Summary → Bookmark → Email

Reader API

https://jina.ai/reader/ Sep 5, 2025 21:24

Extracto

Convert any URL to Markdown for better grounding LLMs.

Contenido

The target URL to fetch content from

Add API Key for Higher Rate Limit

Enter your Jina API key to access a higher rate limit. For latest rate limit information, please refer to the table below.

Browser Engine (Quality/Speed)

Choose the browser engine for fetching the webpage content. This affects the quality, speed, completeness, accessibility of the content.

Content Format

You can control the level of detail in the response to prevent over-filtering. The default pipeline is optimized for most websites and LLM input.

JSON Response

The response will be in JSON format, containing the URL, title, content, and timestamp (if available). In Search mode, it returns a list of five entries, each following the described JSON structure.

Timeout

Maximum page load wait time, use this if you find default browser engine is too slow on simple webpage.

Token Budget

Limits the maximum number of tokens used for this request. Exceeding this limit will cause the request to fail.

Uses ReaderLM-v2 for HTML to Markdown conversion, to deliver high-quality results for websites with complex structures and contents. Costs 3x tokens!

CSS Selector: Only

List of CSS selectors to target specific page elements.

CSS Selector: Wait-For

CSS selectors to wait for before returning results.

CSS Selector: Excluding

CSS selectors for elements to remove (headers, footers, etc.).

Remove All Images

Remove all images from the response.

Target Gpt-Oss Series Model

Use gpt-oss internal browser citation format for links.

Gather All Links At the End

A "Buttons & Links" section will be created at the end. This helps the downstream LLMs or web agents navigating the page or take further actions.

Gather All Images At the End

An "Images" section will be created at the end. This gives the downstream LLMs an overview of all visuals on the page, which may improve reasoning.

Sets browser viewport dimensions for responsive rendering.

Forward Cookie

Our API server can forward your custom cookie settings when accessing the URL, which is useful for pages requiring extra authentication. Note that requests with cookies will not be cached.

Image Caption

Captions all images at the specified URL, adding 'Image [idx]: [caption]' as an alt tag for those without one. This allows downstream LLMs to interact with the images in activities such as reasoning and summarizing.

Use a Proxy Server

Our API server can utilize your proxy to access URLs, which is helpful for pages accessible only through specific proxies.

Use a Country-Specific Proxy Server

Set country code for location-based proxy server. Use 'auto' for optimal selection or 'none' to disable.

Bypass Cached Content

Our API caches URL contents for a certain amount of time. Set it to true to ignore the cached result and fetch the content from the URL directly.

Do Not Cache & Track!

When enabled, the requested URL won't be cached and tracked on our server.

Github Flavored Markdown

Opt in/out features from GFM (Github Flavored Markdown).

Stream Mode

Stream mode is beneficial for large target pages, allowing more time for the page to fully render. If standard mode results in incomplete content, consider using Stream mode.

Customize Browser Locale

Control the browser locale to render the page. Lots of websites serve different content based on the locale.

Strictly comply robots policy

Define bot User-Agent to check against robots.txt before fetching content.

iframe Extraction

Processes content from all embedded iframes in the DOM tree.

Shadow DOM Extraction

Extracts content from all Shadow DOM roots in the document.

Follow Redirect

Choose whether to resolve to the final destination URL after following all redirects. Enable to follow the full redirect chain.

Use Reader on your local PDF and HTML file by uploading them. Only support pdf and html files. For HTML, please also specify a reference URL for better parsing related CSS/JS scripts.

Executes preprocessing JS code (inline string or remote URL).

Heading Style

Sets markdown heading format (passed to Turndown).

Alternative Heading Syntax

Horizontal Rule Style

Defines markdown horizontal rule format (passed to Turndown).

Bullet Point Style

Sets bullet list marker character (passed to Turndown).

Emphasis Style

Defines markdown emphasis delimiter (passed to Turndown).

Strong Emphasis Style

Sets markdown strong emphasis delimiter (passed to Turndown).

Link Style

Determines markdown link format (passed to Turndown).

All infrastructure and data processing operations reside entirely within EU jurisdiction.