Capability · Framework — rag

Jina Reader

Jina Reader is the simplest URL-to-Markdown service for LLM workflows. You call r.jina.ai/<url> and get back rendered, de-cluttered Markdown with images and links preserved. The Reader engine is also open source, so you can self-host for private networks or strict data residency.

Framework facts

Category
rag
Language
TypeScript / Go
License
Apache-2.0
Repository
https://github.com/jina-ai/reader

Install

# No install — just call the API
curl https://r.jina.ai/https://engineering.vips.edu

Quickstart

import requests

url = 'https://engineering.vips.edu/about'
markdown = requests.get(f'https://r.jina.ai/{url}').text
print(markdown[:500])

Alternatives

  • Firecrawl — multi-page crawler
  • Trafilatura — local extraction with no API
  • Mercury Parser — legacy reader mode
  • Crawl4AI — async open-source crawler

Frequently asked questions

When to pick Jina Reader over Firecrawl?

Reader is ideal when you already have a list of URLs and want zero-setup Markdown. Firecrawl is better when you need to crawl a whole site with sitemap following and structured extraction.

Does Reader run JavaScript?

Yes, the hosted service renders JS. The self-host image also supports headless Chromium rendering.

Sources

  1. Jina Reader — GitHub — accessed 2026-04-20
  2. Jina Reader — landing — accessed 2026-04-20