tavily-extract▌
tavily-ai/skills · updated Apr 8, 2026
Extract clean markdown or text from up to 20 URLs, with JavaScript rendering and query-focused chunking support.
- ›Handles JavaScript-rendered pages with configurable extraction depth (basic for simple pages, advanced for dynamic SPAs and tables)
- ›Supports query-focused extraction to return only relevant content chunks instead of full pages
- ›Returns LLM-optimized markdown by default, with options for plain text format and structured JSON output
- ›Processes up to 20 URLs in a single call
tavily extract
Extract clean markdown or text content from one or more URLs.
Before running any command
If tvly is not found on PATH, install it first:
curl -fsSL https://cli.tavily.com/install.sh | bash && tvly login
Do not skip this step or fall back to other tools.
See tavily-cli for alternative install methods and auth options.
When to use
- You have a specific URL and want its content
- You need text from JavaScript-rendered pages
- Step 2 in the workflow: search → extract → map → crawl → research
Quick start
# Single URL
tvly extract "https://example.com/article" --json
# Multiple URLs
tvly extract "https://example.com/page1" "https://example.com/page2" --json
# Query-focused extraction (returns relevant chunks only)
tvly extract "https://example.com/docs" --query "authentication API" --chunks-per-source 3 --json
# JS-heavy pages
tvly extract "https://app.example.com" --extract-depth advanced --json
# Save to file
tvly extract "https://example.com/article" -o article.md
Options
| Option | Description |
|---|---|
--query |
Rerank chunks by relevance to this query |
--chunks-per-source |
Chunks per URL (1-5, requires --query) |
--extract-depth |
basic (default) or advanced (for JS pages) |
--format |
markdown (default) or text |
--include-images |
Include image URLs |
--timeout |
Max wait time (1-60 seconds) |
-o, --output |
Save output to file |
--json |
Structured JSON output |
Extract depth
| Depth | When to use |
|---|---|
basic |
Simple pages, fast — try this first |
advanced |
JS-rendered SPAs, dynamic content, tables |
Tips
- Max 20 URLs per request — batch larger lists into multiple calls.
- Use
--query+--chunks-per-sourceto get only relevant content instead of full pages. - Try
basicfirst, fall back toadvancedif content is missing. - Set
--timeoutfor slow pages (up to 60s). - If search results already contain the content you need (via
--include-raw-content), skip the extract step.
See also
- tavily-search — find pages when you don't have a URL
- tavily-crawl — extract content from many pages on a site
Ratings
4.4★★★★★43 reviews- ★★★★★Layla Chawla· Dec 24, 2024
Registry listing for tavily-extract matched our evaluation — installs cleanly and behaves as described in the markdown.
- ★★★★★Pratham Ware· Dec 8, 2024
Solid pick for teams standardizing on skills: tavily-extract is focused, and the summary matches what you get after install.
- ★★★★★Zaid Khanna· Dec 8, 2024
tavily-extract fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.
- ★★★★★Lucas Harris· Dec 4, 2024
tavily-extract has been reliable in day-to-day use. Documentation quality is above average for community skills.
- ★★★★★Arjun Agarwal· Nov 27, 2024
tavily-extract is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.
- ★★★★★Evelyn Khan· Nov 15, 2024
Useful defaults in tavily-extract — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.
- ★★★★★Arjun Jackson· Oct 18, 2024
Keeps context tight: tavily-extract is the kind of skill you can hand to a new teammate without a long onboarding doc.
- ★★★★★Evelyn Gonzalez· Oct 6, 2024
I recommend tavily-extract for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.
- ★★★★★William Tandon· Sep 25, 2024
tavily-extract fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.
- ★★★★★Yash Thakker· Sep 13, 2024
Registry listing for tavily-extract matched our evaluation — installs cleanly and behaves as described in the markdown.
showing 1-10 of 43