tag

crawl

5 indexed skills · max 10 per page

skills (5)

alicloud-ai-misc-crawl-and-skill

cinience/alicloud-skills · Cloud

0

Category: task

crawl

tavily-ai/skills · Productivity

0

Extract and save website content as markdown files for offline access and analysis.

- Supports configurable crawl depth (1-5 levels), breadth limits, and page caps to balance coverage against performance
- Includes path filtering via regex patterns to focus on specific sections and exclude irrelevant content
- Offers two modes: full-page extraction for data collection, or semantic chunking with natural language instructions for feeding results into LLM context
- Provides a companion Map A

alicloud-ai-misc-crawl-and-skill-test

cinience/alicloud-skills · Cloud

0

Category: test

Minimal Viable Test

Goals

- Validate only the minimal request path for this skill.
- If execution fails, record exact error details without guessing parameters.

Prerequisites

- Prepare authentication and region settings based on the skill instructions.
- Target skill: skills/ai/misc/alicloud-ai-misc-crawl-and-skill

Test Steps (Minimal)

- Open the target skill SKILL.md and choose one minimal input example.
- Send one minimal request or run the examp

tavily-crawl

tavily-ai/skills · Productivity

0

Multi-page website crawler with semantic filtering and markdown export.

- Crawl entire site sections with depth and breadth control; filter by path regex, domain, or natural language instructions to focus results
- Save each page as local markdown files via --output-dir, or return structured JSON for agentic processing
- Use semantic instructions with chunk extraction to prevent context bloat when feeding results to LLMs; use full-page extraction for offline documentation downloads
- Sup

firecrawl-crawl

firecrawl/cli · Productivity

0

Bulk extract content from entire websites or site sections with depth and path filtering.

- Crawls pages following links up to configurable depth limits and page counts, with path inclusion/exclusion filters to scope extraction
- Supports async job polling or synchronous waiting with progress display via --wait and --progress flags
- Offers concurrency control, request delays, and JSON output formatting for integration into agent workflows
- Part of a four-step escalation pattern: search