pdf▌
41 indexed skills · max 10 per page
pdf-compress
claude-office-skills/skills · Documents
Reduce PDF file sizes for easier sharing, faster loading, and efficient storage.
pdf-merge-&-split
claude-office-skills/skills · Documents
Combine multiple PDF files into one or split PDFs into separate documents.
chat-with-pdf
claude-office-skills/skills · Documents
Have intelligent conversations about PDF documents - ask questions, get summaries, and extract specific information.
ai-pdf-filler-cli
askyourpdf/ai-pdf-filler · AI/ML
Autofill PDF forms from files or URLs using AI, with async task monitoring and download management. \n \n Supports both new form uploads (from local files or URLs) and existing form IDs, with AI-powered field population from context data or source documents \n Includes task monitoring commands ( status , wait ) to track async autofill execution and poll for completion \n Requires simplicity-cli installation and API key authentication via login, stdin, or environment variable \n Enforces validati
pdf-ocr-extraction
claude-office-skills/skills · Documents
Extract text from scanned documents and image-based PDFs using OCR technology.
pdf-processing-pro
davila7/claude-code-templates · Documents
Production-ready PDF processing with forms, tables, OCR, and batch operations. \n \n Includes 10+ pre-built CLI scripts for form analysis, filling, table extraction, text extraction, PDF merging, splitting, and validation \n All scripts feature comprehensive error handling with exit codes, input validation, type hints, and configurable logging for automation integration \n Supports complex workflows: form field detection and filling with validation, multi-page table extraction to CSV/Excel, and
pdf-ocr
yejinlei/pdf-ocr-skill · Documents
Dual-engine OCR for extracting text from scanned PDFs and images with local or cloud processing. \n \n Supports RapidOCR (local, free, no API key) and SiliconFlow API (cloud-based, high precision) with automatic fallback when local engine fails \n Handles scanned PDFs and multiple image formats (JPG, PNG, BMP, GIF, TIFF, WEBP) with Chinese and English text recognition \n Preserves text order and structure; automatically converts PDF pages to images for processing \n Batch processing capability f
pdf-to-markdown
duc01226/easyplatform · Documents
Convert PDF files to Markdown with automatic detection of native text vs scanned documents. \n \n Supports three conversion modes: auto-detect (default), native text extraction, and OCR for scanned PDFs \n Fast JavaScript-based processing with no system dependencies; returns JSON output with page count and conversion mode \n Handles basic table structure preservation and works cross-platform on Windows, macOS, and Linux \n OCR for scanned documents requires optional tesseract.js setup; complex m
davila7/claude-code-templates · Documents
Comprehensive PDF processing covering text extraction, merging, splitting, OCR, form filling, and encryption. \n \n Supports multiple Python libraries: pypdf for basic operations (merge, split, rotate, encrypt), pdfplumber for text and table extraction with layout preservation, and reportlab for creating new PDFs from scratch \n Includes command-line tools (pdftotext, qpdf, pdftk) for batch processing and scripting workflows \n Handles scanned PDFs via OCR using pytesseract and pdf2image to make
pdf-to-docx
claude-office-skills/skills · Documents
This skill enables conversion from PDF to editable Word documents using pdf2docx - a Python library that preserves layout, tables, images, and text formatting. Unlike OCR-based solutions, pdf2docx extracts native PDF content for accurate conversion.