vision▌
15 indexed skills · max 10 per page
pdf-vision-reader
childbamboo/claude-code-marketplace-sample · Documents
図表が多い PDF を画像化して、Claude の vision 機能で内容を解析・Markdown 化するスキルです。
blip-2-vision-language
davila7/claude-code-templates · Productivity
Comprehensive guide to using Salesforce's BLIP-2 for vision-language tasks with frozen image encoders and large language models.
vision-framework
dpearson2699/swift-ios-skills · Productivity
Detect text, faces, barcodes, objects, and body poses in images and video using on-device computer vision. Patterns target iOS 26+ with Swift 6.3, backward-compatible where noted.
axiom-vision
charleswiltgen/axiom · Productivity
Apple Vision Framework for computer vision tasks: subject segmentation, pose detection, text recognition, barcode scanning, and document processing. \n \n Covers 13+ Vision APIs across subject lifting, hand/body pose, person segmentation, text OCR, barcode detection, and document scanning with decision trees for choosing the right tool \n Includes 15 production patterns: combining APIs to exclude hands from objects, real-time gesture recognition, multi-person segmentation, fitness action classif
axiom-vision-ref
charleswiltgen/axiom · Productivity
axiom-vision-ref