image▌
70 indexed skills · max 10 per page
gemini-image
johnlindquist/claude · Productivity
Analyze images using Gemini's vision capabilities for OCR, UI analysis, and visual understanding. \n \n Supports PNG, JPEG, GIF, and WebP images including screenshots, diagrams, charts, and code snippets \n Built-in analysis templates for common tasks: text extraction, code recovery, UI/UX feedback, error diagnosis, and data extraction from charts \n Handles single and multiple image comparisons in a single request \n Requires Google Generative AI library and valid GEMINI_API_KEY environment var
image-enhancer
composiohq/awesome-claude-skills · Productivity
Enhance image resolution, sharpness, and clarity for professional-quality output. \n \n Analyzes and improves resolution, sharpness, and compression artifacts; intelligently upscales images while reducing noise \n Supports batch processing of entire directories and preserves original files automatically \n Optimizes output based on intended use case: web, print, social media, or presentations \n Works best with screenshots and digital images; accepts PNG and JPG formats with configurable output
videoagent-image-studio
pexoai/pexo-skills · Video
Unified access to 8 AI image generation models with automatic model selection and zero API key setup. \n \n Supports Midjourney, Flux (Pro/Dev/Schnell), Ideogram, Recraft, SDXL, and Nano Banana with automatic model routing based on user intent \n Handles Midjourney's async polling transparently; all models return consistent output format with image URLs \n Includes Midjourney actions (upscale, variation, reroll) and reference image support for style consistency \n All requests routed through hos
image-generation-mcp
supercent-io/skills-template · Productivity
Generate high-quality images via Gemini models with structured prompts, aspect ratios, and brand validation. \n \n Supports multiple Gemini models (gemini-3-pro-image, gemini-2.5-flash-image, gemini-2.5-pro-image) optimized for different quality and speed tradeoffs \n Enforces structured prompt format covering subject, style, lighting, mood, composition, aspect ratio, and brand colors to ensure consistent outputs \n Includes validation workflows across multiple agents for prompt completeness, st
p-image
inferen-sh/skills · Productivity
Fast, optimized image generation with Pruna's P-Image models via inference.sh CLI. \n \n Four model variants: P-Image for text-to-image, P-Image-LoRA with 11 preset styles, P-Image-Edit for image editing, and P-Image-Edit-LoRA for stylized edits \n Supports multiple aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, custom) and multi-image compositing for collages and combinations \n Requires inference.sh CLI ( infsh ) and login; run models via infsh app run pruna/[model-name] with JSON input p
baoyu-cover-image
jimliu/baoyu-skills · Productivity
Generate customizable article cover images across 5 independent dimensions and 3 aspect ratios. \n \n Combines 6 image types (hero, conceptual, typography, metaphor, scene, minimal) with 10 color palettes and 7 rendering styles for fine-grained visual control \n Supports cinematic (2.35:1), widescreen (16:9), and square (1:1) aspects, plus additional ratios (4:3, 3:2, 3:4) \n Auto-analyzes article content to recommend dimensions, or accepts explicit flags for type, palette, rendering, text level
alicloud-ai-image-zimage-turbo-test
cinience/alicloud-skills · Cloud
Category: test
qwen-image-2-pro
inference-sh/skills · Productivity
Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI. Best for professional text rendering and complex designs.
mermaid-to-image
zc277584121/marketing-skills · AI/ML
Convert ```mermaid code blocks in Markdown (or other text) files into PNG images, and replace the code blocks with image references. Useful for platforms that don't render Mermaid natively (GitHub Pages/Jekyll, Dev.to, etc.).
og-image-design
inference-sh/skills · Frontend
Create social sharing images (Open Graph) via inference.sh CLI.