gemini▌
18 indexed skills · max 10 per page
baoyu-danger-gemini-web
jimliu/baoyu-skills · Productivity
Reverse-engineered Gemini Web API for text and image generation with multi-turn conversation support. \n \n Supports text generation, image generation from prompts, and vision input via reference images for analysis or variation \n Includes four model options: gemini-3-pro (default), gemini-3-flash, gemini-3-flash-thinking, and gemini-3.1-pro-preview \n Multi-turn conversation via session IDs with automatic cookie caching and browser-based Google authentication \n Requires user consent before fi
google-gemini-embeddings
jezweb/claude-skills · Backend
$22
gemini-image
johnlindquist/claude · Productivity
Analyze images using Gemini's vision capabilities for OCR, UI analysis, and visual understanding. \n \n Supports PNG, JPEG, GIF, and WebP images including screenshots, diagrams, charts, and code snippets \n Built-in analysis templates for common tasks: text extraction, code recovery, UI/UX feedback, error diagnosis, and data extraction from charts \n Handles single and multiple image comparisons in a single request \n Requires Google Generative AI library and valid GEMINI_API_KEY environment var
gemini-live-api-dev
google-gemini/gemini-skills · Backend
Real-time bidirectional streaming with Gemini over WebSockets for audio, video, and text conversations. \n \n Supports audio input/output (16 kHz PCM), video frames, text, and automatic transcriptions with voice activity detection for interruption handling \n Includes native audio features: affective dialog, proactive audio, and thinking mode; function calling for synchronous and asynchronous tool use; and Google Search grounding \n Offers session management with context compression, resumption,
gemini-image-gen
jezweb/claude-skills · Productivity
Generate contextual images for web projects using the Gemini API. Produces hero backgrounds, OG cards, placeholder photos, textures, and style-matched variants.
gemini
davila7/claude-code-templates · Productivity
NEVER use --approval-mode default in background or non-interactive shells (like Claude Code tool calls). It will hang indefinitely waiting for approval prompts that cannot be provided.
google-gemini-file-search
jezweb/claude-skills · Backend
$22
gemini
steipete/clawdis · Productivity
Gemini CLI \n Use Gemini in one-shot mode with a positional prompt (avoid interactive mode). \n Quick start \n \n gemini \"Answer this question...\" \n gemini --model <name> \"Prompt...\" \n gemini --output-format json \"Return JSON\" \n \n Extensions \n \n List: gemini --list-extensions \n Manage: gemini extensions <command> \n \n Notes \n \n If auth is required, run gemini once interactively and follow the login flow. \n Avoid --yolo for safety. \n
gemini-imagegen
everyinc/compound-engineering-plugin · Productivity
Text-to-image and image editing using Google's Gemini API with multi-turn refinement support. \n \n Supports text-to-image generation, image editing, style transfer, and composition from up to 14 reference images \n Configurable resolution (1K, 2K, 4K) and 10 aspect ratios including square, landscape, portrait, and panoramic formats \n Multi-turn chat interface for iterative refinement and editing workflows \n Google Search grounding for generating images based on real-time data \n Requires GEMI
google-gemini-api
jezweb/claude-skills · Backend
Multimodal AI with Gemini 2.5 and 3 models, supporting text, images, video, audio, PDFs, function calling, thinking mode, and real-time web grounding. \n \n Supports three deployment approaches: Node.js SDK (@google/genai), fetch-based REST API for edge runtimes, and chat helpers for multi-turn conversations \n Handles multimodal inputs (images, video, audio, PDFs) with 1,048,576 token context window; thinking mode enabled by default for enhanced reasoning quality \n Includes function calling wi