tag

gemini

18 indexed skills · max 10 per page

skills (18)

baoyu-danger-gemini-web

jimliu/baoyu-skills · Productivity

3

Reverse-engineered Gemini Web API for text and image generation with multi-turn conversation support. \n \n Supports text generation, image generation from prompts, and vision input via reference images for analysis or variation \n Includes four model options: gemini-3-pro (default), gemini-3-flash, gemini-3-flash-thinking, and gemini-3.1-pro-preview \n Multi-turn conversation via session IDs with automatic cookie caching and browser-based Google authentication \n Requires user consent before fi

google-gemini-embeddings

jezweb/claude-skills · Backend

0

$22

gemini-image

johnlindquist/claude · Productivity

0

Analyze images using Gemini's vision capabilities for OCR, UI analysis, and visual understanding. \n \n Supports PNG, JPEG, GIF, and WebP images including screenshots, diagrams, charts, and code snippets \n Built-in analysis templates for common tasks: text extraction, code recovery, UI/UX feedback, error diagnosis, and data extraction from charts \n Handles single and multiple image comparisons in a single request \n Requires Google Generative AI library and valid GEMINI_API_KEY environment var

gemini-live-api-dev

google-gemini/gemini-skills · Backend

0

Real-time bidirectional streaming with Gemini over WebSockets for audio, video, and text conversations. \n \n Supports audio input/output (16 kHz PCM), video frames, text, and automatic transcriptions with voice activity detection for interruption handling \n Includes native audio features: affective dialog, proactive audio, and thinking mode; function calling for synchronous and asynchronous tool use; and Google Search grounding \n Offers session management with context compression, resumption,

gemini-image-gen

jezweb/claude-skills · Productivity

0

Generate contextual images for web projects using the Gemini API. Produces hero backgrounds, OG cards, placeholder photos, textures, and style-matched variants.

gemini

davila7/claude-code-templates · Productivity

0

NEVER use --approval-mode default in background or non-interactive shells (like Claude Code tool calls). It will hang indefinitely waiting for approval prompts that cannot be provided.

google-gemini-file-search

jezweb/claude-skills · Backend

0

$22

gemini

steipete/clawdis · Productivity

0

Gemini CLI \n Use Gemini in one-shot mode with a positional prompt (avoid interactive mode). \n Quick start \n \n gemini \"Answer this question...\" \n gemini --model <name> \"Prompt...\" \n gemini --output-format json \"Return JSON\" \n \n Extensions \n \n List: gemini --list-extensions \n Manage: gemini extensions <command> \n \n Notes \n \n If auth is required, run gemini once interactively and follow the login flow. \n Avoid --yolo for safety. \n

gemini-imagegen

everyinc/compound-engineering-plugin · Productivity

0

Text-to-image and image editing using Google's Gemini API with multi-turn refinement support. \n \n Supports text-to-image generation, image editing, style transfer, and composition from up to 14 reference images \n Configurable resolution (1K, 2K, 4K) and 10 aspect ratios including square, landscape, portrait, and panoramic formats \n Multi-turn chat interface for iterative refinement and editing workflows \n Google Search grounding for generating images based on real-time data \n Requires GEMI

google-gemini-api

jezweb/claude-skills · Backend

0

Multimodal AI with Gemini 2.5 and 3 models, supporting text, images, video, audio, PDFs, function calling, thinking mode, and real-time web grounding. \n \n Supports three deployment approaches: Node.js SDK (@google/genai), fetch-based REST API for edge runtimes, and chat helpers for multi-turn conversations \n Handles multimodal inputs (images, video, audio, PDFs) with 1,048,576 token context window; thinking mode enabled by default for enhanced reasoning quality \n Includes function calling wi

prevpage 1 / 2next