gemini▌
18 indexed skills · max 10 per page
gemini-computer-use
am-will/codex-skills · Productivity
Gemini 2.5 Computer Use browser automation with Playwright-based agent loops and safety confirmations. \n \n Implements a screenshot-to-action cycle: capture screen, send to Gemini, parse function calls, execute in Playwright, return results until task completion or turn limit \n Supports multiple browser options: bundled Chromium (default), Chrome/Edge channels via COMPUTER_USE_BROWSER_CHANNEL , or custom executables like Brave \n Includes safety confirmation workflow that prompts users before
gemini-interactions-api
google-gemini/gemini-skills · Backend
Unified interface for Gemini models and agents with server-side state, streaming, and tool orchestration. \n \n Supports multiple current models (gemini-3-flash-preview, gemini-3-pro-preview, gemini-2.5-flash/pro) and the Deep Research agent; automatically substitute deprecated model IDs with current alternatives \n Offload conversation history to the server via previous_interaction_id for stateful multi-turn interactions without manual history management \n Built-in tool orchestration including
gemini-watermark-remover
rookie-ricardo/erduo-skills · Productivity
Remove visible Gemini AI watermarks from images using reverse alpha blending. \n \n Provides a Python CLI tool that strips the bottom-right Gemini watermark without requiring HTML or server-side components \n Uses pre-captured watermark alpha maps (48px and 96px variants) and reverse alpha blending to reconstruct the underlying image \n Supports any image format that Pillow can read and write, with output format inferred from file extension \n Includes detailed algorithm documentation covering d
gemini-designer
oil-oil/gemini-designer · Frontend
Delegate design tasks to Gemini via ZenMux API. Gemini creates HTML page designs, SVG icons, and provides design advice.
gemini-api-dev
google-gemini/gemini-skills · Backend
Build applications with Google's Gemini models, supporting multimodal content, function calling, and structured outputs across Python, JavaScript, Go, and Java. \n \n Access current Gemini 3 models (Pro, Flash, Pro Image) with 1M token context; legacy Gemini 2.x and 1.5 models are deprecated \n Supports text generation, image/audio/video understanding, function calling, structured JSON output, code execution, context caching, and embeddings \n Official SDKs available: google-genai (Python), @goo
gemini-guide
jezweb/claude-skills · Frontend
Look up Gemini API documentation and SDK patterns when building with Google Gemini. This skill brings Gemini docs TO Claude — it does not call Gemini.
gemini-peer-review
jezweb/claude-skills · Productivity
Consult Gemini as a coding peer for a second opinion on code quality, architecture decisions, debugging, or security reviews.
gemini
softaworks/agent-toolkit · Productivity
Comprehensive code review and large-context analysis using Gemini 3 Pro with flexible approval modes. \n \n Supports five Gemini models ranging from flagship reasoning (Gemini 3 Pro) to cost-optimized alternatives, with 1M token input windows for analyzing entire codebases and documentation sets \n Three approval modes: default for interactive terminal prompts, auto_edit for auto-approving code suggestions, and yolo for fully automated background execution \n Critical safety requirement: use --a