Peekaboo (macOS Screen Capture)

by steipete

Enables macOS screen capture and window management with tools for taking screenshots, analyzing images, and controlling application windows

GitHub stars: 2.6K

  • macOS 15.0+ Sequoia required
  • Multi-provider AI support including local Ollama
  • Natural language agent automation

best for

  • macOS automation and GUI testing
  • AI assistants that need screen awareness
  • Automating repetitive desktop tasks

capabilities

  • Capture pixel-accurate screenshots of windows or entire screens
  • Analyze screen content using AI vision models
  • Control applications through clicks, typing, and hotkeys
  • Navigate menus and discover UI elements
  • Manage windows and switch between applications
  • Automate multi-screen workflows

what it does

Provides high-fidelity screen capture and GUI automation tools for macOS. Enables AI assistants to see your screen, take screenshots, and control applications through natural language commands.

about

Peekaboo (macOS Screen Capture) is a community-built MCP server published by steipete that provides AI assistants with tools and capabilities via the Model Context Protocol. It enables macOS screen capture and window management, with tools for taking screenshots, analyzing images, and controlling application windows. It is categorized under productivity and ai-ml.

how to install

You can install Peekaboo (macOS Screen Capture) in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

license

MIT

Peekaboo (macOS Screen Capture) is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

readme

Peekaboo 🫣 - Mac automation that sees the screen and does the clicks.

npm package · License: MIT · macOS 15.0+ (Sequoia) · Swift 6.2 · node >=22 · Download macOS · Homebrew · Ask DeepWiki

Peekaboo brings high-fidelity screen capture, AI analysis, and complete GUI automation to macOS. Version 3 adds native agent flows and multi-screen automation across the CLI and MCP server.

Note: v3 is currently in beta (3.0.0-beta4) and has a few known issues; see the changelog for details.

What you get

  • Pixel-accurate captures (windows, screens, menu bar) with optional Retina 2x scaling.
  • Natural-language agent that chains Peekaboo tools (see, click, type, scroll, hotkey, menu, window, app, dock, space).
  • Menu and menubar discovery with structured JSON; no clicks required.
  • Multi-provider AI: GPT-5.1 family, Claude 4.x, Grok 4-fast (vision), Gemini 2.5, and local Ollama models.
  • MCP server for Claude Desktop and Cursor plus a native CLI; the same tools in both.
  • Configurable, testable workflows with reproducible sessions and strict typing.
  • Requires macOS Screen Recording + Accessibility permissions (see docs/permissions.md).

Install

  • macOS app + CLI (Homebrew):
    brew install steipete/tap/peekaboo
    
  • MCP server (Node 22+, no global install needed):
    npx -y @steipete/peekaboo
    

Quick start

# Capture full screen at Retina scale and save to Desktop
peekaboo image --mode screen --retina --path ~/Desktop/screen.png

# Click a button by label (captures, resolves, and clicks in one go)
SNAPSHOT=$(peekaboo see --app Safari --json-output | jq -r '.data.snapshot_id')
peekaboo click --on "Reload this page" --snapshot "$SNAPSHOT"

# Run a natural-language automation
peekaboo "Open Notes and create a TODO list with three items"

# Run as an MCP server (Claude/Cursor)
npx -y @steipete/peekaboo

# Minimal Claude Desktop config snippet (Developer → Edit Config):
# {
#   "mcpServers": {
#     "peekaboo": {
#       "command": "npx",
#       "args": ["-y", "@steipete/peekaboo"],
#       "env": {
#         "PEEKABOO_AI_PROVIDERS": "openai/gpt-5.1,anthropic/claude-opus-4"
#       }
#     }
#   }
# }

| Command | Key flags / subcommands | What it does |
| --- | --- | --- |
| see | --app, --mode screen/window, --retina, --json-output | Capture and annotate UI, return snapshot + element IDs |
| click | --on <id/query>, --snapshot, --wait, coords | Click by element ID, label, or coordinates |
| type | --text, --clear, --delay-ms | Enter text with pacing options |
| press | key names, --repeat | Special keys and sequences |
| hotkey | combos like cmd,shift,t | Modifier combos (cmd/ctrl/alt/shift) |
| scroll | --on <id>, --direction up/down, --ticks | Scroll views or elements |
| swipe | --from/--to, --duration, --steps | Smooth gesture-style drags |
| drag | --from/--to, modifiers, Dock/Trash targets | Drag-and-drop between elements/coords |
| move | --to <id/coords>, --screen-index | Position the cursor without clicking |
| window | list, move, resize, focus, set-bounds | Move/resize/focus windows and Spaces |
| app | launch, quit, relaunch, switch, list | Launch, quit, relaunch, switch apps |
| space | list, switch, move-window | List or switch macOS Spaces |
| menu | list, list-all, click, click-extra | List/click app menus and extras |
| menubar | list, click | Target status-bar items by name/index |
| dock | launch, right-click, hide, show, list | Interact with Dock items |
| dialog | list, click, input, file, dismiss | Drive system dialogs (open/save/etc.) |
| image | --mode screen/window/menu, --retina, --analyze | Screenshot screen/window/menu bar (+analyze) |
| list | apps, windows, screens, menubar, permissions | Enumerate apps, windows, screens, permissions |
| tools | --verbose, --json-output, --no-sort | Inspect native Peekaboo tools |
| config | init, show, add, login, models | Manage credentials/providers/settings |
| permissions | status, grant | Check/grant required macOS permissions |
| run | .peekaboo.json, --output, --no-fail-fast | Execute .peekaboo.json automation scripts |
| sleep | --duration (ms) | Millisecond delays between steps |
| clean | --all-snapshots, --older-than, --snapshot | Prune snapshots and caches |
| agent | --model, --dry-run, --resume, --max-steps, audio | Natural-language multi-step automation |
| mcp | serve (default) | Run Peekaboo as an MCP server |
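
As a rough illustration of how several of these commands might be chained, the sketch below wraps the CLI in a dry-run helper so the sequence can be inspected before anything touches the screen. PEEKABOO_BIN, DRY_RUN, run_step, and click_and_type are hypothetical names introduced here, not Peekaboo features, and the element label is a placeholder. The flags are the ones listed for see, click, sleep, and type; exact argument handling may differ between versions, so check each command's help output before relying on it.

```shell
#!/usr/bin/env bash
# Sketch only: chains a few Peekaboo commands behind a dry-run switch.
# PEEKABOO_BIN and DRY_RUN are hypothetical helper variables, not Peekaboo settings.
PEEKABOO_BIN="${PEEKABOO_BIN:-peekaboo}"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode; otherwise execute it.
run_step() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s' "$PEEKABOO_BIN"
    printf ' %s' "$@"
    printf '\n'
  else
    "$PEEKABOO_BIN" "$@"
  fi
}

# Capture an app's UI, click an element by label, pause, then type text.
click_and_type() {
  app="$1"; label="$2"; text="$3"
  run_step see --app "$app" --json-output
  run_step click --on "$label"
  run_step sleep --duration 500
  run_step type --text "$text"
}

# "SearchField" is a placeholder label (assumption), not a known Safari element.
click_and_type Safari "SearchField" "example.com"
```

With DRY_RUN left at 1 the script only prints the four commands it would run. Setting DRY_RUN=0 executes them, which requires the Screen Recording and Accessibility permissions mentioned earlier.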

Models and providers

  • OpenAI: GPT-5.1 (default) and GPT-4.1/4o vision
  • Anthropic: Claude 4.x
  • xAI: Grok 4-fast reasoning + vision
  • Google: Gemini 2.5 (pro/flash)
  • Local: Ollama (llama3.3, llava, etc.)

Set providers via PEEKABOO_AI_PROVIDERS or peekaboo config add.
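
For illustration, the sketch below sets PEEKABOO_AI_PROVIDERS to the value used in the Claude Desktop snippet and splits it into provider/model pairs as a quick sanity check. The comma-separated provider/model format is inferred from that snippet; list_providers is a hypothetical helper, not part of Peekaboo, and Peekaboo's own parsing may accept other forms.

```shell
# Sketch: split a PEEKABOO_AI_PROVIDERS-style value into provider/model pairs.
# list_providers is a hypothetical helper, not a Peekaboo command.
list_providers() {
  # One entry per line, then split each entry at the first slash.
  printf '%s\n' "$1" | tr ',' '\n' | while IFS= read -r entry; do
    case "$entry" in
      */*) printf 'provider=%s model=%s\n' "${entry%%/*}" "${entry#*/}" ;;
      *)   printf 'skipping malformed entry: %s\n' "$entry" >&2 ;;
    esac
  done
}

export PEEKABOO_AI_PROVIDERS="openai/gpt-5.1,anthropic/claude-opus-4"
list_providers "$PEEKABOO_AI_PROVIDERS"
```

Entries without a slash are reported on stderr rather than silently dropped, which makes typos in the list easier to spot.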

Development basics

  • Requirements: macOS 15+, Xcode 16+/Swift 6.2. Node 22+ only if you run the pnpm docs/build helper scripts (core CLI/app/MCP are Swift-only).
  • Install deps: pnpm install then pnpm run build:cli or pnpm run test:safe.
  • Lint/format: pnpm run lint && pnpm run format.

License

MIT

FAQ

What is the Peekaboo (macOS Screen Capture) MCP server?
Peekaboo (macOS Screen Capture) is a Model Context Protocol (MCP) server profile on explainx.ai. MCP lets AI hosts (e.g. Claude Desktop, Cursor) call tools and resources through a standard interface; this page summarizes categories, install hints, and community ratings.
How do MCP servers relate to agent skills?
Skills are reusable instruction packages (often SKILL.md); MCP servers expose live capabilities. Teams frequently combine both—skills for workflows, MCP for APIs and data. See explainx.ai/skills and explainx.ai/mcp-servers for parallel directories.
How are reviews shown for Peekaboo (macOS Screen Capture)?
This profile displays 25 aggregated ratings (sample rows for discoverability plus signed-in user reviews). Average score is about 4.6 out of 5—verify behavior in your own environment before production use.

Use Cases

Extended AI Capabilities

Add new capabilities to Claude beyond text generation

Example

Access external data sources, execute code, interact with tools and services

Transform Claude from chatbot to action-taking agent

Context Enhancement

Provide Claude with access to relevant context and data

Example

Load project documentation, access knowledge bases, query databases

Get more accurate, context-aware responses

Workflow Automation

Automate multi-step workflows combining AI and external tools

Example

Research → Summarize → Create document → Send notification

Complete complex tasks end-to-end without manual steps

Implementation Guide

Prerequisites

  • Claude Desktop 0.7.0+ or Cursor IDE with MCP support
  • Basic understanding of MCP architecture and capabilities
  • Access credentials for integrated services (if required)
  • Willingness to experiment and iterate on configuration

Time Estimate

15-60 minutes depending on server complexity

Installation Steps

  1. Install the MCP server. For Peekaboo no global install is needed: npx -y @steipete/peekaboo fetches and runs it (Node 22+)
  2. Add the server configuration to your client's MCP config file (in Claude Desktop: Developer → Edit Config)
  3. Provide required credentials and configuration
  4. Restart Claude Desktop to load the new server
  5. Test basic functionality with simple prompts
  6. Explore capabilities and experiment with use cases
  7. Document successful patterns for reuse
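
The configuration step can be sketched as a small helper that prints an mcpServers entry matching the README's Claude Desktop snippet. print_peekaboo_config is a hypothetical name, and the default provider value is an assumption based on the README listing GPT-5.1 as the default. The helper only writes to stdout, so nothing is overwritten.

```shell
# Sketch: print an mcpServers entry for Peekaboo, mirroring the README's config snippet.
# print_peekaboo_config is a hypothetical helper; it writes to stdout only.
print_peekaboo_config() {
  # Provider list in "provider/model" form; the default here is an assumption.
  providers="${1:-openai/gpt-5.1}"
  cat <<EOF
{
  "mcpServers": {
    "peekaboo": {
      "command": "npx",
      "args": ["-y", "@steipete/peekaboo"],
      "env": {
        "PEEKABOO_AI_PROVIDERS": "${providers}"
      }
    }
  }
}
EOF
}

print_peekaboo_config "openai/gpt-5.1,anthropic/claude-opus-4"
```

If the client's config file already lists other servers, merge the "peekaboo" entry into the existing mcpServers object by hand rather than replacing the whole file.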

Troubleshooting

  • MCP server not loading: Check config syntax, verify installation
  • Connection errors: Check network, firewall, credentials
  • Feature not working: Read server docs, check required parameters
  • Performance issues: Monitor resource usage, check for network latency
  • Conflicts with other servers: Check port assignments, namespace collisions

Best Practices

✓ Do

  • Read server documentation thoroughly before setup
  • Start with simple use cases to validate functionality
  • Test in non-production environment first
  • Monitor resource usage and performance
  • Keep servers updated for bug fixes and new features
  • Document configuration for team members
  • Use environment variables for sensitive configuration

✗ Don't

  • Don't grant overly permissive access to MCP servers
  • Don't skip reading security considerations in docs
  • Don't expose sensitive data without proper controls
  • Don't run untrusted MCP servers without code review
  • Don't ignore error messages—investigate root cause

💡 Pro Tips

  • Combine multiple MCP servers for powerful workflows
  • Create custom MCP servers for your specific needs
  • Share successful configurations with team
  • Use MCP inspector for debugging
  • Join MCP community for tips and troubleshooting

Technical Details

Architecture

Model Context Protocol standardizes how AI hosts (Claude, Cursor) communicate with external tools and data sources through server implementations.

Protocols

  • Model Context Protocol (MCP)
  • JSON-RPC 2.0
  • stdio or HTTP transport

Compatibility

  • Claude Desktop
  • Cursor IDE
  • Custom MCP clients

When to Use This

✓ Use When

Use when you need Claude to access external data, execute actions, or integrate with tools. Best for extending AI capabilities beyond conversation.

✗ Avoid When

Avoid when native integrations exist (use official APIs directly), for real-time critical systems, or when security/compliance requires zero external dependencies.

Integration

  • Tool composition: Chain multiple MCP tools in workflows
  • Context augmentation: Provide AI with relevant external data
  • Action delegation: Let AI execute tasks on external systems
  • Bidirectional sync: Keep AI context and external systems in sync

MCP server reviews

Ratings

4.6 average · 25 reviews
  • Maya Gonzalez · Dec 12, 2024

    Peekaboo (macOS Screen Capture) is a well-scoped MCP server in the explainx.ai directory — install snippets and categories matched our Claude Code setup.

  • Ava Gill · Nov 3, 2024

    Peekaboo (macOS Screen Capture) is among the better-indexed MCP projects we tried; the explainx.ai summary tracks the official description.

  • Ava Rao · Oct 22, 2024

    According to our notes, Peekaboo (macOS Screen Capture) benefits from clear Model Context Protocol framing — fewer ambiguous “AI plugin” claims.

  • Oshnikdeep · Sep 9, 2024

    Strong directory entry: Peekaboo (macOS Screen Capture) surfaces stars and publisher context so we could sanity-check maintenance before adopting.

  • Noor Kim · Sep 5, 2024

    Peekaboo (macOS Screen Capture) is a well-scoped MCP server in the explainx.ai directory — install snippets and categories matched our Claude Code setup.

  • Ganesh Mohane · Aug 28, 2024

    Useful MCP listing: Peekaboo (macOS Screen Capture) is the kind of server we cite when onboarding engineers to host + tool permissions.

  • Amelia Ghosh · Aug 24, 2024

    Peekaboo (macOS Screen Capture) has been reliable for tool-calling workflows; the MCP profile page is a good permalink for internal docs.

  • Sakshi Patil · Jul 19, 2024

    We evaluated Peekaboo (macOS Screen Capture) against two servers with overlapping tools; this profile had the clearer scope statement.

  • Meera Li · Jul 15, 2024

    According to our notes, Peekaboo (macOS Screen Capture) benefits from clear Model Context Protocol framing — fewer ambiguous “AI plugin” claims.

  • Yash Thakker · Jul 11, 2024

    Peekaboo (macOS Screen Capture) reduced integration guesswork — categories and install configs on the listing matched the upstream repo.
