browser-automationdeveloper-tools

BrowserCat

by browsercat

BrowserCat offers cloud-based Selenium test automation for software testing, enabling LLM-driven web navigation and inte

Enables LLMs to interact with web pages through cloud-based browser automation for navigation, screenshot capture, element interaction, and JavaScript execution without local browser installation.

github stars

5

Cloud-based — no local browser neededReal browser environmentScreenshot capture with element targeting

best for

  • / Web scraping and data extraction
  • / Automated testing of web applications
  • / AI agents that need to interact with websites
  • / Content monitoring and verification

capabilities

  • / Navigate to any web page
  • / Take full page or element screenshots
  • / Click and hover on page elements
  • / Fill forms and select dropdown options
  • / Execute JavaScript in browser console
  • / Access browser console logs

what it does

Provides cloud-based browser automation for LLMs to navigate websites, interact with elements, and take screenshots without installing browsers locally.

about

BrowserCat is an official MCP server published by browsercat that provides AI assistants with tools and capabilities via the Model Context Protocol. BrowserCat offers cloud-based Selenium test automation for software testing, enabling LLM-driven web navigation and inte It is categorized under browser automation, developer tools.

how to install

You can install BrowserCat in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

license

MIT

BrowserCat is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

readme

BrowserCat MCP Server

A Model Context Protocol server that provides browser automation capabilities using BrowserCat's cloud browser service. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment without needing to install browsers locally.

Components

Tools

  • browsercat_navigate
    • Navigate to any URL in the browser
    • Input: url (string)
  • browsercat_screenshot
    • Capture screenshots of the entire page or specific elements
    • Inputs:
      • name (string, required): Name for the screenshot
      • selector (string, optional): CSS selector for element to screenshot
      • width (number, optional, default: 800): Screenshot width
      • height (number, optional, default: 600): Screenshot height
  • browsercat_click
    • Click elements on the page
    • Input: selector (string): CSS selector for element to click
  • browsercat_hover
    • Hover elements on the page
    • Input: selector (string): CSS selector for element to hover
  • browsercat_fill
    • Fill out input fields
    • Inputs:
      • selector (string): CSS selector for input field
      • value (string): Value to fill
  • browsercat_select
    • Select an option from a dropdown menu
    • Inputs:
      • selector (string): CSS selector for select element
      • value (string): Value to select
  • browsercat_evaluate
    • Execute JavaScript in the browser console
    • Input: script (string): JavaScript code to execute

Resources

The server provides access to two types of resources:

  1. Console Logs (console://logs)
    • Browser console output in text format
    • Includes all console messages from the browser
  2. Screenshots (screenshot://<name>)
    • PNG images of captured screenshots
    • Accessible via the screenshot name specified during capture

Key Features

  • Cloud-based browser automation
  • No local browser installation required
  • Console log monitoring
  • Screenshot capabilities
  • JavaScript execution
  • Basic web interaction (navigation, clicking, form filling)

Configuration to use BrowserCat MCP Server

Environment Variables

The BrowserCat MCP server requires the following environment variable:

NPX Configuration

{
  "mcpServers": {
    "browsercat": {
      "command": "npx",
      "args": ["-y", "@browsercatco/mcp-server"],
      "env": {
        "BROWSERCAT_API_KEY": "your-api-key-here"
      }
    }
  }
}

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.