Overview

Browser Operator is Zeus’s browser automation extension. It lets AI agents control the user’s browser to perform automated tasks.

It is implemented as a Chrome extension and allows an agent to:

Navigate to web pages
Click elements
Fill out forms
Extract content
Take screenshots
Execute JavaScript

Installation

From Chrome Web Store

Visit the Chrome Web Store
Search for “Zeus Browser Operator”
Click “Add to Chrome”
Confirm installation

Developer Mode Installation

Download the extension source code
Open chrome://extensions
Enable “Developer mode”
Click “Load unpacked”
Select the extension directory

Architecture

Connection Flow

1. Install Extension

After installation, the extension displays a Zeus icon in the toolbar.

2. Login and Authorization

Click the extension icon
Log in to your Zeus account (or scan the QR code to log in)
Authorize extension access

3. Establish Connection

Supported Actions

Navigation

Action	Description	Parameters
`browser_navigate`	Navigate to URL	`url`
`browser_back`	Go back	-
`browser_forward`	Go forward	-
`browser_refresh`	Refresh page	-
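The wire format used between the agent and the extension is not described here. As a minimal sketch, assuming each action travels as a JSON object with hypothetical `action` and `params` keys, a navigation request could be built like this. The function name `build_navigation_action` and the URL check are illustrative and not part of Browser Operator.

```python
import json
from typing import Optional
from urllib.parse import urlparse

# Navigation actions that take no parameters, per the table above.
NO_PARAM_ACTIONS = {"browser_back", "browser_forward", "browser_refresh"}


def build_navigation_action(action: str, url: Optional[str] = None) -> str:
    """Serialize a navigation action; the envelope shape is an assumption."""
    if action == "browser_navigate":
        parsed = urlparse(url or "")
        # Accept only absolute http(s) URLs so the target is unambiguous.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"browser_navigate needs an absolute http(s) URL, got {url!r}")
        params = {"url": url}
    elif action in NO_PARAM_ACTIONS:
        params = {}
    else:
        raise ValueError(f"not a navigation action: {action}")
    return json.dumps({"action": action, "params": params})


print(build_navigation_action("browser_navigate", "https://example.com/docs"))
print(build_navigation_action("browser_back"))
```

Rejecting relative or scheme-less URLs up front is a design choice of this sketch; the extension itself may be more permissive.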

Element Interaction

Action	Description	Parameters
`browser_click`	Click element	`ref`
`browser_type`	Append text to the existing input	`ref`, `text`
`browser_fill`	Clear the field, then enter text	`ref`, `text`
`browser_select`	Select dropdown option	`ref`, `value`
`browser_hover`	Hover over element	`ref`
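The difference between `browser_type` and `browser_fill` is easy to miss. The following toy model illustrates the semantics described in the table. `FakeInput` is a hypothetical stand-in for a text field, and the two functions only mimic the described behavior; they do not talk to a browser.

```python
from dataclasses import dataclass


@dataclass
class FakeInput:
    """Stand-in for a text field identified by a ref (not a real DOM element)."""
    ref: str
    value: str = ""


def browser_type(field: FakeInput, text: str) -> None:
    # Appends to whatever the field already contains.
    field.value += text


def browser_fill(field: FakeInput, text: str) -> None:
    # Clears the field first, then enters the text.
    field.value = text


search = FakeInput(ref="e12", value="zeus ")
browser_type(search, "browser")
print(search.value)  # zeus browser
browser_fill(search, "operator")
print(search.value)  # operator
```

In practice this suggests using `browser_fill` when a field may hold a default or stale value, and `browser_type` when adding to existing text.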

Page Actions

Action	Description	Parameters
`browser_scroll`	Scroll page	`direction`, `amount`
`browser_screenshot`	Take screenshot	`fullPage`
`browser_get_text`	Get element text	`ref`
`browser_snapshot`	Get page structure	-

Advanced Actions

Action	Description	Parameters
`browser_wait`	Wait for specified time	`seconds`
`browser_execute_js`	Execute JavaScript	`script`
`browser_handle_dialog`	Handle dialog	`accept`, `promptText`
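The action tables can be turned into a small lookup for catching misspelled actions or parameters before a call is sent. This is a sketch only: the parameter names are copied from the tables, but the tables do not say which parameters are required, so the hypothetical `check_action` helper flags only unknown actions and unexpected parameter names.

```python
# Parameter names copied from the action tables; the checker itself is illustrative.
ACTION_PARAMS = {
    "browser_navigate": ["url"],
    "browser_back": [],
    "browser_forward": [],
    "browser_refresh": [],
    "browser_click": ["ref"],
    "browser_type": ["ref", "text"],
    "browser_fill": ["ref", "text"],
    "browser_select": ["ref", "value"],
    "browser_hover": ["ref"],
    "browser_scroll": ["direction", "amount"],
    "browser_screenshot": ["fullPage"],
    "browser_get_text": ["ref"],
    "browser_snapshot": [],
    "browser_wait": ["seconds"],
    "browser_execute_js": ["script"],
    "browser_handle_dialog": ["accept", "promptText"],
}


def check_action(action: str, params: dict) -> list:
    """Return a list of problems; an empty list means the call matches the tables."""
    if action not in ACTION_PARAMS:
        return [f"unknown action: {action}"]
    allowed = ACTION_PARAMS[action]
    return [
        f"unexpected parameter for {action}: {name}"
        for name in params
        if name not in allowed
    ]


print(check_action("browser_handle_dialog", {"accept": True, "promptText": "yes"}))
print(check_action("browser_scroll", {"direction": "down", "pixels": 300}))
```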

Element References (ref)

Browser Operator uses element references (refs) to identify page elements.

Getting a ref

Use `browser_snapshot` to get the page structure, which returns an element list with refs. Each element includes a `ref` (unique identifier), a `tag` (HTML tag), `text` (the element text), and other fields.
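As an illustration of working with snapshot output, the sketch below assumes the element list arrives as a list of dictionaries with `ref`, `tag`, and `text` keys. The sample values (`e1`, `e2`, `e3`) and the `find_ref` helper are hypothetical; the real snapshot may carry more fields and a different ref format.

```python
from typing import Optional

# Hypothetical snapshot result, limited to the fields named in the text.
SAMPLE_SNAPSHOT = [
    {"ref": "e1", "tag": "input", "text": "Email"},
    {"ref": "e2", "tag": "input", "text": "Password"},
    {"ref": "e3", "tag": "button", "text": "Sign in"},
]


def find_ref(snapshot: list, tag: str, text: str) -> Optional[str]:
    """Return the ref of the first element with this tag whose text contains `text`."""
    needle = text.casefold()
    for element in snapshot:
        if element["tag"] == tag and needle in element["text"].casefold():
            return element["ref"]
    return None


print(find_ref(SAMPLE_SNAPSHOT, "button", "sign in"))     # e3
print(find_ref(SAMPLE_SNAPSHOT, "a", "Forgot password"))  # None
```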

Using a ref

After obtaining a ref, you can pass it to subsequent actions (such as `browser_click` or `browser_fill`) to precisely locate the target element.

Security Mechanisms

Permission Control

Per-action approval - Each action can be configured to require user confirmation
Domain restrictions - Restrict actions to specific domains only
Action logging - All actions are logged
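How the three controls above might combine can be sketched as a single decision function. Everything here is an assumption made for illustration: the `Policy` class, the `allow` / `ask_user` / `deny` outcomes, and the rule that subdomains of an allowed domain are also allowed. The extension's actual configuration model is not documented in this section.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class Policy:
    """Illustrative policy: domain allowlist, per-action approval, action log."""
    allowed_domains: set
    needs_approval: set
    log: list = field(default_factory=list)

    def domain_allowed(self, url: str) -> bool:
        host = urlparse(url).hostname or ""
        # Match the domain itself or any subdomain, never a mere suffix.
        return any(host == d or host.endswith("." + d) for d in self.allowed_domains)

    def decide(self, action: str, page_url: str) -> str:
        if not self.domain_allowed(page_url):
            decision = "deny"
        elif action in self.needs_approval:
            decision = "ask_user"
        else:
            decision = "allow"
        # Every decision is recorded, mirroring "all actions are logged".
        self.log.append((action, page_url, decision))
        return decision


policy = Policy(allowed_domains={"example.com"}, needs_approval={"browser_execute_js"})
print(policy.decide("browser_click", "https://shop.example.com/cart"))   # allow
print(policy.decide("browser_execute_js", "https://example.com/"))       # ask_user
print(policy.decide("browser_click", "https://evil-example.com/"))       # deny
```

Checking for `"." + domain` rather than a bare suffix keeps look-alike hosts such as `evil-example.com` from matching `example.com`.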

Sandbox Environment

The extension runs in a sandbox environment
Cannot access sensitive browser data (such as passwords or cookies)
Cannot access data from other extensions

Authentication Security

Uses JWT-based authentication
Tokens are refreshed periodically
Supports device revocation
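Periodic refresh usually keys off the standard `exp` claim in the JWT payload. The sketch below shows one way a client could decide when to refresh. The 60-second margin, the `device-1` subject, and the unsigned demo token are assumptions for illustration; the refresh schedule Zeus actually uses is not stated here, and decoding without signature verification is suitable only for reading the expiry, never for trusting the token.

```python
import base64
import json


def jwt_claims(token: str) -> dict:
    """Decode the payload segment of a JWT without verifying its signature."""
    payload_b64 = token.split(".")[1]
    # JWTs strip base64 padding; restore it before decoding.
    padded = payload_b64 + "=" * (-len(payload_b64) % 4)
    return json.loads(base64.urlsafe_b64decode(padded))


def needs_refresh(token: str, now: int, margin_seconds: int = 60) -> bool:
    """True when the token's `exp` claim falls within the refresh margin."""
    return jwt_claims(token)["exp"] - now <= margin_seconds


def make_unsigned_demo_token(claims: dict) -> str:
    # Demo helper only: real tokens are issued and signed by the server.
    def encode(obj: dict) -> str:
        raw = json.dumps(obj).encode()
        return base64.urlsafe_b64encode(raw).rstrip(b"=").decode()

    header = encode({"alg": "none", "typ": "JWT"})
    return f"{header}.{encode(claims)}."


now = 1_700_000_000
token = make_unsigned_demo_token({"sub": "device-1", "exp": now + 30})
print(needs_refresh(token, now))         # True: expires in 30 s, inside the margin
print(needs_refresh(token, now - 3600))  # False: more than an hour left
```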

Use Cases

Automated Form Filling

User: Help me fill out this registration form

Zeus: I'll use Browser Operator to fill out the form automatically.
1. First, get the page structure...
2. Locate input fields and fill in the information...
3. Click the submit button...
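The three steps in this exchange can be sketched as a planning function that turns a snapshot into an ordered list of actions. The snapshot shape (dictionaries with `ref`, `tag`, `text`), the `action`/`params` envelope, the sample refs, and the `plan_form_fill` helper are all assumptions for illustration; only the action names come from the tables.

```python
def _ref_for(snapshot: list, tag: str, text: str) -> str:
    """Find the ref of the element with this exact tag and text, or fail loudly."""
    for element in snapshot:
        if element["tag"] == tag and element["text"] == text:
            return element["ref"]
    raise LookupError(f"no {tag} with text {text!r} in snapshot")


def plan_form_fill(snapshot: list, values: dict, submit_text: str) -> list:
    """Turn a snapshot and a {label: value} mapping into an ordered action list."""
    plan = []
    for label, value in values.items():
        ref = _ref_for(snapshot, "input", label)
        # browser_fill clears the field first, so defaults do not leak through.
        plan.append({"action": "browser_fill", "params": {"ref": ref, "text": value}})
    submit_ref = _ref_for(snapshot, "button", submit_text)
    plan.append({"action": "browser_click", "params": {"ref": submit_ref}})
    return plan


# Hypothetical result of browser_snapshot on a registration page.
snapshot = [
    {"ref": "e1", "tag": "input", "text": "Name"},
    {"ref": "e2", "tag": "input", "text": "Email"},
    {"ref": "e3", "tag": "button", "text": "Register"},
]

for step in plan_form_fill(snapshot, {"Name": "Ada", "Email": "ada@example.com"}, "Register"):
    print(step)
```

Building the whole plan before executing anything means a missing field is reported before the form is half filled.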

Web Data Extraction

User: Extract all product information from this page

Zeus: I'll extract the page data.
1. Analyze the page structure...
2. Locate product elements...
3. Extract names, prices, descriptions...

Automated Testing

User: Test this login flow

Zeus: I'll execute a login test.
1. Navigate to the login page...
2. Enter test credentials...
3. Click login...
4. Verify login success...

Troubleshooting

Extension Shows “Not Connected”

Check if you’re logged in
Reload the extension (click the reload icon on its card at chrome://extensions)
Check if the WebSocket server is running
Check browser console for errors

Action Execution Failed

Ensure the page has fully loaded
Check if the element ref is valid
Try using `browser_snapshot` to refresh the element list
Check if a popup is blocking the action
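The "refresh the element list" advice can be automated by resolving the ref from a fresh snapshot before each attempt. The following is a sketch under stated assumptions: `StaleRefError` is a hypothetical error type, `take_snapshot` and `click` are caller-supplied callables standing in for `browser_snapshot` and `browser_click`, and how the extension actually reports an invalid ref is not documented here.

```python
class StaleRefError(Exception):
    """Hypothetical error for a ref that no longer matches an element."""


def click_with_refresh(take_snapshot, click, tag: str, text: str, attempts: int = 2) -> str:
    """Re-resolve the ref from a fresh snapshot before each click attempt."""
    last_error = LookupError("no attempts made")
    for _ in range(attempts):
        snapshot = take_snapshot()
        ref = next(
            (el["ref"] for el in snapshot if el["tag"] == tag and el["text"] == text),
            None,
        )
        if ref is None:
            last_error = LookupError(f"{tag} {text!r} not in snapshot")
            continue
        try:
            click(ref)
            return ref
        except StaleRefError as err:
            last_error = err
    raise last_error


# Simulated page: the first snapshot is outdated, the second reflects a re-render.
snapshots = iter([
    [{"ref": "e3", "tag": "button", "text": "Submit"}],
    [{"ref": "e9", "tag": "button", "text": "Submit"}],
])
live_refs = {"e9"}


def fake_click(ref: str) -> None:
    if ref not in live_refs:
        raise StaleRefError(ref)


print(click_with_refresh(lambda: next(snapshots), fake_click, "button", "Submit"))  # e9
```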

Blank Screenshot

Wait for page rendering to complete
Check for iframe content
Try setting the `fullPage` parameter to `false`

MCP Server

Browser Operator also runs as an MCP server and can be invoked by other MCP clients. All browser action tools (such as `browser_navigate` and `browser_click`) are exposed through the standard MCP protocol.
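MCP is built on JSON-RPC 2.0, and a client invokes a tool with the `tools/call` method, passing the tool `name` and its `arguments`. The sketch below builds such a request for a Browser Operator tool. How the server is launched and which transport it uses are not covered in this section, so the example stops at constructing the message; the `mcp_tool_call` helper and the example URL are illustrative.

```python
import itertools
import json

# JSON-RPC requests need unique ids so responses can be matched to them.
_request_ids = itertools.count(1)


def mcp_tool_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


request = mcp_tool_call("browser_navigate", {"url": "https://example.com"})
print(json.dumps(request, indent=2))
```

An MCP client library would normally build and send this message for you; the raw form is shown only to make the mapping from action tables to tool calls visible.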
