AI browser agent

The AI browser agent that returns a verdict

Kane CLI by TestMu AI is a browser agent that drives a real Chrome browser from plain-English goals. The path can vary, the verdict cannot: a pass needs real evidence. Run it locally, scale it on Browser Cloud. Free to install.

npm install -g @testmuai/kane-cli

or read the documentation

What an AI browser agent actually is

A browser agent drives a real web browser to reach a goal you describe in plain English, instead of you clicking through it. Kane CLI uses an LLM to read the rendered page and work out the path, the way a person would.

But AI is not deterministic, and TestMu AI does not pretend otherwise. The model decides how to reach an element, never whether it passed. A pass needs real evidence: DOM state, a URL change, network responses, or your own assertions.

That is the difference from agentic browsers that hand back a prose summary. Kane CLI returns a binary status, NDJSON evidence, and a replay link, then scales on Browser Cloud: real Chrome on demand, backed by enterprise infrastructure.

Kane CLI driving an AI browser agent flow in a real Chrome browser

One Browser Agent, Every Use Case

Run flows from any terminal

Pushed a fix? Run a checkout flow from your terminal before you open the PR. Real browser, pass or fail in under 2 minutes.

Multi-environment in one command

Same flow, different URLs. Run against staging and prod by changing the URL. No duplicate scripts to maintain.

Headless or visible

Watch exactly what the agent does in visible mode while debugging. Switch to headless for CI runs without changing a single line.

A browser agent you can put in CI

Plain-English goals, AI perception, and a verdict anchored to evidence.

Plain-English objectives

Describe the outcome, not the selectors. 'Log in, add an item to the cart, and confirm the total updates.' The agent resolves the path. No XPath, no Page Object Models, no framework boilerplate to maintain.

Real Chrome, not a synthetic DOM

The agent perceives and acts on a real Chrome instance over the Chrome DevTools Protocol, taking only actions a real user could take. No injected JavaScript to fake a click, so every verdict reflects how your app actually behaves.

AI perception, deterministic verdict

The LLM reads the rendered page like a user, but the verdict is not its opinion. A pass requires explicit evidence: DOM state, URL changes, network responses, console logs, or your assertions. The model never declares success.

Autoheal without losing the contract

Intent anchors to the user-facing element, so a reworded label or new CSS class does not break the run. The agent works through cookie banners and redirects, up to 50 steps per flow, while the pass condition stays exactly the same.

Evidence-typed NDJSON for agents

Run with --agent and every action, observation, and assertion becomes a typed JSON line, ending in a run_end verdict Claude Code, Cursor, or Codex can parse. The proof behind pass or fail is structured data, not prose to scrape.

Local or cloud, one syntax

Drive flows headed from your terminal, then run the same objective headless in CI or at scale on Browser Cloud. One natural-language syntax, one dashboard, no rewrite when a laptop flow becomes a parallel cross-browser run.

From a local run to production confidence

Start in your terminal

Start in your terminal

Validate on the cloud

Validate on the cloud

Release with confidence

Release with confidence

Run your browser agent at scale on Browser Cloud

When one Chrome on your laptop is not enough, Browser Cloud gives every agent real, full-featured Chrome on demand.

Real Chrome sessions on demand

Provision real, full-featured Chrome in the cloud and run many sessions in parallel, with no fleet to stand up or patch. The same cloud powers 1.5 billion tests a year for 18,000+ enterprises.

Built-in tunnel to private apps

A tunnel ships inside Browser Cloud, so a cloud agent can reach localhost, staging, and apps behind your VPN without exposing them to the public internet. No third-party tunnel to wire up.

Full session transparency

Every session records video, console logs, network logs, and step-by-step replay automatically. When an agent does something unexpected, you watch what it saw instead of guessing from a stack trace.

Stealth, persistence, and compliance

Best-effort stealth and persistent login state keep multi-step agent runs alive, backed by SOC 2 Type II, ISO 27001, HIPAA, and GDPR. Enterprise-grade where it counts, honest about what is best-effort.

Browser Cloud session dashboard showing parallel agent sessions with video replay and command, network, and console logs

Put a browser agent to work in three steps

1

Install Kane CLI

Run npm install -g @testmuai/kane-cli and sign in with your TestMu AI account. Nothing to wire up, no selectors to record, no framework to scaffold.

2

Point it at any URL

Aim it at a local dev server, a staging link, or production. Drive flows from the terminal, or hand them to Claude Code, Cursor, or Codex with the --agent flag. Scale to Browser Cloud when you need parallel sessions.

3

State the goal and the proof

Write the journey in plain English and the condition that makes it a pass. The agent reaches it in real Chrome, then grants a pass only when the evidence confirms that state, with a shareable replay.

Get Started With Kane CLI

🎉 Launch offer: Bonus credits for the first 3 months on paid plans

Choose the right plan for you

Local test authoring via CLI

Auto-heal & vision

View test cases on UI

Test Manager

Free

Free

$0

/month

200 Credits

Resets in every

30 days

Free tier
Starter

Starter

$19

/month

2000 Credits

Launch: 4,000 Credits (+100%)

Bonus for first 3 months

Free tier
Most Popular
Pro

Pro

$99

/month

10,000 Credits

Launch: 15,000 Credits (+50%)

Bonus for first 3 months

Complimentary License
Enterprise

Enterprise

Get access to solutions built on Enterprise-Grade Security, Privacy, and Compliances.

Need more credits?

Got a bigger use case in mind?

Let’s talk

Multiple seats

Get the technical rundown

Blog

A look at Kane CLI. What we built, what it does, and where it is headed.

Documentation

Everything you need to install, configure, and run Kane CLI in under 2 minutes.

GitHub

Browse the source, file issues, and follow the roadmap on GitHub.

Frequently asked questions

An AI browser agent is software that drives a real web browser to reach a goal you describe in plain English, instead of a human clicking through it. Kane CLI by TestMu AI uses an LLM to read the rendered page and resolve the path to your objective, then drives a real Chrome browser through it. You describe the outcome, for example "log in and confirm the dashboard loads," and the agent figures out the steps. The difference from a chatbot is that Kane CLI returns a binary pass or fail anchored to evidence, not a prose summary. For the framework-level view, see AI browser automation.

The honest position from TestMu AI: the LLM is not deterministic, but the validation contract is. The model decides how to reach an element, and that path can vary run to run, but it does not get to decide the test passed. A pass is granted only when the expected state is verified through explicit evidence: DOM state, stable selectors, accessibility labels, URL changes, network responses, screenshots, console logs, or your own assertions. In CI you key the pipeline off the evidence-backed exit code: 0 when the expected state is verified, 1 on a failed assertion, 2 on an error such as auth, and 3 on timeout. That is what makes a non-deterministic model safe to gate on.

Yes. Because the verdict is machine-readable, AI coding agents can close their own loop. Run Kane CLI with the --agent flag and it streams NDJSON, one typed event per line, ending in a run_end event carrying the verdict, an evidence summary, extracted values, and a dashboard link. Claude Code, Cursor, Codex, and Gemini CLI parse that stream to learn whether the UI they generated actually works, then fix the regression before you open the browser. Point your agent at the guide at testmuai.com/kane-cli/agents.md and it installs, authenticates, and verifies in a real browser on its own.

When one Chrome on your laptop is not enough, the same objective runs on Browser Cloud, the browser infrastructure built for AI agents. It provisions real, full-featured Chrome sessions on demand and runs many in parallel, with a built-in tunnel to reach localhost and staging, automatic session video and logs for debugging, and persistent login state across runs, all on the same cloud that powers 1.5 billion tests a year for 18,000+ enterprises. Start in your terminal, then scale out without a rewrite. The Browser Cloud documentation and the browser infrastructure for AI agents post cover the setup.

Yes. Beyond the --agent stream, TestMu AI exposes browser automation over the Model Context Protocol, so any MCP-compatible agent can navigate, interact, and read the page as structured tool calls inside its own task loop, with no codegen step. See the browser automation MCP server for the tool surface. Use the MCP server when the agent needs to act on the web in real time, and the CLI when you want a verified pass or fail to gate on.

Security comes from two design choices. First, the agent acts only on a real Chrome browser over the DevTools Protocol, limited to actions a real user could take, with no injected JavaScript that fakes UI state, so every run leaves an honest evidence trail you can audit. Second, when you scale on Browser Cloud the infrastructure carries SOC 2 Type II, ISO 27001, HIPAA, and GDPR compliance, with credentials supplied through environment variables and CI secrets rather than hardcoded. Treat stealth and CAPTCHA handling as best-effort, never a guarantee, and keep your access key out of source control.

Traditional browser automation anchors every action to a brittle selector or XPath, so a renamed label or a new CSS class breaks the run and a human fixes the script. A browser agent anchors to intent, the user-facing element like "the Submit button", and re-resolves it when the page shifts, autohealing cosmetic drift and pushing through unexpected modals up to 50 steps per flow. You write what you want, not the path to get there. Any completed run still exports to native Playwright, and you can scale framework suites on the Automation Cloud grid, so there is no lock-in.

Yes, Kane CLI is free to install and free to run against your own Chrome, so you can prove out the LLM-perception-plus-evidence workflow at zero cost. The Starter tier adds 100 credits with no credit card. Cloud runs on the TestMu AI grid are billed against your plan only when you scale to remote browsers, geo coverage, or parallel sessions. Start free, verify a real journey with shareable evidence in under five minutes, and pay only when you need scale.

Give your coding agent eyes in a real browser

Point your AI coding agent at the Kane CLI guide and it will install, authenticate, and run a browser agent with the --agent flag, then read the verified pass or fail on its own.

Point your agent to: testmuai.com/kane-cli/agents.md