AI browser automation

AI browser automation that returns a verdict not a summary

Kane CLI by TestMu AI uses an LLM to read and drive a real Chrome browser from plain-English objectives. The path can vary, but the verdict cannot: a pass needs explicit evidence. Free to install.

npm install -g @testmuai/kane-cli

or read the documentation

AI browser automation, anchored to evidence

Most browser automation breaks on selectors. Kane CLI takes plain-English objectives, uses an LLM for perception and reasoning, and resolves the path to your goal in a real Chrome browser. Describe the outcome, not the XPath.

But AI is not deterministic, and TestMu AI does not pretend otherwise. The model decides how to reach an element. It does not decide the test passed. A pass is granted only when the expected state is verified through explicit evidence.

That is the difference from agentic browsers that hand back a prose summary. Kane CLI returns a binary status, NDJSON evidence, and a persistent replay link. If two runs disagree, that is surfaced as a signal, never hidden.

Kane CLI running an AI browser automation flow in a real browser

What AI browser automation looks like with Kane CLI

Plain-English objectives, AI perception, and a verdict anchored to evidence.

Plain-English objectives

Describe the outcome, not the selectors. 'Log in, add an item to the cart, and confirm the order total updates.' The agent resolves the path. No XPath, no Page Object Models, no framework boilerplate to maintain.

AI perception with deterministic verdicts

The LLM reads the rendered page like a user, but the verdict is not its opinion. A pass requires explicit evidence: DOM state, URL changes, network responses, console logs, or your assertions. The model never decides the test passed.

Real Chrome, not a synthetic DOM

The LLM perceives and acts on a real Chrome instance over the DevTools Protocol, taking only actions a real user could take. No injected JavaScript to skip a modal or force a hidden click, so the evidence behind every verdict reflects how your app actually behaves.

Autoheal without losing the contract

Intent is anchored to the user-facing element, so a reworded label or new CSS class does not break the run. The model adapts its path through cookie banners and redirects, up to 50 steps per flow, while the pass condition it must verify stays exactly the same.

Evidence-typed NDJSON output

Run with --agent and every action, observation, and assertion becomes a typed JSON line, ending in a run_end verdict your coding agent or CI script can parse. The proof behind pass or fail is structured data, not prose to scrape or logs to grep.

Parallel verification by user and region

One objective that names multiple users or regions branches automatically into parallel child agents, each in its own browser session with its own evidence trail. Verify admin, editor, and viewer journeys from a single prompt with no shell loops.

Build up confidence before you ship

Start in your terminal

Start in your terminal

Validate on the cloud

Validate on the cloud

Release with confidence

Release with confidence

Built for AI agents and humans alike

Kane CLI and KaneAI share the same automation engine and dashboard.

Perception is the LLM's job, the verdict is not

AI reads the rendered page and figures out how to reach your goal, the part scripts are bad at. But it never gets to declare success. The pass is granted by the verification contract only when the expected state shows up in the evidence.

A non-deterministic path, a deterministic outcome

Run to run, the model may click a different way to reach the same element, and that is fine. What stays fixed is the condition being checked, so a green result always means the same thing was actually true in a real browser.

Disagreement is a signal, not a hidden flake

When two runs reach different verdicts on the same flow, Kane CLI surfaces it as drift or app flakiness rather than burying it behind a green check. Every verdict ships with a video, a step trace, and a replay link you can drop into a PR or bug report.

Automate any browser flow in three steps

1

Install Kane CLI

Run npm install -g @testmuai/kane-cli and sign in with your TestMu AI account. Nothing to wire up, no selectors to record, no framework to scaffold.

2

Point it at any URL

Aim it at a local dev server, a staging link, or production. Drive flows from the terminal yourself, or hand them to Claude Code, Cursor, or Codex with the --agent flag.

3

State the goal and the proof

Write the journey in plain English and the condition that makes it a pass. The LLM works out how to reach it in real Chrome, then grants a pass only when the evidence confirms that state, with a shareable replay.

Get Started With Kane CLI

🎉 Launch offer: Bonus credits for the first 3 months on paid plans

Choose the right plan for you

Local test authoring via CLI

Auto-heal & vision

View test cases on UI

Test Manager

Free

Free

$0

/month

200 Credits

Resets in every

30 days

Free tier
Starter

Starter

$19

/month

2000 Credits

Launch: 4,000 Credits (+100%)

Bonus for first 3 months

Free tier
Most Popular
Pro

Pro

$99

/month

10,000 Credits

Launch: 15,000 Credits (+50%)

Bonus for first 3 months

Complimentary License
Enterprise

Enterprise

Get access to solutions built on Enterprise-Grade Security, Privacy, and Compliances.

Need more credits?

Got a bigger use case in mind?

Let’s talk

Multiple seats

Get the technical rundown

Blog

A look at Kane CLI. What we built, what it does, and where it is headed.

Documentation

Everything you need to install, configure, and run Kane CLI in under 2 minutes.

GitHub

Browse the source, file issues, and follow the roadmap on GitHub.

Frequently asked questions

AI browser automation drives a real browser from plain-English objectives instead of hand-written selectors and scripts. Kane CLI by TestMu AI uses an LLM for perception and reasoning, so it reads the rendered page the way a person does, then resolves the path to your goal. You describe the outcome, for example "log in and confirm the dashboard loads," and the agent drives a real Chrome browser through it. No XPath, no Page Object Models, no framework boilerplate. The difference from a chatbot is that Kane CLI returns a binary pass or fail anchored to evidence, not a prose summary.

The honest position from TestMu AI: the LLM is not deterministic, but the validation contract is. The model decides how to reach an element, run to run that path can vary, but it does not get to decide the test passed. A pass is granted only when the expected state is verified through explicit evidence: DOM state, stable selectors, accessibility labels, URL changes, network responses, screenshots, console logs, or your own assertions. With a stable app, data, and environment, the outcome stays consistent. If two runs disagree, that is surfaced as a signal of app flakiness or drift, never hidden behind a green check.

Yes. Because the verification contract is machine-readable, AI coding agents can close their own loop. Run Kane CLI with the --agent flag and it streams NDJSON, one typed event per line, ending in a run_end event carrying the verdict, evidence summary, extracted values, and a dashboard link. Claude Code, Cursor, Codex, and Gemini CLI parse that stream to learn whether their generated UI actually works, then fix the regression before you ever open the browser. Point your agent at the guide at testmuai.com/kane-cli/agents.md and it installs, authenticates, and verifies in a real browser on its own.

Yes, and the deterministic verdict is what makes a non-deterministic model safe to gate on. Authenticate with your TestMu AI credentials, pass --headless and --timeout, and key the pipeline off the evidence-backed exit code: 0 when the expected state is verified, 1 on a failed assertion, 2 on an error such as auth or a Chrome crash, and 3 on timeout. The same binary that runs headed on your laptop runs headless on GitHub Actions, GitLab CI, Jenkins, or Bitbucket. Every run uploads to Test Manager with its evidence trail, so an unverified flow never reaches production.

Yes. Any completed run exports to native Python Playwright code with one flag. The exported code uses the testmu NPM library, which preserves the original plain-English step alongside the generated script, so autoheal resilience survives the export. You can also import existing Playwright or Selenium scripts and convert them into intent-driven flows. Two-way migration means no lock-in and no throwing away the suite you already have.

Yes, the CLI is free to install and free to run against your own Chrome, so you can prove out the LLM-perception-plus-evidence workflow at zero cost. The Starter tier from TestMu AI adds 100 credits with no credit card. Cloud runs on the TestMu AI grid are billed against your plan only when you scale to remote browsers, geo coverage, or MultiOS. Start free, verify a real journey with shareable evidence in under five minutes, and pay only when you need scale.

Give your coding agent eyes in a real browser

Point your AI coding agent at the Kane CLI guide and it will install, authenticate, and run plain-English browser automation with the --agent flag, then read the verified pass or fail on its own.

Point your agent to: testmuai.com/kane-cli/agents.md