Maintenance & Control for your Playwright test automation

You got Playwright setup. You probably even got Claude or Gemini to help automate a few dozen tests.

You look around and think: Now what? Is this enough coverage? Who’s going to maintain these? Not me, right?

Introducing Amikoo. Not another no-code/low-code tool, and certainly not another “bring everything to us” test automation platform. Amikoo plugs directly into your dev stack to solve the two biggest remaining challenges in testing:

- Ongoing maintenance & repair of complex tests (Beta available now)

- Ongoing monitoring and analysis of test coverage (Coming soon!)

The first 100 to sign up gain free access to
Amikoo Playwright MCP

Without Amikoo
With Amikoo

Playwright & LLMs are a strong start for test automation, but leave critical gaps unresolved

With Playwright

  • Easily create new test cases
  • Control browser to map out test steps
  • Log test executions

With an LLM like Claude, you can

  • Suggest new test cases to Playwright (with in-depth prompt engineering)
  • Enable some NLP test case creation

Gaps & challenges remaining

Maintenance

  • Flaky tests that reduce confidence in CI
  • Wasted time with high test maintenance and frequent breakage

Visibility & Control

  • Unsure what to test, why, and how much
  • Poor visibility into what is tested, what is missing, and where risk exists
  • Lag between bug reporting and fix
Captura de pantalla 2026-02-18 124914-1

MuukTest’s Maintenance & Repair MCP and QA Control Hubcomplete existing stacks for lasting coverage

With Playwright

  • Easily create new test cases
  • Control browser to map out test steps
  • Log test executions

With an LLM like Claude, you can

  • Suggest new test cases to Playwright (with in-depth prompt engineering)
  • Enable some NLP test case creation

With Amikoo

  • Detect broken & outdated tests 24/7
  • Suggests fixes, repairs, re-writes and new  tests as product, usage changes
  • Translates these needs to your LLMs without need for prompt engineering
  • Define, score, monitor test coverage and highlight areas not covered
  • Report bugs & potential fixes in real time
  • Managed infrastructure for limitless parallel test runs at scale
Captura de pantalla 2026-02-18 125426

How Amikoo Approaches Maintenance Differently

Amikoo MCP was designed to reduce the cognitive burden placed on LLMs during test repair. Instead of relying entirely on prompt engineering and tool selection inside a single model context, Amikoo separates orchestration from execution.

Playwright Healer Agent

Amikoo Maintenance MCP

Multiple prompts and tools available to the LLM
 Single maintenance tool (call_repair_agent
Tool selection and repair strategy determined dynamically by the LLM
Structured repair flow:
read → invoke → repair → re-run → validate 
Full repair reasoning handled within the user’s LLM context
Repair execution handled by hosted maintenance agents
No dedicated maintenance agent layer
LLM responsibility minimized to orchestration

Amikoo Maintenance MCP vs Playwright Healer Agent

 

To benchmark real-world performance, we ran the same broken Playwright test suite through both systems and measured how effectively each could detect failures and repair the tests without human intervention.
 
Compared to the native Playwright Healer Agent, Amikoo completed repair tasks nearly 4x faster and delivered significantly higher repair success.

Amikoo Maintenance MCP

Playwright Healer Agent

  • Avg. Time per Repair / 100 Repairs

    5 min / ~8.3 hours 25 min / ~41.6 hours
  • Average Steps

    31.8 31.8
  • Repair Success Rate

    91% 49%
 
Playwright’s healer relies entirely on the user’s LLM for reasoning and repair. Every failure is solved in isolation, within a growing context window.
 

Amikoo’s maintenance agents operate on an optimized metadata layer refined through years of real-world automation work and the execution and repair of hundreds of thousands of tests. That historical intelligence allows Amikoo to reason about failures more deterministically. Resulting in lower token usage, structured repair flow, and more consistent outcomes at scale.

 

Join the early access list!