What is agent-qa

Understand agent-qa, then jump into the core docs for tests, suites, hooks, platforms, and agent workflows.

Overview

agent-qa is an open-source AI end-to-end testing tool for web and mobile apps. Tests can be written in natural language, with no hardcoded CSS or XPath selectors. Tests are self-healing, and with every run, the agent continuously learns and creates contextual memory like a QA engineer.

Quickstart

Install agent-qa, initialize the workspace, validate the generated files, and run the first local QA workflow.

Explore demos

Open curated agent-qa runs in the interactive demo viewer and inspect how real test steps execute.

Configuration

Review the file-backed configuration system for global settings, tests, suites, hooks, variables, secrets, devices, rules, and auth.

Memory

Review how file-backed product, suite, and test observations make future runs more product-aware.

Caching

Reuse cached action plans for similar subsequent runs, with landing-page examples showing 5x faster execution and 3x fewer planner tokens.

Tests

Write one plain-English YAML journey with stable intent, reviewable steps, and source-controlled expectations.

Web Testing

Run browser flows against local or staged web apps without hardcoded selectors or fragile DOM coupling.

Auth state

Capture a named web login once, reuse it by logical name, pass the active state to hooks, and keep credential material out of artifacts.

Suites

Bundle related tests, targets, configuration, and hooks into repeatable workflows for a feature area.

Mobile Testing

Use the same natural-language workflow model for mobile app journeys while preserving artifacts locally.

Hooks

Prepare data, call project scripts, verify side effects, and clean up state before or after a run.

How agent-qa works

Write tests in natural language

Describe the user journey in plain English: what to open, what to do, and what to assert. agent-qa turns that intent into repeatable execution.

Bring your own LLM

Use any hosted or local model. It also works with Codex and Claude Code subscriptions.

Run tests

Run tests from either the dashboard or the CLI. Group multiple tests into a suite to execute user journeys end-to-end.

Generate memory and context

Each run passes through a memory curator that records product behavior, test intent, and suite context, then creates contextual memory for future test runs.

Improve the next run

Future executions use the evolved agent context. That memory helps the agent adapt to product changes at runtime, recover from brittle UI drift, and become more reliable over time.

Grow a product-aware QA agent

The end state is a QA agent with enough context about your product to author tests, run them, inspect failures, and keep improving the testing loop.