Curator

Learn how the memory curator turns run evidence into added, confirmed, deprecated, deleted, or unchanged observations.

The curator runs after test execution and decides whether the run produced useful behavioral memory. It is selective: memory should capture product behavior that helps future runs, not generic testing tactics or obvious page trivia.

A.U.D.N. decisions

The curator asks for one of four A.U.D.N. (add, update, deprecate, noop) decisions:

add: write a new behavioral observation.
update: confirm an existing observation that was relevant and correct.
deprecate: penalize an existing observation that the run contradicted.
noop: leave memory unchanged.

The implementation records update decisions as confirmation deltas in the memory log. A noop decision writes no files.
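
As a rough illustration, the four decisions could be dispatched as in the Python sketch below. The names `Decision`, `MemoryLog`, and `apply_decision` are hypothetical and not part of agent-qa. Only the decision names, the recording of update as a confirmation delta, and the rule that noop writes nothing come from the text above; how add and deprecate are logged here is an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum


class Decision(Enum):
    ADD = "add"
    UPDATE = "update"
    DEPRECATE = "deprecate"
    NOOP = "noop"


@dataclass
class MemoryLog:
    """Hypothetical in-memory stand-in for the memory log."""
    entries: list = field(default_factory=list)


def apply_decision(log: MemoryLog, decision: Decision, observation_id: str) -> bool:
    """Record one decision and return True if anything was written."""
    if decision is Decision.NOOP:
        # noop leaves memory unchanged, so nothing is written.
        return False
    if decision is Decision.UPDATE:
        # update is recorded as a confirmation delta.
        kind = "confirmation_delta"
    else:
        # Assumption: add and deprecate are logged under their own names.
        kind = decision.value
    log.entries.append({"kind": kind, "observation": observation_id})
    return True
```

In this sketch the return value lets a caller skip any file write when the decision is noop.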

What gets added

New observations start with trust 0.5.

new observation trust = 0.5
confirmed_count = 0
contradicted_count = 0
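
These starting values could be modeled as dataclass defaults. This is a minimal sketch: the `Observation` class and its `text` field are hypothetical, and only the three default values are taken from the text above.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    text: str                    # hypothetical field holding the observation itself
    trust: float = 0.5           # new observations start at 0.5
    confirmed_count: int = 0     # no confirmations yet
    contradicted_count: int = 0  # no contradictions yet


obs = Observation(text="Saving a draft keeps the editor open")
```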

The curator chooses a scope:

product scope for structural behavior that helps future tests across the product
suite scope for behavior tied to a suite sequence or position
test scope for behavior specific to one test

Suite observations include the suite position and suite snapshot so they can be matched safely later.
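
One way to picture the scope rule is a record that refuses a suite-scoped observation without its suite context. This is a sketch under assumptions: `Scope` and `ScopedObservation` are hypothetical names, and the types chosen for the suite position (an integer) and the suite snapshot (a string) are guesses, since the text above does not specify them.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Scope(Enum):
    PRODUCT = "product"  # structural behavior useful across the product
    SUITE = "suite"      # behavior tied to a suite sequence or position
    TEST = "test"        # behavior specific to one test


@dataclass
class ScopedObservation:
    text: str
    scope: Scope
    suite_position: Optional[int] = None  # assumed type
    suite_snapshot: Optional[str] = None  # assumed type

    def __post_init__(self) -> None:
        # Suite observations must carry both fields so they can be matched later.
        if self.scope is Scope.SUITE and (
            self.suite_position is None or self.suite_snapshot is None
        ):
            raise ValueError("suite observations need a suite position and a suite snapshot")
```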

Confirmation and deprecation

When the curator confirms an observation, trust increases by trustConfirmDelta, last_confirmed is updated, and confirmed_count increases by one.

When the curator deprecates an observation, trust decreases by trustContradictDelta and contradicted_count increases by one.

If trust reaches zero, agent-qa deletes the observation file instead of keeping a zero-trust memory entry.
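
The trust arithmetic above can be sketched as follows. The `TrustState` class, the function names, and both delta values are hypothetical, since the real values of trustConfirmDelta and trustContradictDelta are not given here. Clamping trust to the range 0 to 1 is also an assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional

TRUST_CONFIRM_DELTA = 0.125    # hypothetical value for trustConfirmDelta
TRUST_CONTRADICT_DELTA = 0.25  # hypothetical value for trustContradictDelta


@dataclass
class TrustState:
    trust: float = 0.5
    confirmed_count: int = 0
    contradicted_count: int = 0
    last_confirmed: Optional[datetime] = None


def confirm(state: TrustState, delta: float = TRUST_CONFIRM_DELTA) -> None:
    """Raise trust, stamp last_confirmed, and count the confirmation."""
    state.trust = min(1.0, state.trust + delta)  # upper cap is an assumption
    state.last_confirmed = datetime.now(timezone.utc)
    state.confirmed_count += 1


def deprecate(state: TrustState, delta: float = TRUST_CONTRADICT_DELTA) -> bool:
    """Lower trust and count the contradiction.

    Returns True when trust has reached zero, meaning the caller
    should delete the observation file rather than keep it.
    """
    state.trust = max(0.0, state.trust - delta)
    state.contradicted_count += 1
    return state.trust <= 0.0
```

With these illustrative deltas, a fresh observation that is contradicted twice in a row reaches zero trust and is flagged for deletion.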
