REVIEW-auth-refresh
review packet / example
AC-001 pass
Evidence: `npm test auth-refresh` pasted, exit 0.
AC-002 unverified
Evidence missing. Human attention required before merge.
Don't mark it done without evidence.
Structured agent work, checked at every step.
Define the work, run agents, verify outputs, preserve evidence.
loop preview
# current run
$ corpus review
PASS AC-001 — output pasted
UNVERIFIED AC-002 — manual resize pending
HUMAN ATTENTION — retry.ts changed outside scope
review packet / example
AC-001 pass
Evidence: `npm test auth-refresh` pasted, exit 0.
AC-002 unverified
Evidence missing. Human attention required before merge.
Don't mark it done without evidence.
Agents can produce a plausible diff faster than a team can verify it. corpus makes the handoff, scope, evidence, and memory visible before the change ships.
INTAKE
A ticket enters as prose. corpus preserves it, then turns it into requirements.
SCOPE
A task packet names what to change, what not to touch, and how to verify.
EVIDENCE
A Pass needs pasted output, a CI link, or a named manual observation.
LEDGER
Lessons move from a run summary into findings so the next task can load them.
Pull, Spec, Task, Run, Review, Close. Each point creates or checks an artifact the next point can read.
workflow / six steps
The mark maps to the workflow: Pull, Spec, Task, Run, Review, Close. Each pass leaves a file the next step can use.
Capture the ticket and intent in an intake file.
Write requirements, each with a verification method.
Hand the agent a bounded packet: scope, do-not-change, verify.
The agent implements and pastes real evidence.
Check evidence per requirement; human attention where needed.
Merge, save findings, update the board.
Close feeds the next Pull — the loop closes
Specs and reviews are working records. The paper treatment shows up where commands, notes, and evidence live.
spec example / acceptance criterion
AC-003 — Expired refresh token redirects to login
The client must clear local session state and route the user to `/login`.
Verify with: auth-refresh-expired.test
Write the contract once. Give the agent a task packet, not a drifting chat thread.
The review packet shows evidence per requirement and routes the exceptions to a human.
One task, one branch, one place to inspect what changed before it joins the main line.
Every rule says whether it is convention, checklist, toolable, or actually enforced.
No. Your coding tool writes the code. corpus structures the files around it: specs, tasks, reviews, findings, and the status board.
No. Agents can run checks and paste evidence; a human or independent reviewer owns the result.
No. corpus works as plain markdown. The optional CLI scaffolds, checks, isolates worktrees, and reconciles facts; it does not become a model loop.
The six points map to the six-step loop: Pull, Spec, Task, Run, Review, Close. It is a diagram before it is a mark.
Copy the kit, write the contract, hand the agent a bounded task, and review the evidence. The loop scales because the first pass is small.