Skip to content

Warm recordings, file-journey-bug tasks, and run-recorded-mcp#40

Open
bhackett1024 wants to merge 5 commits into
mainfrom
warm-recordings
Open

Warm recordings, file-journey-bug tasks, and run-recorded-mcp#40
bhackett1024 wants to merge 5 commits into
mainfrom
warm-recordings

Conversation

@bhackett1024

Copy link
Copy Markdown
Contributor

Summary

Adds warm-recording tracking to QA tasks, splits FAST MODE journey bug-filing into a second-stage task, and generalizes the polish setup harness.

Warm recordings

  • tasks now carry recording_id, recording_warmed_at, and no_retry.
  • When a task that operates on a recording (a polish pass task, or a file-journey-bug task) is scheduled, warm-recording-background drives RecordingOverview and waits up to 20 min. On success it stamps recording_warmed_at on the recording's queued tasks; on timeout/error it fails them (non-retryable) and records a Replay MCP error so the failure surfaces in admin activity.
  • claimNextTask prioritizes dispatch by warm readiness: (1) warmed-recording tasks, (2) no-recording tasks, (3) not-yet-warmed-recording tasks (last).

FAST MODE journeys report a bug signal; a second task files it

  • A fast-mode stepped journey's runner no longer files bugs. When it sees one it reports bug_seen: true plus a bug_data blob (description of what it encountered + browser history) in its setup_output.
  • saveTestRunSetupResult then schedules one file-journey-bug task, associated with the journey's recording and linked from the run's task page (test_run.file_journey_bug_task_id), which files the bug from the recording.

run-recorded-mcp

  • Generalized the former scripts/polish/run-recorded.ts into scripts/run-recorded-mcp.ts with two modes (polish, file-journey-bug) — both operate on a recording via the Replay MCP with no browser, mirroring run-recorded-browse.ts.

Journey eval ordering

  • container-events defers a fast-mode journey run's finalization — and a journey eval's result callback — until the file-journey-bug task completes, so the bug is filed before the run is finalized / the eval verdict is reported.

Guidance

  • Seeded a file-journey-bug-script stub and updated journey-run-script guidance context for the new report-a-signal behavior. Real script content is authored in the prod DB via the guidance API.

🤖 Generated with Claude Code

…-mcp

- tasks now carry recording_id + recording_warmed_at (+ no_retry). A recording-
  operating task (polish / file-journey-bug) fires warm-recording-background,
  which drives RecordingOverview up to 20 min: success stamps warmed_at on the
  recording's queued tasks; timeout/error fails them (non-retryable) and records
  a Replay MCP error for admin activity.
- claimNextTask prioritizes dispatch by warm readiness: warmed recording first,
  no recording next, not-yet-warmed recording last. Used by polish tasks.
- FAST MODE journeys no longer file bugs: their runner reports bug_seen + a
  bug_data blob into setup_output; saveTestRunSetupResult then schedules a
  file-journey-bug task (associated with the journey's recording, linked from the
  run's task page) that files the bug from the recording with no browser.
- Generalized the polish setup harness into scripts/run-recorded-mcp.ts with two
  modes (polish, file-journey-bug); both operate on a recording via the Replay
  MCP without a browser. Removed scripts/polish/run-recorded.ts.
- container-events defers a fast-mode journey run's finalization — and a journey
  eval's result callback — until the file-journey-bug task completes, so the bug
  is filed before the run finalizes / the eval verdict is reported.
- Seeded file-journey-bug-script guidance stub; updated journey-run-script
  guidance context for the new FAST MODE report-a-signal behavior.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@replay-qa

replay-qa Bot commented Jul 4, 2026

Copy link
Copy Markdown

Replay QA

✅ QA run passed for bffd0e75575d (warm-recordings) — no bugs found.

Bugs found Journeys added Journeys updated Journeys removed
0 0 0 0

Full report in Replay QA →

@github-actions

github-actions Bot commented Jul 4, 2026

Copy link
Copy Markdown

🔎 Loop QA preview (all-k8s backend): https://pr-40--loop-qa-j63k5x.netlify.app

App Builder and others added 3 commits July 4, 2026 03:48
…ecture

Add docs/backend/warm-recordings.md (the dedicated architecture doc, alongside
run-recorded-browse.md) covering warming, run-recorded-mcp's two modes, the
file-journey-bug task, and journey-eval second-task ordering. Cross-reference it
from tasks-and-containers.md, journeys.md, admin-run-journey-eval.md,
test-runs.md, and run-recorded-browse.md; fix the stale journey harness path.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant