Conversation
…xperiment (kaizen qwibitai#322) - --test-task flag: synthetic fast task that creates a trivial PR instead of running /make-a-dent. Completes in <2 min for pipeline iteration. - --experiment flag: extra diagnostics — main HEAD before/after pull, per-PR merge status tracking, auto-merge queue visibility. - checkMergeStatus(): new exported function that checks PR state via gh CLI, returns merged/auto_queued/open/closed/unknown. - buildPrompt() now exported and supports test_task mode. - BatchState extended with test_task and experiment optional fields. - 13 new tests (42 total, all passing). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…-overnight-testable
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
--test-taskflag: synthetic fast task that creates a trivial PR instead of running/make-a-dent. Completes in <2 min for pipeline iteration.--experimentflag: extra diagnostics — main HEAD before/after pull, per-PR merge status tracking, auto-merge queue visibility.checkMergeStatus(): new exported function that checks PR state viaghCLI, returnsmerged/auto_queued/open/closed/unknown.buildPrompt()now exported and supportstest_taskmode.BatchStateextended withtest_taskandexperimentoptional fields.Context
This is Step 0 of the overnight-dent merge pipeline investigation (PRD qwibitai#321). The overnight-dent batch runner creates PRs but doesn't merge them. Before fixing the pipeline, we need fast iteration tools to test hypotheses about why merges fail.
Test plan
buildPromptwithtest_task=trueproduces synthetic prompt without/make-a-dentbuildPromptwithtest_task=falsestill uses/make-a-dentcheckMergeStatusreturns correct states for merged/queued/open/error cases--test-taskand--experimentflags parsed in trampoline, passed to state.json./scripts/overnight-dent.sh --test-task --max-runs 1 "probe"(deferred to Step 1)Fixes Garsson-io/kaizen#322
🤖 Generated with Claude Code