loop: graduated guardrails — identical-call and consecutive-mistake detectors (closes #344) by erain · Pull Request #352 · erain/glue

erain · 2026-06-09T23:07:01Z

Closes #344. Roadmap P2.8.

Two graduated detectors (RunRequest.Guardrails, on by default, Disabled opt-out — same shape as RetryPolicy):

Detector	Streak	At 3	At 5/6
Identical call (name + canonicalized args hash, single-call rounds)	resets on any arg change	inject corrective user message	`ErrRepeatedToolCalls` (typed) at 5
All-error tool rounds	resets on any success	inject corrective user message	`ErrTooManyMistakes` (typed) at 6

Injected messages marked glue/guardrail in metadata; EventGuardrail carries kind/count/action (nudge|halt) for UIs and the goal loop.
Policy: nudge-first-halt-second (Gemini CLI's graduated response), Cline's thresholds.
Deliberately omitted: the "no tool used" reprompt — glue turns legitimately end without tools; the narrated-stall case is providers/gemini + loop: next-speaker check and stall recovery for Gemini #343's AutoContinue. Context-usage-in-env-details deferred to providers: per-model capability registry + tool-owned prompt snippets #345 (needs capability/registry plumbing).

Tests: nudge-then-halt for both detectors (call counts, event actions, typed errors), reset-on-different-args, reset-on-success, disabled passthrough, nudge message shape. One existing test (default-max-turns) now disables guardrails since its scripted 50 identical calls are exactly what the detector halts. Full suite + vet green.

🤖 Generated with Claude Code

…etectors (closes #344) Two detectors watch every tool round, on by default with opt-out via RunRequest.Guardrails (zero value = defaults, mirroring RetryPolicy): - Identical-call: a single-call round whose name+canonicalized-args hash matches the previous round's extends a streak. At 3 the loop injects a corrective user message ("it will keep producing the same result — change arguments or approach"); at 5 the run ends with the typed ErrRepeatedToolCalls. Any change of arguments resets. - Consecutive mistakes: rounds where every tool result IsError. At 3 a corrective message ("re-read the error messages before acting"); at 6 the run ends with ErrTooManyMistakes. Any success resets. Injected messages are marked glue/guardrail in metadata; EventGuardrail reports kind/count/action so UIs and the goal loop can surface them. Graduated nudge-then-halt policy per Gemini CLI, with Cline's thresholds (docs/coding-harness-roadmap.md P2.8). The "no tool used" reprompt from the roadmap is deliberately omitted: glue turns legitimately end without tool calls, and the narrated-stall case is covered by AutoContinue (#343). TestRunMaxTurnsDefaultIs32 now disables guardrails — it scripts 50 identical calls to test the turn budget, which the repeat detector would correctly halt first. Co-Authored-By: Claude Fable 5 <[email protected]>

github-actions · 2026-06-09T23:08:07Z

glue-review

No concerns — LGTM.

New guardrails for repeated tool calls and consecutive mistakes are well-implemented with appropriate graduated nudges and halt limits, backed by a comprehensive test suite.

🤖 Posted by glue-review.

erain merged commit cc3f98c into main Jun 9, 2026
5 checks passed

erain deleted the issue/344-loop-guardrails branch June 9, 2026 23:09

This was referenced Jun 9, 2026

loop: mistake and loop guardrails — consecutive-mistake counter, identical-call detector, no-tool reprompt #344

Closed

Project Tracker: Peggy — personal-assistant agent on glue #110

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

loop: graduated guardrails — identical-call and consecutive-mistake detectors (closes #344)#352

loop: graduated guardrails — identical-call and consecutive-mistake detectors (closes #344)#352
erain merged 1 commit into
mainfrom
issue/344-loop-guardrails

erain commented Jun 9, 2026

Uh oh!

github-actions Bot commented Jun 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

erain commented Jun 9, 2026

Uh oh!

github-actions Bot commented Jun 9, 2026

glue-review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant