Add recon-pass delta log for scaling demo spec verification#13
Add recon-pass delta log for scaling demo spec verification#13lukefwalton wants to merge 1 commit into
Conversation
Records the verification pass against live main (60b727f): NEXT-STEPS.md is present (dependency satisfied), the gold schema is expectAnswerMode not must-* tags, the keyless headline needs committed query vectors, the int8 path reuses retrieve/no-leak/store without forking, and README fix 2.1 (the "eight lines" claim) has not landed. Reconnaissance only; no demo code, corpus, harness, or NEXT-STEPS edits. https://claude.ai/code/session_01CQQe5VjjDgpCj7hYcoVv8Y
Automated Checks (advisory, non-blocking)✅ All checks passed. Standards ComplianceThis looks docs-only ( That said, this doc now appears to make normative claims about boundary/eval semantics, so the main standards risk is documentation drift rather than runtime behavior. In particular, the rows called out in the PR description around:
are worth spot-checking against the live code/tests before merge, because future work could incorrectly implement the doc instead of the code if any of those assertions are off. That’s especially relevant to the standards on mode semantics staying aligned with gold eval and not loosening grounding. SummaryThis PR adds a recon/delta section to the scaling demo documentation, reconciling spec assumptions with the repository state at commit Reviewer: most of the risk is in whether R4/R5/R8/R9 accurately describe current code and eval behavior — the rest looks like documentation bookkeeping. What to pay attention to
Things I noticed🟡 Yellow flags — consider for this PR or a follow-up:
Good patterns
Suggested improvements
Questions for the author
Surmado Code Review (v1.2-mt) is an automated review, designed to work alongside human judgment. Want to change your STANDARDS.md or YML? Edit it directly, or tune it with our AI agent Scout. Comment |
Summary
Documents the reconnaissance pass verification of the scaling demo specification against the live repository state at commit
60b727f. This delta log reconciles the spec's assumptions with actual implementation details discovered during pre-build analysis.Changes
docs/scaling-demo/scaling-demo-delta-log.mdwith 9 verification rows (R1–R9)NEXT-STEPS.mddependency status and linking requirementsmust-*modes)production-scaling.mdlocation and cross-linkingNotable Details
judgeRetrievalon committed vectors) and optional key-gated answer-mode tierretrieve,assembleEvidence,judgeRetrieval/judgeAnswer,answerQuestion) are reusable without forkingforbidSources(near-floor source) is the mechanism for keyless refuse-case verdictsINDEX_SCHEMA_VERSION = 2https://claude.ai/code/session_01CQQe5VjjDgpCj7hYcoVv8Y