Skip to content

test: cover role extractor helper fallbacks#535

Open
seonghobae wants to merge 6 commits into
developfrom
codex/clean-role-extractor-helper-tests
Open

test: cover role extractor helper fallbacks#535
seonghobae wants to merge 6 commits into
developfrom
codex/clean-role-extractor-helper-tests

Conversation

@seonghobae

Copy link
Copy Markdown
Collaborator

Summary

  • adds focused RoleExtractor helper coverage for chord tie behavior
  • asserts fallback extraction notes when no audio features are provided
  • replaces the useful test-only slice of 🧪 Add unit tests for RoleExtractor #385 without unrelated desktop, design-system, workflow, or backup-file changes

Verification

  • uv run pytest tests/test_roles.py
  • uv run ruff check tests/test_roles.py
  • uv run pytest
  • python3 scripts/checks/security_gates.py
  • python3 scripts/checks/verify_supply_chain.py
  • python3 scripts/checks/verify_security_notes.py
  • git diff --check

Note: the full service test suite passed with 435 tests and the existing 3 librosa warnings.

Supersedes #385.

Copilot AI review requested due to automatic review settings July 2, 2026 09:16

Copilot AI left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Note

Copilot couldn't run its full agentic review because no GitHub Actions runner was available. Make sure your repository has a runner available to run Copilot's review, or add a copilot-setup-steps.yml file specifying one with the runs-on attribute. See the docs for more details.

Adds targeted test coverage for role extraction fallbacks and chord tie-breaking behavior in the analysis engine.

Changes:

  • Adds unit tests for _most_common_chord tie-breaking (prefers first chord encountered on ties).
  • Adds a test asserting RoleExtractor.extract() sets extraction_notes when called without audio features.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread services/analysis-engine/tests/test_roles.py Outdated
Comment thread services/analysis-engine/tests/test_roles.py Outdated
Comment thread services/analysis-engine/tests/test_roles.py Outdated
@seonghobae

Copy link
Copy Markdown
Collaborator Author

Addressed the role extractor test review feedback in 92ba08d.

Changes:

  • removed the direct import/test of private _most_common_chord
  • covered the same chord tie-break behavior through the public RoleExtractor.extract() API with mocked ChordRecognizer/PitchTracker dependencies
  • relaxed the extraction-notes assertion to the stable computed handoffs behavior signal instead of an exact full copy string

Verification:

  • uv run pytest tests/test_roles.py - 9 passed
  • uv run ruff check src tests/test_roles.py - passed
  • uv run pytest - 435 passed
  • uv run ruff check src tests - passed
  • python3 scripts/checks/security_gates.py - passed
  • python3 scripts/checks/verify_supply_chain.py - passed
  • python3 scripts/checks/verify_security_notes.py - passed
  • git diff --check - passed

Note: the prior security-audit failure is the shared Rust quick-xml advisory path; #525 now carries the repo-controlled upstream-owned exception update for that queue-wide gate.

@seonghobae seonghobae enabled auto-merge (squash) July 2, 2026 10:50
@opencode-agent

opencode-agent Bot commented Jul 2, 2026

Copy link
Copy Markdown

OpenCode Review Overview

  • Head SHA: dbab09b14d7ccbb3f4e7263a4fdc5ecf26d20b57
  • Workflow run: 28623343704
  • Workflow attempt: 1
  • Gate result: APPROVE (approval step)

Pull request overview

OpenCode reviewed the current-head bounded evidence and found no blocking issues.

Findings

No blocking findings.

Summary

Approval sufficiency: bounded evidence supplied affirmative approval evidence for changed files, coverage/docstring posture, risk surfaces, and current-head verification; approval is not based merely on the absence of known blockers.
Verification posture: CodeGraph evidence was initialized and bounded current-head evidence reviewed for changed-file evidence including services/analysis-engine/tests/test_roles.py.
Linter/static: workflow/static review evidence is bounded by the current-head GitHub Checks gate and changed-file evidence.
TDD/regression: coverage execution evidence and focused changed hunks were reviewed from bounded-review-evidence.md.
Coverage: coverage execution evidence reports supported repository test suites passed.
Docstring coverage: coverage execution evidence reports configured repository docstring gates passed or docstring coverage was advisory.
DAG: CodeGraph/source-backed behavior map connects services/analysis-engine/tests/test_roles.py to the affected review, runtime, or workflow path and required checks.
PoC/execution: coverage-evidence job executed on the current head and reported PASS.
DDD/domain: workflow and repository-governance invariants were reviewed against changed files in bounded evidence.
CDD/context: CodeGraph evidence, changed-file history, and focused hunks were reviewed from bounded-review-evidence.md.
Similar issues: changed-file history evidence was reviewed for comparable local precedents.
Claim/concept check: bounded evidence, repository source, current-head workflow evidence, and, where numeric, scientific, statistical, or literature-backed claims are affected, original-paper/formula evidence and parameter-recovery expectations were used for claims.
Standards search: standards and external-source checks are delegated to configured OpenCode web_search/Context7/DeepWiki sources when applicable; no evidence-backed standards blocker is present in bounded evidence.
Compatibility/convention: changed workflow/script conventions, object naming, and reserved-word safety for schema/API/config/code surfaces were checked in bounded evidence.
Breaking-change/backcompat: deployment evidence and changed-file history were checked for backward-compatibility risk.
Performance: changed surfaces were checked for performance risk in bounded evidence.
Developer experience: changed automation, review, test, setup, and maintenance surfaces were checked for helpful or obstructive DX impact in bounded evidence.
User experience: connected user, operator, API, CLI, documentation, review-comment, status-check, rendering, and workflow-reader behavior was checked for contradictions against code, docs, and tests in bounded evidence.
Visual/DOM: Playwright visual, DOM locator, ARIA snapshot, console, and responsive evidence were checked when a web UI surface was present; for non-web surfaces, API/CLI/log/docs/workflow interaction evidence was reviewed instead.
Accessibility/i18n: accessibility, localization, and human-readable text surfaces were checked where UI, CLI, API message, docs, logs, or review text changed.
Supply-chain/license: dependency, package, model, container, and external-tool changes were checked in bounded evidence.
Packaging: package, build, test, lint, and security contracts were checked in bounded evidence.
Security/privacy: workflow-token, review-gate, and repository-automation security/privacy boundaries were checked in bounded evidence.

  • Result: APPROVE
  • Reason: Test-only change with full coverage and passing tests.
  • Head SHA: dbab09b14d7ccbb3f4e7263a4fdc5ecf26d20b57
  • Workflow run: 28623343704
  • Workflow attempt: 1

Changed-File Evidence Map

flowchart LR
  PR["PR changed files"] --> Evidence["OpenCode bounded evidence"]
  Evidence --> S1["Test: test_roles.py"]
  S1 --> I1["regression suite"]
  I1 --> R1["Review risk: Test: test_roles.py"]
  R1 --> V1["targeted test run"]
Loading

@opencode-agent opencode-agent Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

OpenCode reviewed the current-head bounded evidence and found no blocking issues.

Findings

No blocking findings.

Summary

Approval sufficiency: bounded evidence supplied affirmative approval evidence for changed files, coverage/docstring posture, risk surfaces, and current-head verification; approval is not based merely on the absence of known blockers.
Verification posture: CodeGraph evidence was initialized and bounded current-head evidence reviewed for changed-file evidence including services/analysis-engine/tests/test_roles.py.
Linter/static: workflow/static review evidence is bounded by the current-head GitHub Checks gate and changed-file evidence.
TDD/regression: coverage execution evidence and focused changed hunks were reviewed from bounded-review-evidence.md.
Coverage: coverage execution evidence reports supported repository test suites passed.
Docstring coverage: coverage execution evidence reports configured repository docstring gates passed or docstring coverage was advisory.
DAG: CodeGraph/source-backed behavior map connects services/analysis-engine/tests/test_roles.py to the affected review, runtime, or workflow path and required checks.
PoC/execution: coverage-evidence job executed on the current head and reported PASS.
DDD/domain: workflow and repository-governance invariants were reviewed against changed files in bounded evidence.
CDD/context: CodeGraph evidence, changed-file history, and focused hunks were reviewed from bounded-review-evidence.md.
Similar issues: changed-file history evidence was reviewed for comparable local precedents.
Claim/concept check: bounded evidence, repository source, current-head workflow evidence, and, where numeric, scientific, statistical, or literature-backed claims are affected, original-paper/formula evidence and parameter-recovery expectations were used for claims.
Standards search: standards and external-source checks are delegated to configured OpenCode web_search/Context7/DeepWiki sources when applicable; no evidence-backed standards blocker is present in bounded evidence.
Compatibility/convention: changed workflow/script conventions, object naming, and reserved-word safety for schema/API/config/code surfaces were checked in bounded evidence.
Breaking-change/backcompat: deployment evidence and changed-file history were checked for backward-compatibility risk.
Performance: changed surfaces were checked for performance risk in bounded evidence.
Developer experience: changed automation, review, test, setup, and maintenance surfaces were checked for helpful or obstructive DX impact in bounded evidence.
User experience: connected user, operator, API, CLI, documentation, review-comment, status-check, rendering, and workflow-reader behavior was checked for contradictions against code, docs, and tests in bounded evidence.
Visual/DOM: Playwright visual, DOM locator, ARIA snapshot, console, and responsive evidence were checked when a web UI surface was present; for non-web surfaces, API/CLI/log/docs/workflow interaction evidence was reviewed instead.
Accessibility/i18n: accessibility, localization, and human-readable text surfaces were checked where UI, CLI, API message, docs, logs, or review text changed.
Supply-chain/license: dependency, package, model, container, and external-tool changes were checked in bounded evidence.
Packaging: package, build, test, lint, and security contracts were checked in bounded evidence.
Security/privacy: workflow-token, review-gate, and repository-automation security/privacy boundaries were checked in bounded evidence.

  • Result: APPROVE
  • Reason: Test-only change with full coverage and passing tests.
  • Head SHA: dbab09b14d7ccbb3f4e7263a4fdc5ecf26d20b57
  • Workflow run: 28623343704
  • Workflow attempt: 1

Changed-File Evidence Map

flowchart LR
  PR["PR changed files"] --> Evidence["OpenCode bounded evidence"]
  Evidence --> S1["Test: test_roles.py"]
  S1 --> I1["regression suite"]
  I1 --> R1["Review risk: Test: test_roles.py"]
  R1 --> V1["targeted test run"]
Loading

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants