The Agent-Native CLI Standard

Eight principles for CLI tools operated by AI agents.

AI agents operate CLIs differently than humans do. They can't answer interactive prompts, can't parse vague output, and can't recover from errors that don't say what to do next. The principles below define what agent-native means in RFC 2119 language, with machine-readable requirements[] so an auditor (and graders) can pin against the standard.

Early-stage spec. Ideas, debate, and contributions are welcome: pressure-tests against any principle, real-CLI grading submissions, counter-examples that argue a tier sits wrong, or new principles that the genre is missing. The spec is published as status: active because the contracts are stable enough to cite, not because anything here is locked. See CONTRIBUTING.md for how to file each type.

The four artifacts

Spec: this repo. Eight RFC 2119 principles plus machine-readable requirements[] in YAML frontmatter. Currently v0.5.0; all principles ship as status: active.
Linter: anc. Scores any CLI repo against the spec. Pins against requirement IDs, not prose.
Skill bundle: agentnative-skill. Agent-facing guide that teaches agents how to invoke anc and remediate findings against the spec; vendors the principles for offline use.
Leaderboard: anc.dev/scorecards. Top CLIs graded live; submit a PR to add yours.

Quick start

brew install brettdavies/tap/agentnative
anc audit .

Also installable via cargo install agentnative or platform-specific archives on GitHub Releases.

Run anc audit . --output json for machine-readable scorecards. Per-principle filtering via anc audit . --principle <1-8>. anc emit schema prints the scorecard JSON Schema (draft 2020-12) for downstream consumers. For a sample scorecard, see the anc README or a live one at anc.dev/scorecards.

Principles

Each principle is a single file under principles/. Click through for the full contract: definition, why agents need it, the tiered requirements, evidence to look for, and anti-patterns.

#	Principle	Summary
P1	Non-Interactive by Default	Never block on TTY input during normal operation
P2	Structured, Parseable Output	Offer machine-readable formats alongside human text
P3	Progressive Help Discovery	Layer help from one-liner to full reference
P4	Fail Fast with Actionable Errors	Distinct exit codes, structured error output, fix suggestions
P5	Safe Retries and Explicit Mutation Boundaries	Idempotent reads, explicit mutation, dry-run support
P6	Composable and Predictable Command Structure	Consistent grammar, composable subcommands
P7	Bounded, High-Signal Responses	Predictable output size, pagination, filtering
P8	Discoverable Through Agent Skill Bundles	Ship a skill bundle and an install path so agents find it

Reading the spec

Every principle file pairs a machine-readable frontmatter block with prose in a fixed order: Definition, Why Agents Need It, Requirements (MUST / SHOULD / MAY), Evidence, Anti-Patterns, and Pressure-test notes. Both halves are load-bearing: machines parse the frontmatter, humans read the prose, and a CI validator keeps the two in sync.

The frontmatter carries a requirements[] array (one entry per MUST/SHOULD/MAY bullet) that auditors and graders pin against instead of prose. Each entry has:

id: a stable p<n>-<level>-<slug> identifier, unique across all eight files. Tooling references these, so they survive prose edits.
level: must, should, or may, with RFC 2119 / RFC 8174 semantics.
applicability: universal, or conditional: gated on a prose {if: "<reason>"} clause, or on a machine-checkable {kind: conditional, antecedent: {audit_id: "<id>"}} whose named verifier decides whether the requirement binds.
summary: one sentence, mirrored by the prose bullet.

principles/AGENTS.md is the full authoring and governance contract: frontmatter fields, requirement-ID conventions, the conditional-applicability propagation table, the last-revised discipline, the status lifecycle, and the coupled-release protocol with the linter.

Status

All eight principles are status: active. Published as part of the standard, not drafts. This is a working spec accepting pressure-tests, not a manifesto. File substantive critique via the pressure-test issue template; a principle moves to under-review when a finding might change MUST/SHOULD/MAY tiers, then back to active once resolved. See principles/AGENTS.md for the full status lifecycle and pressure-test protocol.

Versioning

The spec uses semver-adjacent versioning:

MINOR: new or changed MUSTs
PATCH: SHOULD/MAY changes, prose edits

Each principle carries an independent last-revised date in frontmatter. The date updates when any MUST/SHOULD/MAY in that principle changes tier, is added, or is removed. Prose-only edits do not update the date.

Current version: see VERSION.

Scoring

Conformance is scored against the requirement IDs in requirements[], not against prose, and from shipped-binary behavior only: behavioral-layer rows observed by running the tool, not its source or project layout. The status taxonomy, the credit-weighted formula, the tunable tier weights, the 70% badge-eligibility floor, and the cohort bands are all defined in principles/scoring.md. See docs/badge.md for the badge claim convention.

Badge

CLI tools whose scorecards meet the 70% floor (see Scoring) can embed a live-score badge in their READMEs:

[![agent-native](https://anc.dev/badge/<tool>.svg)](https://anc.dev/scorecards/<tool>)

For example, anc is scored against the spec and embeds its own live badge:

The badge text reflects the tool's current score from the live scorecard; clicking through shows the per-requirement breakdown. See docs/badge.md for the claim convention: eligibility, embed URL, version pinning, honesty expectation, regression behavior.

Decision records

P1: behavioral MUST wording: why the MUST describes observable behavior instead of enumerating prompt and TUI APIs, and what the automated-audit verification boundary is.

principles/: the eight principle files (the standard itself).
principles/AGENTS.md: authoring and governance contract: frontmatter shape, requirement IDs, conditional applicability, status lifecycle, coupled-release protocol.
principles/scoring.md: leaderboard scoring formula, status taxonomy, eligibility floor, cohort bands.
docs/badge.md: badge claim convention: eligibility, embed shapes, version pinning, regression behavior.
docs/decisions/: decision records for non-obvious spec choices.
CONTRIBUTING.md: contribution shapes, tier breakdown, AI disclosure, human co-sign, release protocol. CHANGELOG.md is the version history; RELEASES.md documents how a release is cut.

Acknowledgements

The principles in this spec descend from multiple streams of agent-CLI thinking that converged in late 2025 and 2026, not from a single source.

Foundational CLI doctrine: Adam Wiggins' 12-factor methodology (the env-var contract in P1 inherits Factor III; the stdout-as-event-stream framing in P2 inherits Factor XI), the POSIX Utility Conventions (argument grammar, env-var naming), the Command Line Interface Guidelines (load-bearing for P3, P5, and P6), Eric S. Raymond's The Art of Unix Programming, the NO_COLOR standard, and the XDG Base Directory Specification.
Agent-CLI synthesis (parallel work, 2025–2026): Anthropic's Writing tools for agents, Sriram Madapusi Vasudevan's InfoQ pieces on AI-agent CLIs (2025-08), Cloudflare's The CLI for all of Cloudflare (2026-04), Andrej Karpathy on terminal-as-legacy-tech, the Speakeasy and AppleBOY/Wu three-layer (API/CLI/skills) architecture, Michael Yuan's compound-failure framing, and other contemporaneous voices on dev.to and Medium.
Trevin's 7 Principles for Agent-Friendly CLIs (2026-03-26) and follow-up 10 Principles for Agent-Native CLIs (2026-05-04) named the genre and gave it momentum. Trevin's seven-axis decomposition is the proximate ancestor of this spec's initial draft.
What this project adds is mechanism: the anc linter (with auto-fix runway), the agentnative-skill bundle that teaches agents how to invoke anc and remediate findings, the live leaderboard, the score badge, and the cross-repo coupled-release norm. The principles above name the contract; this tooling makes conformance verifiable so a CLI can claim agent-native and a reader can audit.

Voice and identity decisions for the spec live in BRAND.md and the spec-channel PRODUCT.md.

Contributing

Three shapes of contribution, in order of cost:

Signal (pressure-test against a principle, grading-finding for the leaderboard, or a spec question): file an issue with the matching template at github.com/brettdavies/agentnative/issues/new/choose.
Proposal (new principle, MUST/SHOULD/MAY tier change, applicability-clause change): open a design issue first; the maintainer signs off before spec text lands.
Code: PR against dev (per branch discipline). Spec edits, governance docs, release infrastructure, validator tooling.

Local setup:

git clone https://github.com/brettdavies/agentnative
cd agentnative
git config core.hooksPath scripts/hooks  # mirror CI locally on every push
bun scripts/validate-principles.mjs

The full tier breakdown, AI disclosure requirements, the human co-sign policy for spec changes, and the coupled release protocol live in CONTRIBUTING.md. Cross-repo routing: linter bugs (false positives, scoring bugs) go to brettdavies/agentnative-cli; site bugs (rendering, performance) to brettdavies/agentnative-site.

License

Spec text: CC BY 4.0
anc auditor tool: MIT or Apache-2.0
Scripts under scripts/: MIT or Apache-2.0

See LICENSE for the full carve-out; per-file SPDX headers name the exact license per file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Agent-Native CLI Standard

The four artifacts

Quick start

Principles

Reading the spec

Status

Versioning

Scoring

Badge

Decision records

Related

Acknowledgements

Contributing

License

About

Licenses found

Uh oh!

Releases 6

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github		.github
docs		docs
principles		principles
scripts		scripts
.gitignore		.gitignore
.markdownlint-cli2.yaml		.markdownlint-cli2.yaml
AGENTS.md		AGENTS.md
BRAND.md		BRAND.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
PRODUCT.md		PRODUCT.md
README.md		README.md
RELEASES-RATIONALE.md		RELEASES-RATIONALE.md
RELEASES.md		RELEASES.md
VERSION		VERSION
cliff.toml		cliff.toml

Folders and files

Latest commit

History

Repository files navigation

The Agent-Native CLI Standard

The four artifacts

Quick start

Principles

Reading the spec

Status

Versioning

Scoring

Badge

Decision records

Related

Acknowledgements

Contributing

License

About

Topics

Resources

License

Licenses found

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 6

Uh oh!

Contributors

Uh oh!

Languages