Skip to content

Habitat Maturity Model (14 dimensions)

The Agentic Experience 5-Level Habitat Maturity Model is the spine of the assessment. It describes what a team's habitat actually delivers across fourteen dimensions, each placed L1–L5. Verbs are in bold — the verb is the finding.

Dimension L1 L2 L3 L4 L5
Agent behaviour Dictating Commanding (prompting) Regulating Orchestrating Supervising
Agent input short ad-hoc prompts larger prompts, commands plans co-authored with an agent iteratively refined specs refined specs + customer/observable metrics
Workflow safe runtime, generic prompts/commands saved harness engineered workflow defined workflow automated (agentic runtime)
Operating model Chat with agent Prompt-engineering humans drive / verify humans in the loop humans certify
Teams provide basic team-specific constitution comprehensive product-specific constitution full product-specific constitution custom product-specific runtime
Output role (I am…) Running Inspecting Standardising Specifying Certifying
Output artefact executable / artifact code process & consistency rules clear criteria evidence
Humans review output only code implementation in detail specs comprehensive evidence
Work patterns partial task completion small task completion e2e development semi-autonomous work mostly-autonomous
Agent composition single single + saved patterns primary + read-only critics bounded ensemble (harness-composed) self-orchestrating constellations
Agents… Assist individuals Complete basic tasks Develop small changes (stories) Implement larger changes (epics) Implement autonomously
Testing Manual inspection Asserting (unit tests) Verifying (functional / business) Validating (comprehensive automation) Assuring (multi-perspective + post-deploy)
Observability Eyeballs Captured Instrumented Aggregated Closed loop
Governance trust-based, ambient conventional Constitutional Policy-as-code Continuous certification

The dimensions are scored L1–L5, not L0–L5: L1 is the "ad-hoc but present" floor. A repo with essentially no AI-collaboration evidence sits at the L1 floor on every dimension by definition.

The headline Habitat Maturity Level

The overall Habitat Maturity Level is the rounded mean of the fourteen placements, with the weakest dimensions named as the ceiling — a habitat is only as mature as the dimensions its work actually flows through. A high mean dragged down by one L1 dimension is reported as "L3, held back by L1 Observability", not a flat L3.

How each dimension is placed

Eight dimensions are repo-observable — placed evidence-first from the scan:

Dimension Signals that raise the placement
Workflow saved prompts/commands (L2); a harness document — HARNESS.md / CONSTRAINTS.md (L3); defined workflow scripts or CI pipelines (L4); automated agentic runtime (L5)
Teams provide richness of CLAUDE.md / AGENTS.md (basic → comprehensive → full constitution); product-specific skills; custom runtime / prod-like agent environments (L5)
Agent input ad-hoc prompt traces (L1); saved prompt/command libraries (L2); plan documents (L3); a specs/ directory (L4); specs + observable metrics (L5)
Output artefact raw artifacts (L1); code (L2); process & consistency rule docs (L3); acceptance-criteria documents (L4); evidence artefacts — audits, CI evidence (L5)
Agent composition custom agents; read-only critic/reviewer agents; an orchestrator with safety gates; agent-team docs; multi-agent workflow scripts
Testing test suites; coverage enforcement; mutation testing; tests-before-merge gates; system/regression suites; agent-authored tests; prod-like environments
Observability agent-activity logging; metrics capture; dashboards; per-PR acceptance / mutation-kill tracking; OTel config; closed-loop signals
Governance HARNESS.md constraint count + enforcement ratio; policy-as-code CI checks; the unverified → agent → deterministic promotion ladder; governance-audit cadence

Six dimensions are behavioural — they describe how the team works, not what the filesystem holds. They're inferred from the repo-observable dimensions and from the clarifying questions, and flagged (inferred) when not directly evidenced: Agent behaviour, Operating model, Output role, Humans review, Work patterns, Agents….

Evidence-first vs survey

  • Evidence-first (default) — repo-observable dimensions placed from the scan; behavioural ones inferred.
  • Survey (opt-in) — marker statements administered on a 1–5 scale, two per level, for a rigorous per-dimension score. See Run the precise survey.

The four headline axes

Four dimensions — Agent composition (reported as Composition), Testing, Observability, Governance — are the most repo-observable and map cleanly onto the three disciplines. They're surfaced as the Operational Axes (Part D) table in the report. The Habitat Build Gap, however, uses the mean of all fourteen dimensions.

Provenance

The fourteen dimensions and their verbs are the Agentic Experience 5-Level Habitat Maturity Model (TechTalk.AI / Agentic Engineering). The AI Literacy framework's ALCI drew its four operational axes from this model; this instrument scores against the model in full. See The Sovereign Engineer.