Agentic AI Atlas

II.

Page JSON

page:qa-00-qa-architecture

Structured · live

QA Architecture json

Inspect the normalized record payload exactly as the atlas UI reads it.

File · wiki/qa/00-qa-architecture.mdCluster · wiki

Record JSON

{
  "id": "page:qa-00-qa-architecture",
  "_kind": "Page",
  "_file": "wiki/qa/00-qa-architecture.md",
  "_cluster": "wiki",
  "attributes": {
    "nodeKind": "Page",
    "title": "QA Architecture",
    "displayName": "QA Architecture",
    "slug": "qa/00-qa-architecture",
    "articlePath": "wiki/qa/00-qa-architecture.md",
    "article": "# QA Architecture\n\n> Phase-5 deliverable. The test pyramid, evidence-boundary checks, and regression detection regime that gate every change to the v6 ecosystem.\n\n## Test pyramid\n\n```\n            ┌──────────────────────────┐\n            │     E2E (handful)        │   schema → regenerate → SDK consumer compiles\n            ├──────────────────────────┤\n            │  Integration (per-gen)   │   generator vs known-graph fixture\n            ├──────────────────────────┤\n            │  Generator-output checks │   manifest correctness, idempotency, hash\n            ├──────────────────────────┤\n            │  Schema validation       │   ontology-schema.yaml, invariants.yaml, examples/\n            └──────────────────────────┘\n```\n\n### Layer 1 — Schema validation (largest)\n\nRuns on every PR. Implements `graph/schema/validation-rules.md`:\n\n- **Structural**: every example YAML parses against the relevant NodeKind in `ontology-schema.yaml`. All `ref<>` fields resolve. No orphans outside the explicitly-orphan-allowed kinds.\n- **Invariants**: `invariants.yaml` rules execute against the loaded graph. V-1.x..V-12.x as enumerated in the validation-rules spec.\n- **Markdown ↔ YAML parity** (V-12.5): every NodeKind file under `schema/node-kinds/` describes the same attributes (with same types and required-flags) as the YAML.\n- **Evidence completeness** (V-2.1): every fact-bearing claim has at least one `EvidenceSource`.\n\n### Layer 2 — Generator output checks\n\nFor every generator:\n\n- **Determinism**: run twice; outputs byte-identical.\n- **Manifest correctness**: declared `consumes` matches the nodes/edges actually traversed.\n- **Idempotency**: third run on disk produces zero diff.\n- **No hand-edits**: outputs under `wiki/generated/`, `wiki/generators/templates/<gen>/__fixtures__/expected/`, etc., are byte-equal to the regenerated form. CI fails any PR where a generator output diverges from a fresh regeneration on the same input graph.\n\n### Layer 3 — Integration tests\n\nFor each of the 11 generators (10 + Gap sync), a test loads a frozen graph fixture from `wiki/qa/tests/fixtures/<gen>/in/`, runs the generator, and snapshot-compares against `wiki/qa/tests/fixtures/<gen>/out/`. Fixtures change only when the generator's contract changes — and then in the same PR.\n\n### Layer 4 — E2E\n\nA small number of end-to-end scenarios:\n\n1. **Schema bump propagates**: add an attribute to a NodeKind in `ontology-schema.yaml`; run all generators; assert the wiki, SDK types, and OpenAPI artifact reflect it.\n2. **SDK consumer compiles**: a fixture project under `wiki/qa/tests/sdk-consumer/` imports the regenerated SDK and `tsc --noEmit` succeeds.\n3. **Gap close cascade**: open a fake `Gap` node, fire the propagation generator, observe `propagationStatus` flip across all seven levels.\n\nE2E runs nightly and on release tags, not per-PR.\n\n## Evidence-boundary checks\n\nThe graph claims facts about the outside world. The boundary between \"graph claim\" and \"external source\" is where staleness creeps in. Phase 5 enforces:\n\n- **Every claim has live evidence**. CI fails if any `Claim` has no `EvidenceSource` in its `evidenceSourceIds` (V-2.1) or if every linked source is past its `freshnessWindowDays` (V-2.6).\n- **Reachability sweeps** (weekly job). Every `EvidenceSource{kindLabel: web}` has its `reachabilityCheck` re-run. `status != ok` for >7 days opens a Level-1 Gap automatically.\n- **Trust-floor enforcement** (V-2.4). Safety-critical attributes (capability binding, install method, sandbox profile) MUST NOT be backed solely by `synthetic` evidence. CI rejects.\n- **Attestation freshness**. `Attestation` records have explicit `expiresAt`. Past-expiry attestations move the claim back to \"unattested\" and fail any policy that requires attestation.\n\nSee [`04-evidence-boundaries.md`](./04-evidence-boundaries.md) for boundary-specific tooling.\n\n## Regression detection\n\nThree kinds of regression are watched:\n\n1. **Generator output regression**: a PR that touches a generator template must include the regenerated output. CI runs the generator and fails if the diff is missing or different from the PR's diff.\n2. **Schema regression**: any breaking change to `ontology-schema.yaml` (removed attribute, removed NodeKind, changed enum domain) must be accompanied by a major `CatalogVersion` bump and a `MigrationSpec`. Detected by comparing the PR's schema against `main`'s.\n3. **Coverage regression**: see [`03-coverage.md`](./03-coverage.md). A drop in NodeKind / Edge / Capability / Evidence-freshness / Generator coverage fails the build.\n\n## Spot-check queries (search smoke tests)\n\nTo verify the wiki search index, the test suite runs a fixed set of natural-language queries against the rendered site and asserts the expected node ids appear in the top-N results. Examples:\n\n- \"extended thinking\" → `capability:extended-thinking`, `model:claude-opus-4-7`\n- \"PreToolUse hook\" → `hook-surface:pre-tool-use`\n- \"MCP stdio transport\" → `mcp-transport:stdio`\n\nUpdates to the spot-check list track schema bumps; they are themselves a Level-2 deliverable.\n\n## Runtime budget\n\n- Schema validation: <30 s.\n- Generator suite (11 generators on full graph): <2 min.\n- Integration tests: <5 min.\n- E2E: <15 min (nightly).\n\nExceeding budget is a P1 gap.\n",
    "documents": []
  },
  "outgoingEdges": [],
  "incomingEdges": []
}

QA Architecture json

Inspect the normalized record payload exactly as the atlas UI reads it.

File · wiki/qa/00-qa-architecture.mdCluster · wiki

Record JSON

{
  "id": "page:qa-00-qa-architecture",
  "_kind": "Page",
  "_file": "wiki/qa/00-qa-architecture.md",
  "_cluster": "wiki",
  "attributes": {
    "nodeKind": "Page",
    "title": "QA Architecture",
    "displayName": "QA Architecture",
    "slug": "qa/00-qa-architecture",
    "articlePath": "wiki/qa/00-qa-architecture.md",
    "article": "# QA Architecture\n\n> Phase-5 deliverable. The test pyramid, evidence-boundary checks, and regression detection regime that gate every change to the v6 ecosystem.\n\n## Test pyramid\n\n```\n            ┌──────────────────────────┐\n            │     E2E (handful)        │   schema → regenerate → SDK consumer compiles\n            ├──────────────────────────┤\n            │  Integration (per-gen)   │   generator vs known-graph fixture\n            ├──────────────────────────┤\n            │  Generator-output checks │   manifest correctness, idempotency, hash\n            ├──────────────────────────┤\n            │  Schema validation       │   ontology-schema.yaml, invariants.yaml, examples/\n            └──────────────────────────┘\n```\n\n### Layer 1 — Schema validation (largest)\n\nRuns on every PR. Implements `graph/schema/validation-rules.md`:\n\n- **Structural**: every example YAML parses against the relevant NodeKind in `ontology-schema.yaml`. All `ref<>` fields resolve. No orphans outside the explicitly-orphan-allowed kinds.\n- **Invariants**: `invariants.yaml` rules execute against the loaded graph. V-1.x..V-12.x as enumerated in the validation-rules spec.\n- **Markdown ↔ YAML parity** (V-12.5): every NodeKind file under `schema/node-kinds/` describes the same attributes (with same types and required-flags) as the YAML.\n- **Evidence completeness** (V-2.1): every fact-bearing claim has at least one `EvidenceSource`.\n\n### Layer 2 — Generator output checks\n\nFor every generator:\n\n- **Determinism**: run twice; outputs byte-identical.\n- **Manifest correctness**: declared `consumes` matches the nodes/edges actually traversed.\n- **Idempotency**: third run on disk produces zero diff.\n- **No hand-edits**: outputs under `wiki/generated/`, `wiki/generators/templates/<gen>/__fixtures__/expected/`, etc., are byte-equal to the regenerated form. CI fails any PR where a generator output diverges from a fresh regeneration on the same input graph.\n\n### Layer 3 — Integration tests\n\nFor each of the 11 generators (10 + Gap sync), a test loads a frozen graph fixture from `wiki/qa/tests/fixtures/<gen>/in/`, runs the generator, and snapshot-compares against `wiki/qa/tests/fixtures/<gen>/out/`. Fixtures change only when the generator's contract changes — and then in the same PR.\n\n### Layer 4 — E2E\n\nA small number of end-to-end scenarios:\n\n1. **Schema bump propagates**: add an attribute to a NodeKind in `ontology-schema.yaml`; run all generators; assert the wiki, SDK types, and OpenAPI artifact reflect it.\n2. **SDK consumer compiles**: a fixture project under `wiki/qa/tests/sdk-consumer/` imports the regenerated SDK and `tsc --noEmit` succeeds.\n3. **Gap close cascade**: open a fake `Gap` node, fire the propagation generator, observe `propagationStatus` flip across all seven levels.\n\nE2E runs nightly and on release tags, not per-PR.\n\n## Evidence-boundary checks\n\nThe graph claims facts about the outside world. The boundary between \"graph claim\" and \"external source\" is where staleness creeps in. Phase 5 enforces:\n\n- **Every claim has live evidence**. CI fails if any `Claim` has no `EvidenceSource` in its `evidenceSourceIds` (V-2.1) or if every linked source is past its `freshnessWindowDays` (V-2.6).\n- **Reachability sweeps** (weekly job). Every `EvidenceSource{kindLabel: web}` has its `reachabilityCheck` re-run. `status != ok` for >7 days opens a Level-1 Gap automatically.\n- **Trust-floor enforcement** (V-2.4). Safety-critical attributes (capability binding, install method, sandbox profile) MUST NOT be backed solely by `synthetic` evidence. CI rejects.\n- **Attestation freshness**. `Attestation` records have explicit `expiresAt`. Past-expiry attestations move the claim back to \"unattested\" and fail any policy that requires attestation.\n\nSee [`04-evidence-boundaries.md`](./04-evidence-boundaries.md) for boundary-specific tooling.\n\n## Regression detection\n\nThree kinds of regression are watched:\n\n1. **Generator output regression**: a PR that touches a generator template must include the regenerated output. CI runs the generator and fails if the diff is missing or different from the PR's diff.\n2. **Schema regression**: any breaking change to `ontology-schema.yaml` (removed attribute, removed NodeKind, changed enum domain) must be accompanied by a major `CatalogVersion` bump and a `MigrationSpec`. Detected by comparing the PR's schema against `main`'s.\n3. **Coverage regression**: see [`03-coverage.md`](./03-coverage.md). A drop in NodeKind / Edge / Capability / Evidence-freshness / Generator coverage fails the build.\n\n## Spot-check queries (search smoke tests)\n\nTo verify the wiki search index, the test suite runs a fixed set of natural-language queries against the rendered site and asserts the expected node ids appear in the top-N results. Examples:\n\n- \"extended thinking\" → `capability:extended-thinking`, `model:claude-opus-4-7`\n- \"PreToolUse hook\" → `hook-surface:pre-tool-use`\n- \"MCP stdio transport\" → `mcp-transport:stdio`\n\nUpdates to the spot-check list track schema bumps; they are themselves a Level-2 deliverable.\n\n## Runtime budget\n\n- Schema validation: <30 s.\n- Generator suite (11 generators on full graph): <2 min.\n- Integration tests: <5 min.\n- E2E: <15 min (nightly).\n\nExceeding budget is a P1 gap.\n",
    "documents": []
  },
  "outgoingEdges": [],
  "incomingEdges": []
}