II.
Page JSON
Structured · livepage:qa-00-qa-architecture
QA Architecture json
Inspect the normalized record payload exactly as the atlas UI reads it.
{
"id": "page:qa-00-qa-architecture",
"_kind": "Page",
"_file": "wiki/qa/00-qa-architecture.md",
"_cluster": "wiki",
"attributes": {
"nodeKind": "Page",
"title": "QA Architecture",
"displayName": "QA Architecture",
"slug": "qa/00-qa-architecture",
"articlePath": "wiki/qa/00-qa-architecture.md",
"article": "# QA Architecture\n\n> Phase-5 deliverable. The test pyramid, evidence-boundary checks, and regression detection regime that gate every change to the v6 ecosystem.\n\n## Test pyramid\n\n```\n ┌──────────────────────────┐\n │ E2E (handful) │ schema → regenerate → SDK consumer compiles\n ├──────────────────────────┤\n │ Integration (per-gen) │ generator vs known-graph fixture\n ├──────────────────────────┤\n │ Generator-output checks │ manifest correctness, idempotency, hash\n ├──────────────────────────┤\n │ Schema validation │ ontology-schema.yaml, invariants.yaml, examples/\n └──────────────────────────┘\n```\n\n### Layer 1 — Schema validation (largest)\n\nRuns on every PR. Implements `graph/schema/validation-rules.md`:\n\n- **Structural**: every example YAML parses against the relevant NodeKind in `ontology-schema.yaml`. All `ref<>` fields resolve. No orphans outside the explicitly-orphan-allowed kinds.\n- **Invariants**: `invariants.yaml` rules execute against the loaded graph. V-1.x..V-12.x as enumerated in the validation-rules spec.\n- **Markdown ↔ YAML parity** (V-12.5): every NodeKind file under `schema/node-kinds/` describes the same attributes (with same types and required-flags) as the YAML.\n- **Evidence completeness** (V-2.1): every fact-bearing claim has at least one `EvidenceSource`.\n\n### Layer 2 — Generator output checks\n\nFor every generator:\n\n- **Determinism**: run twice; outputs byte-identical.\n- **Manifest correctness**: declared `consumes` matches the nodes/edges actually traversed.\n- **Idempotency**: third run on disk produces zero diff.\n- **No hand-edits**: outputs under `wiki/generated/`, `wiki/generators/templates/<gen>/__fixtures__/expected/`, etc., are byte-equal to the regenerated form. CI fails any PR where a generator output diverges from a fresh regeneration on the same input graph.\n\n### Layer 3 — Integration tests\n\nFor each of the 11 generators (10 + Gap sync), a test loads a frozen graph fixture from `wiki/qa/tests/fixtures/<gen>/in/`, runs the generator, and snapshot-compares against `wiki/qa/tests/fixtures/<gen>/out/`. Fixtures change only when the generator's contract changes — and then in the same PR.\n\n### Layer 4 — E2E\n\nA small number of end-to-end scenarios:\n\n1. **Schema bump propagates**: add an attribute to a NodeKind in `ontology-schema.yaml`; run all generators; assert the wiki, SDK types, and OpenAPI artifact reflect it.\n2. **SDK consumer compiles**: a fixture project under `wiki/qa/tests/sdk-consumer/` imports the regenerated SDK and `tsc --noEmit` succeeds.\n3. **Gap close cascade**: open a fake `Gap` node, fire the propagation generator, observe `propagationStatus` flip across all seven levels.\n\nE2E runs nightly and on release tags, not per-PR.\n\n## Evidence-boundary checks\n\nThe graph claims facts about the outside world. The boundary between \"graph claim\" and \"external source\" is where staleness creeps in. Phase 5 enforces:\n\n- **Every claim has live evidence**. CI fails if any `Claim` has no `EvidenceSource` in its `evidenceSourceIds` (V-2.1) or if every linked source is past its `freshnessWindowDays` (V-2.6).\n- **Reachability sweeps** (weekly job). Every `EvidenceSource{kindLabel: web}` has its `reachabilityCheck` re-run. `status != ok` for >7 days opens a Level-1 Gap automatically.\n- **Trust-floor enforcement** (V-2.4). Safety-critical attributes (capability binding, install method, sandbox profile) MUST NOT be backed solely by `synthetic` evidence. CI rejects.\n- **Attestation freshness**. `Attestation` records have explicit `expiresAt`. Past-expiry attestations move the claim back to \"unattested\" and fail any policy that requires attestation.\n\nSee [`04-evidence-boundaries.md`](./04-evidence-boundaries.md) for boundary-specific tooling.\n\n## Regression detection\n\nThree kinds of regression are watched:\n\n1. **Generator output regression**: a PR that touches a generator template must include the regenerated output. CI runs the generator and fails if the diff is missing or different from the PR's diff.\n2. **Schema regression**: any breaking change to `ontology-schema.yaml` (removed attribute, removed NodeKind, changed enum domain) must be accompanied by a major `CatalogVersion` bump and a `MigrationSpec`. Detected by comparing the PR's schema against `main`'s.\n3. **Coverage regression**: see [`03-coverage.md`](./03-coverage.md). A drop in NodeKind / Edge / Capability / Evidence-freshness / Generator coverage fails the build.\n\n## Spot-check queries (search smoke tests)\n\nTo verify the wiki search index, the test suite runs a fixed set of natural-language queries against the rendered site and asserts the expected node ids appear in the top-N results. Examples:\n\n- \"extended thinking\" → `capability:extended-thinking`, `model:claude-opus-4-7`\n- \"PreToolUse hook\" → `hook-surface:pre-tool-use`\n- \"MCP stdio transport\" → `mcp-transport:stdio`\n\nUpdates to the spot-check list track schema bumps; they are themselves a Level-2 deliverable.\n\n## Runtime budget\n\n- Schema validation: <30 s.\n- Generator suite (11 generators on full graph): <2 min.\n- Integration tests: <5 min.\n- E2E: <15 min (nightly).\n\nExceeding budget is a P1 gap.\n",
"documents": []
},
"outgoingEdges": [],
"incomingEdges": []
}