Agentic AI Atlas

II.

MetaCluster overview

meta-cluster:evaluation

Reference · live

Evaluation (meta) overview

Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for meta-cluster:evaluation.

MetaClusterOutgoing · 7Incoming · 7

Attributes

displayName

Evaluation (meta)

clusterNumber

scope

Conceptual aggregation of NodeKinds covering benchmark / evaluation machinery: Benchmark and TestSet (the test scaffold), EvalRun (one execution against a target), EvalHarness (the runnable harness side), Judge (LLM / human / programmatic grader), Rubric (scoring criteria), and SkillArea (the named expertise area benchmarks are commonly bound against). Members live mainly in editorial cluster 11-benchmarks, with SkillArea in 9-domain; each MetaNodeKind records the truthful editorial slug.

parentClusterId

null

Outgoing edges

contains_meta_node_kind7

meta-node-kind:benchmark·MetaNodeKindBenchmark (meta)
meta-node-kind:test-set·MetaNodeKindTestSet (meta)
meta-node-kind:eval-run·MetaNodeKindEvalRun (meta)
meta-node-kind:eval-harness·MetaNodeKindEvalHarness (meta)
meta-node-kind:judge·MetaNodeKindJudge (meta)
meta-node-kind:rubric·MetaNodeKindRubric (meta)
meta-node-kind:skill-area·MetaNodeKindSkillArea (meta)

Incoming edges

in_cluster7

meta-node-kind:benchmark·MetaNodeKindBenchmark (meta)
meta-node-kind:test-set·MetaNodeKindTestSet (meta)
meta-node-kind:eval-run·MetaNodeKindEvalRun (meta)
meta-node-kind:eval-harness·MetaNodeKindEvalHarness (meta)
meta-node-kind:judge·MetaNodeKindJudge (meta)
meta-node-kind:rubric·MetaNodeKindRubric (meta)
meta-node-kind:skill-area·MetaNodeKindSkillArea (meta)