II. EvalResult overview
Reference · live · eval-result:mmlu.phi-3-medium.001
eval-result:mmlu.phi-3-medium.001 overview
Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for eval-result:mmlu.phi-3-medium.001.
Attributes
evalRunId:
metricName: accuracy
score: 0.766
unit: fraction
passFail: pass
reportedAt: 2024-05-21T00:00:00Z
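As a rough illustration of how these attributes might be held in code, the sketch below mirrors the attribute names listed above in a small Python dataclass. The class name `EvalResult`, its snake_case field names, and the helpers `score_percent` and `parse_timestamp` are assumptions made for this example, not part of any documented schema. `evalRunId` is left out because no value is shown for it.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class EvalResult:
    """Hypothetical container mirroring the attribute names on this page."""
    metric_name: str
    score: float
    unit: str
    pass_fail: str
    reported_at: datetime

    def score_percent(self) -> float:
        """Convert a fractional score to a percentage, rounded to one decimal."""
        if self.unit != "fraction":
            raise ValueError(f"unsupported unit: {self.unit}")
        return round(self.score * 100, 1)


def parse_timestamp(value: str) -> datetime:
    # datetime.fromisoformat only accepts a trailing "Z" from Python 3.11 on,
    # so normalise it to an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


# Values copied from the attribute list; evalRunId is omitted (no value shown).
result = EvalResult(
    metric_name="accuracy",
    score=0.766,
    unit="fraction",
    pass_fail="pass",
    reported_at=parse_timestamp("2024-05-21T00:00:00Z"),
)
```

Keeping `unit` next to `score` lets a consumer refuse to convert values it does not understand instead of silently treating every score as a fraction.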
Outgoing edges
belongs_to_eval_run (1)
scored_against (1)
- benchmark:mmlu · Benchmark · MMLU
Incoming edges
None.
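One way to picture the edge lists is as (source, label, target) triples. The sketch below is a minimal illustration under that assumption. The triple layout and the helper names `outgoing` and `incoming` are hypothetical, and the target of belongs_to_eval_run is left as `None` because the page does not show it.

```python
from collections import defaultdict

SOURCE = "eval-result:mmlu.phi-3-medium.001"

# (source, label, target) triples. The belongs_to_eval_run target is not
# shown on the page, so it is left as None rather than guessed.
edges = [
    (SOURCE, "belongs_to_eval_run", None),
    (SOURCE, "scored_against", "benchmark:mmlu"),
]


def outgoing(node, triples):
    """Group the targets of a node's outgoing edges by edge label."""
    grouped = defaultdict(list)
    for src, label, dst in triples:
        if src == node:
            grouped[label].append(dst)
    return dict(grouped)


def incoming(node, triples):
    """Return (source, label) pairs for edges that point at the node."""
    return [(src, label) for src, label, dst in triples if dst == node]
```

With these triples, `outgoing(SOURCE, edges)` yields one target per label, matching the counts shown, and `incoming(SOURCE, edges)` is empty, matching "None."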