II.
EvalResult overview
Reference · liveeval-result:mmlu.mistral-large-2.001
eval-result:mmlu.mistral-large-2.001 overview
Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for eval-result:mmlu.mistral-large-2.001.
Attributes
evalRunId
metricName
accuracy
score
0.84
unit
fraction
passFail
pass
reportedAt
2024-07-24T00:00:00Z
Outgoing edges
belongs_to_eval_run1
scored_against1
- benchmark:mmlu·BenchmarkMMLU
Incoming edges
None.