II.
EvalResult overview
Reference · liveeval-result:mbpp.qwen-2-5-coder-32b.001
eval-result:mbpp.qwen-2-5-coder-32b.001 overview
Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for eval-result:mbpp.qwen-2-5-coder-32b.001.
Attributes
evalRunId
metricName
pass@1
score
0.902
unit
fraction
passFail
pass
reportedAt
2024-11-12T00:00:00Z
Outgoing edges
belongs_to_eval_run1
scored_against1
- benchmark:mbpp·BenchmarkMBPP
Incoming edges
None.