II.
EvalRun overview
Reference · liveeval-run:swe-bench.llama-3-1-405b.2024-07
eval-run:swe-bench.llama-3-1-405b.2024-07 overview
Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for eval-run:swe-bench.llama-3-1-405b.2024-07.
Attributes
target
benchmarkId
testSetId
targetId
runAt
2024-07-23T00:00:00Z
runBy
artificial-analysis
configHash
Outgoing edges
evaluates_target1
- model:llama-3-1-405b-instruct@current·ModelVersionLlama 3.1 405B Instruct
for_benchmark1
- benchmark:swe-bench-verified·BenchmarkSWE-bench Verified
uses_test_set1
- test-set:swe-bench-verified-2024-12·TestSetSWE-bench Verified 2024-12
Incoming edges
belongs_to_eval_run1
- eval-result:swe-bench.llama-3-1-405b.001·EvalResult