Atlas Graph Explorer
eval-run:bfcl.claude-sonnet-4-5.2025-09
EvalRun · benchmarks/eval-runs/eval-runs-anthropic.yaml
Attributes
target: model:claude-sonnet-4-5@current
benchmarkId: benchmark:berkeley-function-calling
testSetId: test-set:bfcl-v3
targetId: model:claude-sonnet-4-5@current
runAt: 2025-09-29T00:00:00Z
runBy: berkeley-gorilla
configHash: sha256:placeholder-claude-sonnet-4-5-bfcl-v3
Outgoing edges (3)
evaluates_target (1): model:claude-sonnet-4-5@current · ModelVersion
for_benchmark (1): benchmark:berkeley-function-calling · Benchmark · Berkeley Function Calling Leaderboard (BFCL)
uses_test_set (1): test-set:bfcl-v3 · TestSet · Berkeley Function Calling Leaderboard v3
Incoming edges (1)
belongs_to_eval_run (1): eval-result:bfcl.claude-sonnet-4-5.001 · EvalResult
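
The record above can be sketched as a small data structure. This is a minimal illustration, not the explorer's actual schema: the snake_case field names are hypothetical renderings of the attributes listed on this page, and the idea that each outgoing edge mirrors one ID attribute is an assumption inferred from the matching values shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalRun:
    """Hypothetical in-memory form of the EvalRun node shown on this page."""

    id: str
    target_id: str
    benchmark_id: str
    test_set_id: str
    run_at: str       # ISO 8601 timestamp, kept as a string
    run_by: str
    config_hash: str

    def outgoing_edges(self):
        """Return (edge_type, destination_id) pairs.

        Assumption: each outgoing edge is derived from one ID attribute.
        """
        return [
            ("evaluates_target", self.target_id),
            ("for_benchmark", self.benchmark_id),
            ("uses_test_set", self.test_set_id),
        ]


# Values copied from the attributes listed on this page.
run = EvalRun(
    id="eval-run:bfcl.claude-sonnet-4-5.2025-09",
    target_id="model:claude-sonnet-4-5@current",
    benchmark_id="benchmark:berkeley-function-calling",
    test_set_id="test-set:bfcl-v3",
    run_at="2025-09-29T00:00:00Z",
    run_by="berkeley-gorilla",
    config_hash="sha256:placeholder-claude-sonnet-4-5-bfcl-v3",
)

edges = run.outgoing_edges()
for edge_type, destination in edges:
    print(f"{run.id} -[{edge_type}]-> {destination}")
```

The incoming belongs_to_eval_run edge is not modeled here, since it originates from the EvalResult node rather than from this record.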