Atlas Graph Explorer

scope-boundary:tau-bench.scope

ScopeBoundarysourceref-scope/scope-boundaries/tau-bench.yaml·Open in Graph →

Attributes

subjectId

inScope

Conversational tool-use benchmark — agent must complete user-facing tasks (airline / retail) over multi-turn dialogue while invoking domain tools and respecting policy.

outOfScope

Single-turn evaluations, code-generation tasks, and benchmarks without a tool-use harness.

outOfScopeReasonIds

out-of-scope-reason:future-phase
out-of-scope-reason:implementation-detail

Outgoing edges (1)

bounds_subject1

benchmark:tau-bench·Benchmarktau-bench

Incoming edges (0)

None.