II.
LibraryAgent overview
Reference · livelib-agent:ai-agents-conversational--agent-evaluator
agent-evaluator overview
Designs evaluation frameworks and benchmarks
Attributes
displayName: agent-evaluator
description: Designs evaluation frameworks and benchmarks
libraryPath: library/specializations/ai-agents-conversational/agents/agent-evaluator/AGENT.md
specialization: ai-agents-conversational
role: Safety and Evaluation Specialist
expertise:
- Evaluation framework design
- Benchmark creation
- Metric selection
- Test automation
- Quality assurance
Outgoing edges
lib_applies_to_domain (1)
- domain:software-engineering · Domain · Software Engineering
lib_belongs_to_specialization (1)
- specialization:ai-agents-conversational · Specialization
lib_implements_workflow (2)
- workflow:ml-model-lifecycle · Workflow · ML Model Lifecycle
- workflow:feature-development · Workflow
lib_involves_role (3)
- role:ml-engineer · Role · Machine Learning Engineer
- role:backend-engineer · Role · Backend Engineer
- role:ai-champion · Role · AI Champion
lib_requires_skill_area (2)
- skill-area:eval-driven-development · SkillArea · Eval-Driven LLM Development
- skill-area:agent-simulation-testing · SkillArea · Agent Simulation and Testing
Incoming edges
uses_agent (3)
- lib-process:ai-agents-conversational--agent-evaluation-framework · LibraryProcess · agent-evaluation-framework
- lib-process:ai-agents-conversational--react-agent-implementation · LibraryProcess · react-agent-implementation
- lib-process:ai-agents-conversational--regression-testing-agent · LibraryProcess · regression-testing-agent
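
The edge lists above can be read as (edge type, node id) pairs attached to this agent. The following is a minimal sketch of that reading, not the library's actual storage format: the tuple layout and the helper names `group_by_edge_type` and `edge_counts` are assumptions made for illustration, while the identifiers are copied from this page. Grouping the pairs by edge type reproduces the number that follows each edge name.

```python
from collections import defaultdict

# Node id of this agent, as given in the "Reference" line of this page.
AGENT_ID = "livelib-agent:ai-agents-conversational--agent-evaluator"

# Edges copied from this page. Storing them as (edge_type, node_id) tuples
# is an assumption for illustration, not the library's real schema.
OUTGOING = [
    ("lib_applies_to_domain", "domain:software-engineering"),
    ("lib_belongs_to_specialization", "specialization:ai-agents-conversational"),
    ("lib_implements_workflow", "workflow:ml-model-lifecycle"),
    ("lib_implements_workflow", "workflow:feature-development"),
    ("lib_involves_role", "role:ml-engineer"),
    ("lib_involves_role", "role:backend-engineer"),
    ("lib_involves_role", "role:ai-champion"),
    ("lib_requires_skill_area", "skill-area:eval-driven-development"),
    ("lib_requires_skill_area", "skill-area:agent-simulation-testing"),
]

INCOMING = [
    ("uses_agent", "lib-process:ai-agents-conversational--agent-evaluation-framework"),
    ("uses_agent", "lib-process:ai-agents-conversational--react-agent-implementation"),
    ("uses_agent", "lib-process:ai-agents-conversational--regression-testing-agent"),
]


def group_by_edge_type(edges):
    """Hypothetical helper: collect node ids under their edge type."""
    grouped = defaultdict(list)
    for edge_type, node_id in edges:
        grouped[edge_type].append(node_id)
    return dict(grouped)


def edge_counts(edges):
    """Hypothetical helper: number of edges per edge type."""
    return {edge_type: len(ids) for edge_type, ids in group_by_edge_type(edges).items()}


if __name__ == "__main__":
    print(AGENT_ID)
    print("outgoing:", edge_counts(OUTGOING))
    print("incoming:", edge_counts(INCOMING))
```

Keeping edges as flat pairs makes it easy to regroup them by type or by target node without committing to a particular graph library.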