Atlas Graph Explorer
Wiki
Graph
Edges
Home
SkillArea
skill-area:eval-driven-development
Eval-Driven LLM Development
skill-area:eval-driven-development
SkillArea
domain/skill-areas/skill-areas-ai-ml.yaml
·
Open in Graph →
overview
json
graph
Attributes
displayName
Eval-Driven LLM Development
description
Defining evals before features — golden sets, rubric scoring, LLM-as-judge with calibration, and regression gates.
domains
specialization:ai-agents-conversational
expertiseLevels
intermediate
expert
Outgoing edges
(1)
applies_to
1
specialization:ai-agents-conversational
·
Specialization
AI Agents & Conversational AI
Incoming edges
(46)
addresses
2
skill:babysitter-retrospect
·
Skill
babysitter:retrospect
skill:babysitter-accomplish-status
·
Skill
babysitter:accomplish-status
requires_expertise
8
responsibility:ai-agent-usage-review
·
Responsibility
AI Agent Usage Review
responsibility:ai-tooling-evaluation
·
Responsibility
AI Tooling Evaluation
role:ai-champion
·
Role
AI Champion
role:data-scientist
·
Role
Data Scientist
role:ml-engineer
·
Role
ML Engineer
role:planner
·
Role
Planner
role:ml-engineer-convergent
·
Role
ML Engineer
role:product-owner
·
Role
Product Owner
requires_skill_area
36
skill-area:hallucination-mitigation-fact-checking
·
SkillArea
Hallucination Mitigation and Fact Checking
skill-area:agent-simulation-testing
·
SkillArea
Agent Simulation and Testing
workflow:rag-pipeline-evaluation
·
Workflow
RAG Pipeline Evaluation
workflow:ai-content-moderation-review
·
Workflow
AI Content Moderation Review
workflow:ai-agent-adoption-rollout
·
Workflow
AI Agent Adoption Rollout
workflow:ai-usage-review
·
Workflow
AI Agent Usage Review
workflow:ai-knowledge-sharing
·
Workflow
AI Knowledge Sharing
workflow:ai-pair-programming-governance
·
Workflow
AI Pair-Programming Governance
workflow:ai-model-license-compliance
·
Workflow
AI Model License Compliance
workflow:algo-strategy-backtesting
·
Workflow
Algorithmic Strategy Backtesting
workflow:adas-validation-cycle
·
Workflow
ADAS Validation Cycle
workflow:process-simulation-review
·
Workflow
Process Simulation Review
workflow:model-fairness-audit
·
Workflow
Model Fairness Audit
workflow:ml-model-versioning-governance
·
Workflow
ML Model Versioning Governance
workflow:adaptive-learning-model-review
·
Workflow
Adaptive Learning Model Review
workflow:landing-page-optimization-cycle
·
Workflow
Landing Page Optimization Cycle
workflow:growth-experiment-review
·
Workflow
Growth Experiment Review
workflow:growth-experimentation-platform-setup
·
Workflow
Growth Experimentation Platform Setup
workflow:quality-control-audit
·
Workflow
Quality Control Audit
workflow:underwriting-model-validation
·
Workflow
Underwriting Model Validation
workflow:contract-automation-review
·
Workflow
Contract Automation Review
workflow:hypothesis-driven-experiment
·
Workflow
Hypothesis-Driven Experiment
workflow:prompt-regression-testing
·
Workflow
Prompt Regression Testing
workflow:llm-eval-pipeline
·
Workflow
LLM Evaluation Pipeline
workflow:model-card-maintenance
·
Workflow
Model Card Maintenance
workflow:impact-measurement-review
·
Workflow
Impact Measurement Review
workflow:computational-experiment-validation
·
Workflow
Computational Experiment Validation
workflow:competitive-landscape-analysis
·
Workflow
Competitive Landscape Analysis
workflow:quant-model-peer-review
·
Workflow
Quant Model Peer Review
workflow:quantum-algorithm-benchmarking
·
Workflow
Quantum Algorithm Benchmarking
workflow:error-correction-validation
·
Workflow
Error Correction Validation
workflow:revenue-forecasting-model-calibration
·
Workflow
Revenue Forecasting Model Calibration
workflow:support-chatbot-performance-review
·
Workflow
Support Chatbot Performance Review
workflow:ai-agent-adoption-rollout
·
Workflow
AI Agent Adoption Rollout
workflow:ai-usage-review
·
Workflow
AI Agent Usage Review
workflow:ai-knowledge-sharing
·
Workflow
AI Knowledge Sharing