II.
SkillArea overview
Reference · liveskill-area:model-evaluation
Model Evaluation & Selection overview
Evaluating ML model performance — cross-validation strategies, metric selection (precision/recall/F1, AUC-ROC, RMSE), statistical significance testing, hyperparameter optimization, and model comparison frameworks.
Attributes
displayName
Model Evaluation & Selection
description
Evaluating ML model performance — cross-validation strategies,
metric selection (precision/recall/F1, AUC-ROC, RMSE), statistical
significance testing, hyperparameter optimization, and model
comparison frameworks.
domains
expertiseLevels
- intermediate
- expert
Outgoing edges
applies_to1
- specialization:data-science-ml·Specialization
prerequisite_for_learning1
- skill-area:model-serving·SkillAreaModel Serving
Incoming edges
lib_requires_skill_area5
- lib-agent:ai-agents-conversational--fine-tuning-specialist·LibraryAgentfine-tuning-specialist
- lib-agent:ai-agents-conversational--retrieval-optimizer·LibraryAgentretrieval-optimizer
- lib-agent:data-science-ml--model-evaluator·LibraryAgentmodel-evaluator
- lib-skill:ai-agents-conversational--rag-reranking·LibrarySkillrag-reranking
- lib-skill:data-science-ml--sklearn-model-trainer·LibrarySkillsklearn-model-trainer
prerequisite_for_learning2
- skill-area:machine-learning·SkillAreaMachine Learning
- skill-area:feature-engineering·SkillAreaFeature Engineering
requires_expertise3
- responsibility:model-training-quality·ResponsibilityModel training quality
- responsibility:model-drift-monitoring·ResponsibilityModel drift monitoring
- responsibility:model-quality-assurance·ResponsibilityModel quality assurance
requires_skill_area2
- stack-profile:legal-document-automation·StackProfileLegal Document Automation Stack (Python, NLP, Elasticsearch, FastAPI, React, S3)
- stack-profile:synthetic-data-generation·StackProfileSynthetic Data Generation Stack (Python, PyTorch, FastAPI, PostgreSQL, S3)