II. Role overview
Reference · live · role:ml-infrastructure-engineer
ML Infrastructure Engineer overview
Builds the platform for training, serving, and monitoring ML models — GPU clusters, experiment tracking, model registries, and inference pipelines at scale.
Attributes
displayName: ML Infrastructure Engineer
isAgentic: false
requiredCapabilities: []
requiredDomains: []
description:
Builds the platform for training, serving, and monitoring ML models —
GPU clusters, experiment tracking, model registries, and inference
pipelines at scale.
Outgoing edges
applies_to (2)
- domain:ml-ops · Domain · MLOps
- domain:infrastructure · Domain · Infrastructure
holds_responsibility (3)
- responsibility:platform-reliability · Responsibility · Platform reliability
- responsibility:model-quality-assurance · Responsibility · Model quality assurance
- responsibility:capacity-planning · Responsibility · Capacity Planning
requires_expertise (3)
- skill-area:containerization · SkillArea
- skill-area:terraform-infrastructure · SkillArea · Terraform Infrastructure as Code
- skill-area:machine-learning-frameworks · SkillArea · Machine Learning Frameworks
Incoming edges
None.
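
For readers who want to work with this record programmatically, the following is a minimal sketch of how the node and its outgoing edges might be modeled. The class names `RoleNode` and `Edge`, the `targets` helper, and the snake_case field names are hypothetical and are not part of the reference itself. The node id is assumed to be `role:ml-infrastructure-engineer`, following the `type:slug` pattern of the edge targets.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Edge:
    """One outgoing edge: a relation name and the id of the target node."""
    relation: str  # e.g. "applies_to"
    target: str    # e.g. "domain:ml-ops"


@dataclass
class RoleNode:
    """Hypothetical in-memory shape for a role node (illustrative only)."""
    id: str
    display_name: str
    is_agentic: bool = False
    required_capabilities: list[str] = field(default_factory=list)
    required_domains: list[str] = field(default_factory=list)
    description: str = ""
    outgoing: list[Edge] = field(default_factory=list)

    def targets(self, relation: str) -> list[str]:
        """Return the target ids of all outgoing edges with this relation."""
        return [e.target for e in self.outgoing if e.relation == relation]


# Values copied from the record above; the id prefix "role:" is an assumption.
role = RoleNode(
    id="role:ml-infrastructure-engineer",
    display_name="ML Infrastructure Engineer",
    outgoing=[
        Edge("applies_to", "domain:ml-ops"),
        Edge("applies_to", "domain:infrastructure"),
        Edge("holds_responsibility", "responsibility:platform-reliability"),
        Edge("holds_responsibility", "responsibility:model-quality-assurance"),
        Edge("holds_responsibility", "responsibility:capacity-planning"),
        Edge("requires_expertise", "skill-area:containerization"),
        Edge("requires_expertise", "skill-area:terraform-infrastructure"),
        Edge("requires_expertise", "skill-area:machine-learning-frameworks"),
    ],
)

# Group edges by relation, mirroring the counts shown under "Outgoing edges".
for relation in ("applies_to", "holds_responsibility", "requires_expertise"):
    print(relation, len(role.targets(relation)))
```

Keeping edges as a flat list with a relation field, rather than one attribute per relation, means a new edge type can be added without changing the class.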