Jobs
San Francisco, CA | $220K - $300K | Full-time | On-site
Infrastructure | Kubernetes | MLOps | Python
Posted April 5, 2026
About the role
Lead the ML Platform team at Scale AI, building the infrastructure that powers data labeling and model evaluation at unprecedented scale. You'll own the end-to-end platform that thousands of ML engineers depend on daily.
Responsibilities
- Lead a team of 6-8 engineers building ML platform infrastructure
- Design and operate Kubernetes-based ML training and serving systems
- Build developer tools and abstractions that accelerate ML workflows
- Partner with product teams to translate requirements into scalable architecture
- Drive reliability, observability, and cost-efficiency of ML infrastructure
Requirements
- 7+ years of software engineering, with 3+ years leading platform/infra teams
- Deep Kubernetes expertise — you've operated large clusters in production
- Experience building ML training or serving infrastructure
- Strong communication skills — you can align engineering and product stakeholders
- BS/MS in Computer Science or equivalent experience
Interested in this role?
Apply directly on Scale AI's website.