Director, Product Management - Inference as a Service (IaaS)
About the Role
You will lead the product vision and roadmap for an Inference as a Service platform. You will gather and prioritize customer requirements from AI developers and enterprise users, collaborate with engineering, data center operations, and energy teams to meet GPU availability and latency targets, and work with sales to support customer onboarding and SLA alignment. You will define product KPIs, collect and analyze usage data, iterate on the platform based on performance insights, and ensure compliance, cost-efficiency, and environmental goals are met.
Requirements
- 8–12+ years of product management experience, ideally in data infrastructure, cloud services, or AI/ML domains
- Strong technical understanding of ML inference workloads and GPU or accelerator environments
- Proven experience launching infrastructure products or platforms used by technical teams
- Excellent communication and stakeholder management skills
- Experience with Kubernetes, Triton, or model-serving platforms is a plus
- Preferred experience in high-performance computing, Bitcoin mining operations, immersion cooling or liquid-cooled infrastructure, SCADA or PLC systems, and energy infrastructure is beneficial
Responsibilities
- Define and drive the product vision and roadmap for the IaaS inference offering
- Gather and prioritize customer requirements from AI developers, enterprise users, and internal stakeholders
- Collaborate with engineering, data center operations, and energy teams to meet GPU availability and latency targets
- Support customer onboarding and ensure alignment with SLAs alongside sales and business development
- Identify market trends and emerging AI inference patterns and integrate them into product strategy
- Define product KPIs, collect usage data, and iterate quickly based on performance insights
- Ensure the platform meets compliance, cost-efficiency, and environmental objectives
