New
Senior Product Manager
![]() | |
![]() United States, Texas, Irving | |
![]() 7000 State Highway 161 (Show on map) | |
![]() | |
OverviewThe Azure High Performance Computing & Artificial Intelligence (HPC & AI) compute platform team defines and delivers the hardware roadmap, software, and services that enable our users to run technical computing workloads in Azure - extreme scale AI training and inference including large language models, traditional HPC modelling and simulations, remote visualization, and immersive gaming experiences. We are responsible for providing Azure customers with supercomputer-class capabilities to accelerate their research, drive competitive differentiation, and finding answers to some of the most difficult questions of science and industry. We are looking for a Senior Product Manager who has a proven track record working with complex and distributed engineering systems that are used in operating large-scale cloud solutions. This role requires a candidate with an ability to deliver technical solutions, model Microsoft values and support cross-organization programs to make them successful.Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesFleet Health Optimization: Analyze telemetry and operational data to identify regressions, inefficiencies, and trends. Develop and execute end-to-end plans to maintain optimal fleet health and reduce operational overhead.Cross-Team Collaboration: Partner with hardware, software, and Azure engineering teams to align priorities, unblock dependencies, and drive impactful solutions.Workflow Automation: Identify manual processes and lead initiatives to automate them, improving reliability and scalability of operations.Telemetry: Leverage existing telemetry, identify gaps and evolve platform insights as needed to support business goals.Program Execution: Manage timelines, track deliverables, and communicate status updates to stakeholders across engineering, capacity planning, and finance teams.AI Agent Enablement: Define use cases, data requirements, and integration strategies for AI agents that support automated diagnostics and decision-making workflows. Ensure performance and efficacy of AI agents.Stakeholder Engagement: Facilitate communication across diverse teams to ensure alignment and smooth execution of complex programs. |