
About the role

Software Engineer II at Microsoft

Required Skills

C++, Python, AI infrastructure, virtualization, distributed systems, high performance computing, performance analysis, GPU, operating systems

About the Role

This is a Software Engineer II role on the Azure HPC/AI team, focused on designing and delivering next-generation AI training platforms. Responsibilities include hardware/software interactions, device virtualization, and performance analysis of GPU workloads in VMs. The position contributes to Microsoft's AI infrastructure and initiatives.

Key Responsibilities

  • Analyzes functionality, integration, and performance issues at hardware/software stack levels
  • Designs and codes solutions to improve correctness, stability, and performance of AI training VMs
  • Optimizes, debugs, refactors, and reuses code for performance and maintainability
  • Acts as Designated Responsible Individual (DRI) and shares on-call duties for system monitoring
  • Develops playbooks and maintains communication with partners across the Microsoft ecosystem

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related technical field OR equivalent experience
  • Technical engineering experience with coding in languages like C, C++, C#, Java, JavaScript, or Python
  • Experience in HPC or Machine Learning
  • Ability to pass Microsoft Cloud Background Check security screening

Nice to Have:

  • Master's Degree in Computer Science or related technical field OR equivalent experience
  • Familiarity with Machine Learning, AI Infrastructure, Operating Systems fundamentals, virtualization, and Accelerators
  • Experience with Distributed Systems, HPC/ML middleware, Hardware-Software Co-Design, and Profiling and Performance Analysis Tools

Benefits & Perks

  • Industry-leading healthcare