Back to jobsJob overview

About the role

Software Engineer II at Microsoft

Required Skills

distributed systemsc++pythonai/mlgenerative aiagile methodologiesobservabilitycloud computingazure

About the Role

Software Engineer II role on the Azure Compute Availability Platform team, focusing on ensuring high SLA for Azure VMs through AI-driven monitoring and repair services. Responsibilities include designing hyperscale services, integrating predictive AI models, and collaborating with data scientists to enhance platform resilience and automation.

Key Responsibilities

  • Partners with stakeholders to determine project requirements and leads design of change management features
  • Develops high-quality, maintainable code and supports livesite as Designated Responsible Individual (DRI)
  • Collaborates with data scientists to integrate predictive models for hardware anomaly detection and live migration
  • Leads initiatives to embed AI-driven diagnostics and root cause analysis into availability services
  • Evaluates and integrates large-scale AI models into control plane services for smarter repair decisions

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including C, Rust, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements, including Microsoft Cloud Background Check

Nice to Have:

  • Master's Degree in Computer Science or related technical field AND technical engineering experience
  • Ability and passion for designing and building highly available distributed systems at scale
  • Ability to exercise sound judgment in ambiguous situations
  • Experience with agile methodologies and willingness to adopt them

Benefits & Perks

  • Industry leading healthcare