Back to jobsJob overview

About the role

Software Engineering at Microsoft

Required Skills

pythonc#javaazureaisredistributed systemsautomation

About the Role

This role is for a Software Engineer in the Azure Reliability team, focusing on Site Reliability Engineering (SRE) to enhance Azure's availability, reliability, and operability. Responsibilities include improving system reliability through SLOs, building automation, influencing product architecture, and leveraging AI to scale reliability efforts. The position requires collaboration, problem-solving, and a customer-obsessed mindset.

Key Responsibilities

  • Improve reliability of key Azure products through SRE practices
  • Define system reliability goals via Service Level Objectives (SLOs)
  • Build reusable automation and processes for reliability
  • Influence product architecture and roadmaps for reliability
  • Leverage AI to detect anomalies and automate operational workflows

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related technical discipline with coding experience
  • Proven experience coding in languages like C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience

Nice to Have:

  • Familiarity with modern distributed software design patterns and cloud systems architecture
  • Proven track record in building, shipping, and operating reliable solutions
  • Proficiency in programming languages like C#/Java/Python
  • Experience with data technologies (SQL/NoSQL/etc.)
  • Experience with Azure
  • Experience in AI adoption with tools like GitHub Copilot, Azure OpenAI

Benefits & Perks

  • Industry leading healthcare