Back to jobsJob overview

About the role

Site Reliability Engineer II at Microsoft

Required Skills

pythonbashpowershellcloud infrastructureautomationsite reliability engineeringdistributed systemsaimonitoring

About the Role

The Site Reliability Engineer II ensures safe software deployments and operational excellence for Azure Cloud. They leverage automation, AI, and telemetry to maintain reliability at scale. Responsibilities include writing automation scripts, managing safe deployment processes, and responding to incidents.

Key Responsibilities

  • Write code or scripts to automate scalable operations processes
  • Create, test, and deploy changes through safe deployment processes
  • Use tools and AI to troubleshoot system availability and performance
  • Enable team velocity for reliable and safe production deployments
  • Respond to incidents during on-call rotations and develop monitoring alerts

Required Skills & Qualifications

Must Have:

  • 4+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree with 1+ year experience OR Master's Degree
  • 1+ years experience in Cloud Infrastructure and Data Center Expertise
  • 1+ years experience in Programming and Automation Skills with Python and Bash or PowerShell
  • Ability to pass Microsoft Cloud Background Check

Nice to Have:

  • 5+ years technical experience OR Bachelor's Degree with 2+ years experience OR Master's Degree with 1+ year experience
  • 1+ year people management experience

Benefits & Perks

  • Industry leading healthcare