About the role

Principal Software Engineer at Microsoft

Required Skills

Python, C++, Java, Azure, Kubernetes, microservices, containers, AI/ML, distributed systems

About the Role

This Principal Software Engineer role focuses on improving Azure cloud reliability through site reliability engineering (SRE) practices. Responsibilities include defining SLOs, enhancing observability, building automation, and leveraging AI for incident prediction. The role requires deep technical expertise in distributed systems and cloud architecture.

Key Responsibilities

  • Defining system reliability goals through Service Level Objectives (SLOs)
  • Enhancing production posture with improvements in observability and operability
  • Building reusable automation and processes for reliability goals
  • Influencing product architecture and roadmaps for reliability design
  • Leveraging AI to detect anomalies and automate operational workflows

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related field AND 6+ years of technical engineering experience coding in languages such as C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
  • 5+ years of experience building, shipping, and operating reliable solutions
  • 5+ years of experience with modern distributed software design patterns and cloud systems architecture (microservices, containers, load balancing, queuing, caching)
  • Ability to pass Microsoft Cloud Background Check

Nice to Have:

  • Master's Degree in Computer Science or related field AND 8+ years of experience, OR Bachelor's Degree AND 12+ years of experience
  • Experience as a technical lead or engineering manager
  • Experience working on large and unfamiliar codebases (millions of lines of code)
  • Experience with open-source projects, Kubernetes, Linux, and containers
  • Experience with data technologies (SQL/NoSQL)
  • Experience with Azure
  • Experience adopting AI tools such as GitHub Copilot and Azure OpenAI

Benefits & Perks

  • Industry-leading healthcare