Back to jobsJob overview

About the role

Senior Software Engineer at Microsoft

Required Skills

pythondistributed systemsflinkkafkakubernetesgpu inferencetritononnxopentelemetry

About the Role

Senior Software Engineer role focused on building real-time data pipelines and GPU inference serving systems for large-scale ML models. Responsibilities include designing ETL pipelines, optimizing performance, and collaborating with applied scientists. Requires extensive experience with distributed systems, streaming frameworks, and GPU inference technologies.

Key Responsibilities

  • Design & code real-time ETL/feature pipelines (e.g., Flink or Spark Structured Streaming)
  • Define and meet SLOs with OpenTelemetry/Prometheus/Grafana for metrics, tracing, and alerting
  • Implement robust queuing/streaming with Kafka/Pulsar
  • Optimize GPU inference services on Triton Inference Server (or ONNX Runtime/TensorRT)
  • Profile & optimize end-to-end: CPU/GPU utilization, memory, I/O, vectorization, caching

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 6+ years building distributed systems in production
  • Production experience with streaming frameworks (Flink or Spark) and messaging (Kafka)
  • Hands-on with Kubernetes and containers; comfort with service ops (logs, metrics, scaling)
  • Practical experience with GPU inference on Triton or ONNX Runtime/TensorRT (model packaging, runtime tuning, batching)

Nice to Have:

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience
  • Experience with real-time feature stores or embedding pipelines
  • Prior contributions to GPU batching, dynamic scheduling, or multi-model serving

Benefits & Perks

  • Industry leading healthcare