Back to jobsJob overview

About the role

Sr. ML Performance Engineer, AWS Neuron, Annapurna Labs at Amazon Development Centre Canada ULC

Required Skills

machine learninghigh-performance computingdistributed systemscompiler optimizationperformance profilingpytorchtensorflowjaxaws neuron sdk

About the Role

Senior ML Performance Engineer role at AWS Neuron/Annapurna Labs focused on optimizing machine learning workloads on AWS custom ML accelerators (Inferentia/Trainium). The position involves analyzing system-level performance across the entire technology stack, implementing compiler optimizations, and working directly with customers to enable and optimize their ML models.

Key Responsibilities

  • Analyze and optimize system-level performance of machine learning models across frameworks to runtime
  • Conduct detailed performance analysis and profiling of ML workloads, identifying bottlenecks in large-scale ML systems
  • Work directly with customers to enable and optimize their ML models on AWS accelerators
  • Design and implement compiler optimizations, transforming manual improvements into automated compiler passes
  • Collaborate across teams to develop innovative optimization techniques for AWS Neuron SDK

Required Skills & Qualifications

Must Have:

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team

Nice to Have:

  • 5+ years of full software development life cycle experience
  • Bachelor's degree in computer science or equivalent

Benefits & Perks

  • Inclusive team culture with diversity and inclusion initiatives
  • Work/life balance with flexible working hours
  • Mentorship and career growth opportunities
  • Broad mix of experience levels and knowledge sharing environment