Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

Amazon

Location

Cupertino, CA

Type

Full-time

Posted

5/5/2026

Compensation

$193,300 - $261,500 per year

Undergraduate with 5+ Years of Experience
Approval 98.6% · Filings 19,451 · New hires 10,113 · 👑 Elite Sponsor · FY 2025

Job description

The role involves working as a kernel engineer on the Annapurna Labs team at AWS, focusing on optimizing machine learning workloads for custom ML accelerators. Engineers will work at the intersection of software, hardware, and machine learning systems, leveraging deep hardware knowledge and ML expertise. The team is dedicated to maximizing performance and enabling customers' models on AWS accelerators. This position offers a unique opportunity to contribute to cutting-edge AI acceleration technology.

Requirements

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one software programming language
  • 5+ years of experience leading design or architecture of new and existing systems
  • 5+ years of experience in the full software development life cycle
  • Experience as a mentor or tech lead
  • Bachelor's degree in computer science or equivalent
  • 6+ years of full software development life cycle experience
  • Expertise in accelerator architectures for ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures
  • Experience with GPU kernel optimization and GPGPU computing
  • Demonstrated experience with NVIDIA PTX and/or AMD GPU ISA
  • Experience developing high performance libraries for HPC applications
  • Proficiency in low-level performance optimization for GPUs
  • Experience with LLVM/MLIR backend development for GPUs
  • Knowledge of ML frameworks and their GPU backends
  • Experience with parallel programming and optimization techniques
  • Understanding of GPU memory hierarchies and optimization strategies

Responsibilities

  • Design and implement high-performance compute kernels for ML operations
  • Analyze and optimize kernel-level performance across multiple generations of Neuron hardware
  • Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks
  • Implement compiler optimizations such as fusion, sharding, tiling, and scheduling
  • Work directly with customers to enable and optimize their ML models on AWS accelerators
  • Collaborate across teams to develop innovative kernel optimization techniques

Benefits

  • Employees at Amazon are often offered comprehensive health benefits, including multiple medical plan options (no pre-existing condition exclusions, 100% covered in-network preventive care), dental and vision plans, a 24/7 medical advice line from day one, expert second-opinion services, and broad mental-health support with several free counseling sessions (including pediatric). Financial wellness typically includes a 401(k) with company match (up to 2%), Restricted Stock Units (equity), FSAs, an emergency savings program, product and partner discounts, and college-savings and home-purchase programs. Overall, the package is designed to support the health, finances, and day-to-day life of employees and their families.
