JobsSoftware Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
AmazonSoftware Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference
AmazonLocation
Cupertino, CA
Type
Full-time
Posted
5/4/2026
Compensation
$165,200 - $223,600 per year
Undergraduate with 2+ Years of Experience
Approval 98.6%·Filings 19,451·New hires 10,113·
👑 Elite Sponsor
·FY 2025Job description
This role is focused on building distributed inference support for PyTorch within the Neuron SDK at Amazon Web Services. The Inference Enablement and Acceleration team is dedicated to optimizing machine learning models for AWS's custom ML accelerators, Trainium and Inferentia. Engineers in this team work closely with customers to ensure optimal performance of their machine learning workloads. The position offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures.
Requirements
- Bachelor's degree in computer science or equivalent.
- 5+ years of non-internship professional software development experience.
- 5+ years of non-internship design or architecture experience.
- Fundamentals of machine learning and LLMs, their architecture, training, and inference lifecycles.
- Software development experience in C++ and Python.
- Strong understanding of system performance, memory management, and parallel computing principles.
- Familiarity with PyTorch, JIT compilation, and AOT tracing.
- Experience with online/offline inference serving in production environments.
Responsibilities
- Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators.
- Participate in all stages of the ML system development lifecycle including architecture design, implementation, and performance profiling.
- Build infrastructure to systematically analyze and onboard multiple models with diverse architecture.
- Analyze and optimize system-level performance across multiple generations of Neuron hardware.
- Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.
- Work directly with customers to enable and optimize their ML models on AWS accelerators.
- Collaborate across teams to develop innovative optimization techniques.
Benefits
- Employees at Amazon are often offered comprehensive health benefits—including multiple medical plan options (no pre-existing condition exclusions, 100% covered in-network preventive care), dental and vision plans, a 24/7 medical advice line from day one, expert second-opinion services, and broad mental-health support with several free counseling sessions (including pediatric). Financial wellness typically includes a 401(k) with company match (up to 2%), Restricted Stock Units (equity), FSAs, an emergency savings program, product and partner discounts, and even college-savings and home-purchase programs. Overall, the package is designed to support employees and their families’ health, finances, and day-to-day life.
Is this posting expired or inaccurate?
