Software Engineer II - AI/ML, AWS Neuron
Job description
The role involves working with the Annapurna Labs team at Amazon Web Services to develop and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators. The team focuses on building distributed training support for PyTorch and JAX within the Neuron SDK, ensuring high performance and efficiency on AWS Trainium silicon. This position offers a unique opportunity to collaborate across various technology layers and contribute to the future of AI acceleration technology. The environment encourages innovation, experimentation, and mentorship among team members.
Requirements
- 3+ years of non-internship professional software development experience or a Bachelor's degree in engineering or equivalent.
- 3+ years of experience in design or architecture of new and existing systems.
- Experience with Machine Learning and Large Language Model fundamentals, including architecture and optimization.
- Knowledge of system performance, memory management, and parallel computing principles.
- Experience in debugging, profiling, and implementing software engineering best practices in large-scale systems.
- Experience with PyTorch, JIT compilation, and AOT tracing.
- Experience with CUDA kernels or ML/low-level kernels.
- Experience with distributed training at scale.
Responsibilities
- Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators.
- Participate in all stages of the ML system development lifecycle including architecture design and performance profiling.
- Build infrastructure to systematically analyze and onboard multiple models with diverse architectures.
- Analyze and optimize system-level performance across multiple generations of Neuron hardware.
- Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.
- Conduct comprehensive testing, including unit and end-to-end model testing.
- Work directly with customers to enable and optimize their ML models on AWS accelerators.
- Collaborate across teams to develop innovative optimization techniques.
Benefits
- Employees at Amazon are often offered comprehensive health benefits, including multiple medical plan options (no pre-existing condition exclusions, 100% covered in-network preventive care), dental and vision plans, a 24/7 medical advice line from day one, expert second-opinion services, and broad mental-health support with several free counseling sessions (including pediatric). Financial wellness typically includes a 401(k) with company match (up to 2%), Restricted Stock Units (equity), FSAs, an emergency savings program, product and partner discounts, and college-savings and home-purchase programs. Overall, the package is designed to support the health, finances, and day-to-day life of employees and their families.
