Senior AI Hardware Systems Engineer, Annapurna Labs, Trainium Machine Learning Fleet Operations
Amazon
Location
Austin, TX
Type
Full-time
Posted
5/12/2026
Compensation
$159,200 - $215,300 per year
Undergraduate with 5+ Years of Experience
Approval 98.6% · Filings 19,451 · New hires 10,113 · 👑 Elite Sponsor · FY 2025
Job description
The Platform Development Engineer role at Annapurna Labs focuses on maintaining and optimizing a fleet of advanced machine learning servers. The engineer will work closely with hardware and software teams to debug issues and enhance customer experience. This position requires a strong background in both software development and hardware troubleshooting. The team is dedicated to ensuring high-quality performance and rapid incident response for cutting-edge ML products.
Requirements
- 4+ years of programming experience with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, or Ruby.
- 3+ years of professional software development experience or a Bachelor's degree in engineering or equivalent.
- 3+ years of experience designing or architecting new and existing systems.
- Experience in computer architecture and general troubleshooting/debugging of hardware.
- 2+ years of server hardware troubleshooting and repair experience.
- Master's degree or above in electrical engineering, computer engineering, or equivalent.
- Experience with SoC bring-up and post-silicon validation.
Responsibilities
- Be a member of a team responsible for system remediation, operational excellence, and customer experience on ML products.
- Utilize data to root cause hardware failures and identify trends on complex systems.
- Implement and improve system level testing across the product lifecycle.
- Develop maintainable, documented, tested, and reusable software.
- Dive deep on issues at the intersection of hardware and software.
- Maximize the health, sellability, and customer experience of the ML server platform.
- Review dashboards to identify trends and triage emergent issues.
- Partner with hardware and software engineering teams to debug and investigate issues.
- Manage software deployments and run status meetings to align stakeholders.
Benefits
- Employees at Amazon are often offered comprehensive health benefits, including multiple medical plan options (no pre-existing condition exclusions, 100% covered in-network preventive care), dental and vision plans, a 24/7 medical advice line from day one, expert second-opinion services, and broad mental-health support with several free counseling sessions (including pediatric). Financial wellness typically includes a 401(k) with company match (up to 2%), Restricted Stock Units (equity), FSAs, an emergency savings program, product and partner discounts, and college-savings and home-purchase programs. Overall, the package is designed to support the health, finances, and day-to-day life of employees and their families.
