JobsLead AI Infrastructure Engineer, Reinforcement Learning
Lead AI Infrastructure Engineer, Reinforcement Learning
AMDLead AI Infrastructure Engineer, Reinforcement Learning
AMDLocation
Santa Clara, CA
Type
Full-time
Posted
7/3/2026
Compensation
USD $178,500.00/Yr. – USD $255,000.00/Yr.
Undergraduate with 5+ Years of Experience
Approval 98.6%·Filings 728·New hires 184·
✓ Established Sponsor
·FY 2025Job description
The Lead AI Infrastructure Engineer, Reinforcement Learning at AMD will be responsible for managing reinforcement learning infrastructure at scale. This includes overseeing distributed policy and value training, rollout generation, and researcher-facing APIs across large GPU fleets. The role emphasizes improving the productivity of RL scientists by enhancing system reliability and performance. The ideal candidate will work closely with research scientists to ensure efficient and effective infrastructure solutions.
Requirements
- Bachelor's degree required; Master's or PhD preferred in Computer Science.
- Strong systems track record in machine learning platforms.
- Deep experience with PyTorch or JAX, NCCL/MPI-style distributed training, and GPU cluster orchestration.
- Prior ownership of RL training infrastructure or large-scale experiment management.
- Proficiency in C++ and Python performance tuning, I/O optimization, and containerized workloads.
Responsibilities
- Design and implement distributed RL training stacks integrated with AMD's schedulers and storage.
- Build high-throughput rollout workers, trajectory stores, and reward computation pipelines.
- Instrument jobs for debugging and implement autoscaling and preemption-safe checkpointing.
- Collaborate with research scientists on experiment templates and hyperparameter sweeps.
- Drive reliability through on-call rotations, runbooks, and postmortems for infrastructure incidents.
Benefits
- AMD provides a competitive 'Total Rewards' package that focuses on financial growth, health, and work-life balance.
Is this posting expired or inaccurate?
