Senior Software Development Engineer - LLM Kernel & Inference Systems

AMD
Santa Clara, CA | Full-time | 3/17/2026 | $192k - $288k per year
PhD Entry-Level
Approval 98.6% | Total filings 728 | New hires 184
Established Sponsor
FY 2025

Job Description

As a Senior Member of Technical Staff at AMD, you will lead efforts to optimize Large Language Model (LLM) inference and GPU kernels for AMD GPUs. The role focuses on high-performance LLM serving and involves collaborating with internal teams and open-source communities to improve GPU kernels and inference runtimes.

Requirements

  • Deep understanding of Large Language Model inference, including attention mechanisms and batching strategies.
  • Hands-on experience with LLM inference frameworks such as vLLM or SGLang.
  • Proven experience optimizing GPU kernels for deep learning workloads.
  • Experience designing and tuning large-scale inference systems across multiple GPUs and nodes.
  • Track record of meaningful upstream contributions to ML or systems-level open-source projects.
  • Strong proficiency in Python and C++, with experience in performance analysis and debugging.
  • Experience running and optimizing large-scale workloads on heterogeneous GPU clusters.
  • Solid foundation in compiler concepts and tooling like LLVM and ROCm.

Responsibilities

  • Optimize LLM inference frameworks for AMD GPUs.
  • Design and optimize GPU kernels critical to LLM inference.
  • Design, implement, and tune multi-GPU and multi-node inference strategies.
  • Collaborate with model and framework teams for hardware-aware optimizations.
  • Leverage compiler technologies to improve kernel fusion and memory access patterns.
  • Optimize the full inference stack from model execution to deployment.
  • Engage with open-source maintainers to upstream optimizations.
  • Apply best practices in software engineering, including performance benchmarking and testing.

Benefits

  • AMD provides a competitive 'Total Rewards' package that focuses on financial growth, health, and work-life balance.
