JobsAI Infrastructure Engineer
Job description
The role of DevOps / Platform Engineer at AMD focuses on building and operating large-scale GPU compute infrastructure that supports AI and ML workloads. The ideal candidate will have a passion for software engineering and strong leadership skills to manage multi-quarter projects. This position requires effective communication and collaboration within a dynamic team environment. The engineer will be involved in extending platform capabilities and ensuring optimal performance across various workloads.
Requirements
- 5+ years of experience in DevOps, Platform, or Infrastructure Engineering.
- Deep hands-on experience with Kubernetes and container orchestration at scale.
- Proven ability to design and deliver platform features that serve internal customers or developer teams.
- Experience building developer-facing platforms or internal developer portals.
Responsibilities
- Build and extend platform capabilities to enable new classes of workloads.
- Design and operate scalable orchestration systems using Kubernetes across both on-prem and multi-cloud environments.
- Develop platform features such as secret management, configuration management, and deployment automation for customers.
- Partner with development teams to extend the GPU developer platform with features, APIs, templates, and self-service workflows.
- Manage service lifecycle within Kubernetes using Helm and GitOps workflows.
- Apply expertise in storage and networking to design and integrate CSI drivers, persistent volumes, and network policies.
Benefits
- AMD provides a competitive 'Total Rewards' package that focuses on financial growth, health, and work-life balance.
Is this posting expired or inaccurate?
