JobsAI Systems Engineer, Tooling & Infrastructure, Optimus
Job description
As a Software Engineer for the Optimus team, you will develop tools and infrastructure to enhance neural network architecture, automate data and inference pipelines, and ensure data quality through visualization tools. This role involves building a Machine Learning Platform that streamlines the ML lifecycle, impacting how Humanoid Robots operate in the real world.
Requirements
- Strong programming experience in Python and vectorization APIs such as numpy
- Experience with distributed compute systems (k8s, Slurm, LSF, etc.) or experience with large datasets / data workflows
- Experience working in a team environment
- Experience in training deep learning models and designing and deploying automation systems for machine learning workflows is a plus
Responsibilities
- Automate data, inference, and auto-labeling pipelines
- Build the tooling and infrastructure for reporting and visualizing model metrics and performance
- Manage, analyze, and validate our training and test datasets
- Coordinate with the team managing the hardware cluster to maintain high availability / jobs throughput for Machine Learning
- Drive implementation of best practices and monitoring systems to proactively detect and address issues in our production environment
- Build and improve our Python training infrastructure for stable and faster training and validate our PyTorch models
Benefits
- Employees at Tesla are often offered day-one coverage with multiple medical options (some at $0 paycheck cost), dental/vision, company HSA contributions, a 401(k) match, and equity programs. Most roles also include paid time off and holidays, family-building support, employee assistance, commuter and childcare benefits, and access to discounts and wellness programs.
Is this posting expired or inaccurate?
