JobsInference Optimization Engineer (local / edge runtime)
Inference Optimization Engineer (local / edge runtime)
IntelInference Optimization Engineer (local / edge runtime)
IntelLocation
Santa Clara, CA, Hillsboro, OR, Folsom, CA, Phoenix, AZ
Type
Full-time
Posted
6/16/2026
Compensation
$170,500 - $315,490 per year
Undergraduate with 5+ Years of Experience
Master's with 5+ Years of Experience
Approval 96.6%·Filings 2,117·New hires 632·
💎 Strong Sponsor
·FY 2025Job description
The role focuses on optimizing inference engines for local and edge environments, ensuring models run efficiently on hardware owned by users. The team is dedicated to building agentic AI that balances local and cloud intelligence while maintaining data privacy and cost-effectiveness. Candidates will work with technologies like llama.cpp and vLLM, tuning performance for latency, throughput, and memory. This position is critical for developing low-cost AI solutions that are both powerful and trustworthy.
Requirements
- BS/MS in Computer Science, Electrical Engineering, Mathematics, or a related STEM field
- 5+ years of software development experience
- Strong proficiency in C++ and/or Python, with the ability to read systems-level code
- Understanding of how LLM inference works, including attention, KV cache, and decoding
- Experience in profiling and optimizing real performance problems on CPU or GPU
- Expertise in Linux, build systems, and low-level debugging
Responsibilities
- Profile and optimize local inference for latency, throughput, and memory on edge hardware
- Tune KV cache, continuous batching, and scheduling for interactive agent workloads
- Drive quantization strategy and validate quality impact with the Post-Training team
- Reduce CPU overhead and improve engine startup, model load, and lifecycle management
- Benchmark across hardware tiers and publish performance comparisons
- Contribute upstream fixes and patches to open-source engines where beneficial
Benefits
- Intel offers a comprehensive benefits package including competitive pay, stock programs, healthcare coverage, retirement plans, paid time off, parental leave, and programs supporting employee wellbeing and professional development.
Is this posting expired or inaccurate?
