Machine Learning Engineer – Model Optimization
Overview
Premier Talent Source is recruiting on behalf of a scaling AI client for a Machine Learning Engineer focused on model performance and deployment. This role will collaborate with data scientists and infrastructure teams to refine LLMs, improve inference speed, and ensure production-grade reliability.
No. of Vacancies
1
Specific Skills
Skills Required:
- 3+ years in ML engineering, ideally with LLMs or generative models
- Experience with distributed training, GPU acceleration, and MLOp
- Strong coding skills in Python and familiarity with ML framework
Responsible For
Responsibilities:
- Design and implement scalable ML models using PyTorch or TensorFlow
- Optimize transformer architectures for latency and throughput
- Collaborate on data preprocessing, feature engineering, and evaluation pipelines
- Deploy models using containerized workflows and cloud-native tools
- Monitor performance and iterate based on real-world feedback
Job Nature
Full Time
Educational Requirements
Educational Requirements:
- Bachelor’s or Master’s in CS, Math, or related field
Experience Requirements
3+ years in ML engineering
Job Location
Hybrid (SF Bay Area) or Remote
Salary
$150,000–$190,000 + equity
Job Level
Sr. Position
How to Apply
Interested candidates can send their resumes to wtownsend@premiertalentsource.com mentioning "Job Title" in the subject line.
Apply Online