Machine Learning Engineer – Model Optimization

Overview

Premier Talent Source is recruiting on behalf of a scaling AI client for a Machine Learning Engineer focused on model performance and deployment. This role will collaborate with data scientists and infrastructure teams to refine LLMs, improve inference speed, and ensure production-grade reliability.

No. of Vacancies
1
Specific Skills
Skills Required:
  • 3+ years in ML engineering, ideally with LLMs or generative models
  • Experience with distributed training, GPU acceleration, and MLOp
  • Strong coding skills in Python and familiarity with ML framework
Responsible For
Responsibilities:
  • Design and implement scalable ML models using PyTorch or TensorFlow
  • Optimize transformer architectures for latency and throughput
  • Collaborate on data preprocessing, feature engineering, and evaluation pipelines
  • Deploy models using containerized workflows and cloud-native tools
  • Monitor performance and iterate based on real-world feedback
Job Nature
Full Time
Educational Requirements
Educational Requirements:
  • Bachelor’s or Master’s in CS, Math, or related field
Experience Requirements
3+ years in ML engineering
Job Location
Hybrid (SF Bay Area) or Remote
Salary
$150,000–$190,000 + equity
Job Level
Sr. Position

Apply for this position

*
*
* Attach your resume. Max size 2mb Allowed Types: pdf
Scroll to Top