Dear Partners,
We are looking for a MLOPS Engineer candidate for one of our direct clients based in Austin, TX / Sunnyvale, CA . Please let me know if you have any strong candidates available.
Position: MLOPS Engineer
Location: Austin, TX / Sunnyvale, CA
Duration: 12+ months
Interview Type: Video
Primary Skill: MLOPs, GPU, Ray , GPU configuration
Core Responsibilities:
- Ray & GPU Management: Operate, monitor, and troubleshoot production and non-production environments with a deep understanding of Ray and GPU configurations.
- Cloud Automation: Automate service deployment and orchestration across multiple cloud platforms, specifically AWS and GCP.
- API Development: Design and implement REST or RPC APIs and other services using Python or Go.
- Performance & Reliability: Participate in capacity planning, scale testing, and disaster recovery. You will also be responsible for implementing and reporting on SLOs and SLIs.
- Collaboration: Work closely with engineering, QA, and program management teams to support and improve our ML pipelines.
What We're Looking For:
- Expertise in Ray for managing distributed ML workloads.
- Experience in automating and orchestrating services in AWS and GCP.
- Proficiency in designing and implementing APIs with Python or Go.
- Familiarity with Ray development is a plus.
- Experience with implementing SLOs and SLIs.
Raju Biswas | Team Lead
647-360-8603