Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

MLOps Engineer

Yotta Data Services Private Limited

9 - 14 years

0 Lacs

mumbai maharashtra india

Posted:3 months ago| Platform: Foundit logo

Apply

Skills Required

triton tensorflow serving mlflow kubeflow torchserve bentoml

Work Mode

On-site

Job Type

Full Time

Job Description

Location: Mumbai, India

Experience Level: 9 Plus Years

Minimum Qualification: Masters Degree in Computer Science, Engineering, or related field.

About the Role:

Were looking for a strategic Senior MLOps Engineer to lead the end-to-end design, implementation, and scaling of our AI infrastructure. Youll partner with researchers, product teams, and DevOps to turn prototypes into production services that meet strict SLAs for latency, reliability, and cost efficiency.

Responsibilities:

Core MLOps Pipelines: Design and implement scalable ML pipelines (training, evaluation, deployment) for LLMs, CV, and multimodal models .

Model Serving & CI/CD: Lead efforts in model serving, versioning, automated CI/CD, and real-time monitoring of AI workflows .

Inference-as-a-Service: Build and optimize GPU-backed serving infrastructure targeting p99 latency < 100 ms, 99.9% uptime, and > 80% GPU utilization .

Governance & Drift Detection: Drive initiatives on model governance, automated drift detection (?10% false positives), and data-management best practices .

Vector Search & Agent Orchestration: Integrate vector databases (Qdrant, Pinecone) for low-latency semantic retrieval, and build agentic workflows using LangChain or similar frameworks.

Enterprise Multi-Tenancy: Architect RBAC-driven, isolated ML services to securely serve 100500+ organizations.

Observability & Logging: Design Prometheus/Grafana dashboards, ELK/Fluentd logging pipelines, and alerting for all ML workloads.

CI/CD for Inference APIs: Maintain CI/CD pipelines for Python (FastAPI) and TypeScript (NestJS) inference services.

Metrics & Cost Optimization: Define and track SLAs/SLOs, optimize cloud spend by ? 20% year-over-year, and ensure GPU clusters operate at > 80% utilization.

Cross-Functional Leadership: Partner with AI researchers, product managers, and legal to align MLOps standards with compliance and roadmap goals.

Mentorship & Community: Mentor junior engineers, run quarterly brown-bags, own onboarding docs (upskill 5+ engineers/quarter), and publish ? 1 open-source contribution or talk annually.

Requirements

914 years in software engineering, including ? 4 years in MLOps or ML infrastructure

Strong expertise in cloud platforms (AWS/GCP/Azure), Kubernetes, Docker, Terraform, Helm, Kubeflow, and MLflow

Experience with inference frameworks (Triton, TensorFlow Serving, BentoML, TorchServe)

Familiarity with distributed training, workload schedulers, and GPU-cluster orchestration

Proficiency in Python, TypeScript, and infrastructure-as-code (Terraform, Helm, etc.)

Proven track record building reliable, scalable ML systems in production.

Plus These Critical Skills:

Vector DB integration (Qdrant, Pinecone)

Agent orchestration (LangChain, LlamaIndex)

Multi-tenant security and RBAC

Observability stacks (Prometheus/Grafana, ELK)

CI/CD for FastAPI/NestJS services

Preferred

Masters/PhD in CS/AI and certifications such as AWS ML Specialty, Google Cloud Professional ML Engineer, or CNCF CKA/CKAD.

Prior experience at AI-focused startups or enterprises scaling ML for 100500 orgs.

Understanding of low-latency streaming inference or agent-based LLM systems.

Excellent written and verbal communication, and a proven ability to drive consensus across functions.

More Jobs at Yotta Data Services Private Limited

Partner Manager

Chennai, Tamil Nadu, India

3 - 3 yrs

Salary: Not disclosed

Solution Engineer Manager

Mumbai, Maharashtra, India

8.0 - 8.0 yrs

Salary: Not disclosed

Security L2

Panvel, Maharashtra, India

5.0 - 8.0 yrs

Salary: Not disclosed

Product Manager - AI

Mumbai, Maharashtra, India

8.0 - 8.0 yrs

Salary: Not disclosed

Lead Artificial Intelligence

Mumbai, Maharashtra, India

Experience: Not specified

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

Yotta Data Services Private Limited

Login to

Please Verify Your Phone or Email

Confirm Action

MLOps Engineer