Senior Engineer- Machine Learning Platform

8 - 12 years

40 - 60 Lacs

Posted:None| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description


About the Role:

Role & responsibilities

Architecture & Development

  • Design and implement enterprise-scale ML infrastructure using Ray, Kubernetes, and cloud-native technologies
  • Architect high-performance model serving solutions handling millions of predictions per second
  • Build robust, scalable systems for model training, deployment, and monitoring
  • Lead technical decisions for critical ML platform components

MLOps & Infrastructure

  • Develop automated ML pipelines using Airflow and MLflow
  • Implement sophisticated monitoring and observability solutions
  • Optimize resource utilization across distributed computing environments
  • Design fault-tolerant, highly available ML systems

Performance Engineering

  • Optimize large-scale distributed systems for maximum throughput
  • Implement advanced memory management strategies for ML workloads
  • Design and optimize real-time inference systems
  • Tune system performance for production-grade ML operations

Preferred candidate profile

  • 8+ years of software engineering experience with distributed systems
  • 4+ years of hands-on experience building ML platforms
  • Deep expertise in Python and modern ML infrastructure tools
  • Proven experience with Kubernetes, containerization, and cloud platforms
  • Strong background in performance optimization and scalability
  • Experience with Ray, JupyterHub, MLflow, or similar ML platforms
  • Distributed Systems: Ray, Kubernetes, Docker or similar large scale distributed systems
  • ML Platforms: MLflow, Kubeflow, JupyterHub, Kubeflow
  • Infrastructure: AWS/GCP/Azure, Terraform
  • Languages: Python, Go
  • Observability: Prometheus, Grafana
  • CI/CD: GitLab, Jenkins

What Sets You Apart:

  • Contributions to open-source ML infrastructure projects
  • Experience with real-time, high-throughput inference systems
  • Background in cybersecurity or threat detection
  • Track record of leading technical initiatives
  • Experience with large-scale data processing systems

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Golang Skills

Practice Golang coding challenges to boost your skills

Start Practicing Golang Now
Crowdstrike logo
Crowdstrike

Computer and Network Security

Remote

RecommendedJobs for You

bengaluru, karnataka, india

kolkata, mumbai, new delhi, hyderabad, pune, chennai, bengaluru