Get alerts for new jobs matching your selected skills, preferred locations, and experience range.
5 - 8 years
7 - 10 Lacs
Bengaluru
Work from Office
About the Job: The Data Development Insights & Strategy (DDIS) team is seeking a Senior AI Engineer to design, scale, and maintain our AI model lifecycle framework within Red Hat's OpenShift AI and RHEL AI infrastructures. As a Senior AI Engineer, you will contribute to managing and optimizing large-scale AI models, collaborating with cross-functional teams to ensure high availability, continuous monitoring, and efficient integration of new model updates, while driving innovation through emerging AI technologies. In this role, you will leverage your expertise in AI, MLOps/LLMOps, cloud computing, and distributed systems to enhance model performance, scalability and operational efficiency. You'll work in close collaboration with the Products & Global Engineering(P&GE) and IT AI Infra teams, ensuring seamless model deployment and maintenance in a secure and high-performance environment. This is an exciting opportunity to drive AI model advancements and contribute to the operational success of mission-critical applications. What you will do? Develop and maintain the lifecycle framework for AI models within Red Hats OpenShift and RHEL AI infrastructure, ensuring security, scalability and efficiency throughout the process. Design, implement, and optimize CI/CD pipelines and automation for deploying AI models at scale using tools like Git, Jenkins, and Terraform, ensuring zero disruption during updates and integration. Continuously monitor and improve model performance using tools such as OpenLLMetry, Splunk, and Catchpoint, while responding to performance degradation and model-related issues. Work closely with cross-functional teams, including Products & Global Engineering(P&GE) and IT AI Infra teams, to seamlessly integrate new models or model updates into production systems with minimal downtime and disruption. Enable a structured process for handling feature requests (RFEs), prioritization, and resolution, ensuring transparent communication and timely resolution of model issues. Assist in fine-tuning and enhancing large-scale models, including foundational models like Mistral and LLama, while ensuring computational resources are optimally allocated (GPU management, cost management strategies). Drive performance improvements, model updates, and releases on a quarterly basis, ensuring that all RFEs are processed and resolved within 30 days. Collaborate with stakeholders to align AI model updates with evolving business needs, data changes, and emerging technologies. Contribute to mentoring junior engineers, fostering a collaborative and innovative environment. What you will bring? A bachelor's or masters degree in Computer Science, Data Science, Machine Learning, or a related technical field is required. Hands-on experience that demonstrates your ability and interest in AI engineering and MLOps will be considered in lieu of formal degree requirements. Experience programming in at least one of these languages: Python, with a strong understanding of Machine Learning frameworks and tools. Experience working with cloud platforms such as AWS, GCP, or Azure, and have familiarity with deploying and maintaining AI models at scale in these environments. As a Senior AI Engineer, you will be most successful if you have experience working with large-scale distributed systems and infrastructure, especially in production environments where AI and LLM models are deployed and maintained. You should be comfortable troubleshooting, optimizing, and automating workflows related to AI model deployment, monitoring, and lifecycle management. We value a strong ability to debug and optimize model performance and automate manual tasks wherever possible. Additionally, you should be well-versed in managing AI model infrastructure using containerization technologies like Kubernetes and OpenShift, and have hands-on experience with performance monitoring tools (e.g., OpenLLMetry, Splunk, Catchpoint). We also expect you to have a solid understanding of GPU-based computing and resource optimization, with a background in high-performance computing (e.g., CUDA, vLLM, MIG, TGI, TEI). Experience working in Agile development environments. Work collaboratively within cross-functional teams to solve complex problems and drive AI model updates will be key to your success in this role. Desired skills: 5+ years of experience in AI or MLOps, with a focus on deploying, maintaining, and optimizing large-scale AI models in production. Expertise in deploying and managing models in cloud environments (AWS, GCP, Azure) and containerized platforms like OpenShift or Kubernetes. Familiarity with large-scale distributed systems and experience managing their performance and scalability. Experience with performance monitoring and analysis tools such as OpenLLMetry, Prometheus, or Splunk. Deep understanding of GPU-based deployment strategies and computational cost management. Strong experience in managing model lifecycle processes, from training to deployment, monitoring, and updates. Ability to mentor junior engineers and promote knowledge sharing across teams. Excellent communication skills, both verbal and written, with the ability to engage with technical and non-technical stakeholders. A passion for innovation and continuous learning in the rapidly evolving field of AI and machine learning.
Posted 3 months ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
MLOps, a combination of machine learning and operations, is a rapidly growing field in India. As companies continue to invest in artificial intelligence and machine learning technologies, the demand for MLOps professionals is on the rise. Job seekers looking to enter this field will find a variety of opportunities across different industries in India.
Here are 5 major cities in India actively hiring for MLOps roles: 1. Bangalore 2. Mumbai 3. Hyderabad 4. Pune 5. Delhi
The salary range for MLOps professionals in India can vary based on experience and location. On average, entry-level MLOps professionals can expect to earn around INR 6-10 lakhs per annum, while experienced professionals can earn upwards of INR 15 lakhs per annum.
A typical career progression in MLOps may look like this: - Junior MLOps Engineer - MLOps Engineer - Senior MLOps Engineer - MLOps Architect - MLOps Manager
In addition to MLOps skills, professionals in this field are often expected to have knowledge of: - Machine Learning - Data Engineering - Cloud Computing - Python programming - DevOps
Here are 25 interview questions for MLOps roles: - What is the difference between machine learning and deep learning? (basic) - Explain the concept of model deployment in MLOps. (medium) - How do you handle data drift in a machine learning model? (medium) - What is Docker, and how is it used in MLOps? (basic) - What is the purpose of version control in MLOps? (basic) - Describe your experience with CI/CD pipelines in MLOps. (medium) - How do you monitor the performance of a machine learning model in production? (medium) - What is Kubernetes, and how is it related to MLOps? (medium) - Explain the concept of hyperparameter tuning. (medium) - How do you ensure model reproducibility in MLOps? (advanced) - What is the difference between batch inference and real-time inference? (medium) - How do you handle model retraining in MLOps? (medium) - Describe a time when you had to troubleshoot a machine learning model in production. (medium) - What is the role of data governance in MLOps? (medium) - How do you ensure model security in MLOps? (medium) - Explain the concept of A/B testing in the context of MLOps. (medium) - What are the key components of a machine learning pipeline? (basic) - How do you manage model versioning in MLOps? (medium) - Describe your experience with monitoring and logging in MLOps. (medium) - What is the purpose of artifact management in MLOps? (basic) - How do you handle scalability in machine learning systems? (medium) - What is the difference between supervised and unsupervised learning? (basic) - Explain the concept of bias and variance in machine learning models. (medium) - How do you ensure data quality in MLOps? (medium) - Describe a successful MLOps project you have worked on. (advanced)
As the demand for MLOps professionals continues to grow in India, now is a great time to explore opportunities in this field. By honing your skills, gaining relevant experience, and preparing for interviews, you can position yourself for a successful career in MLOps. Good luck with your job search!
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
36723 Jobs | Dublin
Wipro
11788 Jobs | Bengaluru
EY
8277 Jobs | London
IBM
6362 Jobs | Armonk
Amazon
6322 Jobs | Seattle,WA
Oracle
5543 Jobs | Redwood City
Capgemini
5131 Jobs | Paris,France
Uplers
4724 Jobs | Ahmedabad
Infosys
4329 Jobs | Bangalore,Karnataka
Accenture in India
4290 Jobs | Dublin 2