Senior Software Engineer - Model Inferencing

3 - 6 years

11 - 15 Lacs

Posted:2 days ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Red Hat OpenShift AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps.


The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.


What You Will Do




  • Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving.

  • Contribute directly to upstream inference runtime communities such as vLLM, TGI, PyTorch, OpenVINO, and others.

  • Maintain CI/CD build pipelines for container images that allow faster, more secure, reliable, and frequent releases

  • Coordination and communication with various stakeholders

  • Applying a growth mindset by staying up to date with AI and ML advancements


What You Will Bring



  • Highly experienced with programming in Python and PyTorch

  • Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries.

  • Experience with Python packaging, such as PyPI libraries

  • Solid understanding of the fundamentals of model inference architectures

  • Experience with Jenkins, Git, shell scripting, and related technologies

  • Experience with the development of containerized applications in Kubernetes

  • Experience with Agile development methodologies

  • Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud

  • Ability to work across a large, distributed, hybrid engineering team

  • Following is considered a plus

  • Experience with open-source development is a plus

  • Development experience with C++, especially with the CUDA APIs, is a big plus


 

 

 

 

Mock Interview

Practice Video Interview with JobPe AI

Start Software Engineer Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Red Hat logo
Red Hat

Software Development

Raleigh NC

RecommendedJobs for You