Senior Software Engineer - Model Inferencing

3 - 6 years

11 - 15 Lacs

Posted:2 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description


Red Hat OpenShift AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps.The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.What You Will Do Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving. Contribute directly to upstream inference runtime communities such as vLLM, TGI, PyTorch, OpenVINO, and others. Maintain CI/CD build pipelines for container images that allow faster, more secure, reliable, and frequent releases Coordination and communication with various stakeholders Applying a growth mindset by staying up to date with AI and ML advancementsWhat You Will Bring Highly experienced with programming in Python and PyTorch Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries. Experience with Python packaging, such as PyPI libraries Solid understanding of the fundamentals of model inference architectures Experience with Jenkins, Git, shell scripting, and related technologies Experience with the development of containerized applications in Kubernetes Experience with Agile development methodologies Experience with Cloud Computing using at least one of the following Cloud infrastructuresAWS, GCP, Azure, or IBM Cloud Ability to work across a large, distributed, hybrid engineering teamFollowing is considered a plus Experience with open-source development is a plus Development experience with C++, especially with the CUDA APIs, is a big plus  

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Red Hat logo
Red Hat

Software Development

Raleigh NC

RecommendedJobs for You