4 Gpu Inference Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

On-site

We are looking for a Senior Software Engineer who thrives on hands-on coding and problem solving to build the real-time data pipelines and serving systems that power large-scale ML models. You will be directly responsible for designing, coding, and optimizing ETL pipelines, GPU inference serving that ensure models get the freshest data and serve results with millisecond-level latency. This role is ideal for someone who loves to write high-quality production code, profile and debug performance bottlenecks, and ship optimizations at scale, while collaborating closely with applied scientists and other engineers. Microsoft's mission is to empower every person and every organization on the planet...

Posted 3 days ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

india

On-site

Overview We're building the future of voice intelligent, expressive, multilingual systems that can understand, respond, and connect with humans naturally. As ourSpeech AI Engineer , you'll be part of a high-performance team designing voice intelligence that powers conversational platforms for governments, enterprises, and next-gen contact centers. Your mission is to make machines sound human blending deep learning, linguistics, and emotion modeling to create voices that respond with empathy and precision. You'll work on speech pipelines that run in real time, across languages like Arabic, English, Hindi , and beyond optimizing for accuracy, speed, and natural flow. This role is for those who...

Posted 1 week ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

delhi, india

On-site

AI/ML Engineer - L4/L5 (Lead / Senior-Lead) Location & Type: Delhi, Full-time CTC Range (LPA): 51.50 - 77.15 Role Overview We're looking for a hands-on AI/ML Lead Engineer who can bridge research, product, and engineering. You'll own the full ML lifecycle - from problem framing and data pipelines to model deployment, evaluation, and scaling in production. You'll guide a small team of engineers, collaborate with product and design, and turn ideas into working AI features that reach users quickly and reliably. This is a startup environment: you'll move fast, make pragmatic trade-offs, and ship continuously. What You'll Do Lead design and development of core AI features - from data ingestion to...

Posted 2 weeks ago

AI Match Score
Apply

0.0 - 4.0 years

0 Lacs

noida, uttar pradesh

On-site

As an AI Engineer at our company, your role will involve designing, developing, and deploying an in-house AI assistant powered by LLaMA 3 and integrated with our MS SQL-based ERP system (4QT ERP). You will be responsible for setting up LLM infrastructure, implementing voice input (Whisper), translating natural language to SQL, and ensuring accurate, context-aware responses to ERP-related queries. Key Responsibilities: - Setup and deploy LLaMA 3 (8B/FP16) models using llama-cpp-python or Hugging Face - Integrate the AI model with FastAPI to create secure REST endpoints - Implement prompt engineering or fine-tuning (LoRA) to enhance SQL generation accuracy - Develop a user-facing interface (Re...

Posted 1 month ago

AI Match Score
Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies