Primary Job Title: Machine Learning Engineer (LLM & RAG)
Industry: Enterprise AI / Software & Cloud Solutions. Sector: Large Language Model (LLM) applications, Retrieval-Augmented Generation (RAG), and production ML services for business workflows. Location: India (Remote).
About The Opportunity
Join a fast-moving engineering team building production-grade LLM-powered services and RAG pipelines that enable intelligent search, document understanding, and agentic automation for enterprise customers. You will design, implement, and operate scalable retrieval, embedding, and inference pipelines, turning research-grade models into reliable, low-latency products.
Role & Responsibilities
- Design and implement end-to-end RAG workflows: document ingestion, embedding generation, vector indexing, retrieval, and LLM inference.
- Develop robust Python services that integrate Transformers-based models, LangChain pipelines, and vector search (FAISS/Milvus) for production APIs.
- Optimize embedding strategies, retrieval quality, and prompt templates to improve relevance, latency, and cost-efficiency.
- Build scalable inference stacks with serving, batching, caching, and monitoring to meet SLA targets for throughput and latency.
- Collaborate with data scientists and product teams to evaluate model architectures, run A/B tests, and implement continuous retraining/validation loops.
- Implement observability, CI/CD, and reproducible deployments (Docker-based containers, model versioning, and automated tests).
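To make the first responsibility concrete, the RAG loop above (ingest documents, embed, index, retrieve, then pass context to the LLM) can be sketched in miniature. This is a toy illustration only: the bag-of-words "embedding" and linear scan stand in for a Transformer encoder and a FAISS/Milvus index, and the document snippets are invented examples.

```python
# Toy sketch of the RAG retrieval loop: embed -> index -> retrieve -> prompt.
# Real systems would use learned embeddings and an ANN index (FAISS/Milvus).
import math
from collections import Counter

def embed(text):
    # Bag-of-words stand-in for a Transformer embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, docs, k=1):
    # Linear scan stands in for a vector-index lookup.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

docs = [
    "Invoices are processed within 30 days of receipt.",
    "Employees accrue vacation monthly.",
]
query = "when are invoices processed"
context = retrieve(query, docs)[0]
# The retrieved context is stuffed into the LLM prompt template.
prompt = f"Context: {context}\nQuestion: {query}"
```

In production, each stage becomes a swappable component (ingestion workers, an embedding service, a vector store, a prompt/inference layer), which is what the LangChain-style orchestration mentioned below coordinates.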
Skills & Qualifications
Must-Have
- 4+ years of professional experience in ML or software engineering with hands-on LLM/RAG work.
- Strong Python programming and system-design skills for production services.
- Experience with Transformers-based models and fine-tuning/inference workflows.
- Proven experience building retrieval pipelines using vector search (FAISS, Milvus) and embeddings.
- Familiarity with LangChain or equivalent orchestration libraries for LLM workflows.
- Practical experience containerizing and deploying ML workloads (Docker, CI/CD, basic infra automation).
Preferred
- Experience with cloud ML infra (AWS, Azure or GCP) and model serving at scale.
- Familiarity with Kubernetes or other orchestration for production deployments.
- Experience with retrieval evaluation, relevance metrics, and A/B experimentation.
Benefits & Culture Highlights
- Fully remote role with flexible hours and an outcomes-driven culture.
- Opportunity to ship end-to-end LLM products and influence architecture choices.
- Mentorship-oriented environment with access to modern tools and model stacks.
Why apply: This role offers hands-on ownership of RAG systems and LLM deployment in production, ideal for engineers who want to move fast, optimize for real-world impact, and work with cutting-edge LLM tooling.
Skills: python, backend, rag, llm