ML + RAG, Python, LLM, Eng (4 yrs)

Orbion Infotech

4 years

10 - 12 Lacs

India

Posted: 3 days ago

Skills Required

ml python learning ai software model retrieval latency inference data summarization design indexing quantization onnx integration optimization strategies docker orchestration automate monitoring drift engineering tooling aws gcp mlflow cutting architecture collaborative experimentation shipping

Work Mode

Remote

Job Type

Full Time

Job Description

Machine Learning Engineer – LLM & RAG (Remote, India)

About The Opportunity

We operate in the AI/ML and Enterprise Software sector, building production-ready large language model (LLM) applications and retrieval-augmented generation (RAG) systems that solve real-world enterprise problems. The team focuses on scalable, low-latency LLM inference, vector search, and data pipelines to deliver intelligent search, summarization, and automated knowledge workflows for customers across industries.

Role & Responsibilities

Design and implement end-to-end RAG solutions: document ingestion, embedding generation, vector indexing, retriever design, and LLM-based response generation.
Develop and maintain Python back-end services and APIs that integrate LLMs, LangChain/LlamaIndex workflows, and vector search for production use.
Optimize LLM inference performance: model selection, batching, quantization, ONNX/Triton integration, and memory/GPU optimization to meet latency and cost SLAs.
Integrate and tune vector search stacks (FAISS, Milvus, Weaviate, or hosted vector DBs) and design embedding strategies for robust retrieval.
Deploy and operate scalable infrastructure using Docker and orchestration platforms; automate CI/CD, monitoring, and alerting for ML services.
Collaborate with Data Scientists and product teams to productionize models, implement A/B experiments, monitor drift, and iterate on model quality and UX.
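
For candidates unfamiliar with the RAG stages named above (ingestion, embedding, indexing, retrieval, prompt assembly), the flow can be sketched in a few lines of plain Python. This is an illustrative toy, not the team's implementation: the bag-of-words `embed` function stands in for a real embedding model, the in-memory list stands in for a vector index such as FAISS, and all function names and sample documents are hypothetical. The final LLM call is omitted.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def build_index(docs: list[str]) -> list[tuple[str, Counter]]:
    """Ingestion + indexing: store each document next to its embedding."""
    return [(doc, embed(doc)) for doc in docs]


def retrieve(index: list[tuple[str, Counter]], query: str, k: int = 2) -> list[str]:
    """Retrieval: return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]


def build_prompt(query: str, passages: list[str]) -> str:
    """Prompt assembly: ground the LLM request in the retrieved passages."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


# Hypothetical sample documents, for illustration only.
docs = [
    "Refunds are processed within five business days.",
    "The API rate limit is 100 requests per minute.",
    "Support is available on weekdays from 9 to 5.",
]
index = build_index(docs)
top = retrieve(index, "what is the api rate limit", k=1)
prompt = build_prompt("what is the api rate limit", top)
```

In a production system of the kind described here, `embed` would call an embedding model, the index would live in a vector store, and `prompt` would be sent to an LLM; the stage boundaries stay the same.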

Skills & Qualifications

Must-Have

4+ years of experience in machine learning or ML engineering with hands-on LLM projects.
Strong software engineering skills in Python and experience building production back-end services.
Experience with transformer frameworks and LLM tooling (Hugging Face Transformers, PyTorch).
Practical experience building RAG pipelines and working with vector search (FAISS or similar).
Proven experience deploying ML services with Docker and cloud environments (AWS/GCP/Azure).
Knowledge of model optimization and serving techniques (quantization, ONNX, Triton, batching).

Preferred

Hands-on experience with LangChain, LlamaIndex, or similar orchestration frameworks.
Familiarity with vector databases (Milvus, Weaviate) and managed vector DB services.
Experience with MLOps and monitoring tools (MLflow, Prometheus, Grafana, model-drift tooling).

Benefits & Culture Highlights

Fully remote role with flexible hours supporting work-life balance across India.
Opportunity to work on cutting-edge LLM/RAG products and influence architecture and tooling choices.
Collaborative, fast-paced engineering culture that values ownership, experimentation, and scalable design.

To apply, bring strong Python engineering, hands-on LLM/RAG experience, and a passion for shipping scalable AI systems. This role is ideal for engineers who enjoy end-to-end ownership of production ML services and optimizing LLMs for real user impact.
Skills: LLM, RAG, Python
