Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in hyderabad
>
Jovetix Technologies
>
AI/ML Engineer (GenAI & LLM Specialist)

AI/ML Engineer (GenAI & LLM Specialist)

Jovetix Technologies

3 - 7 years

15 - 18 Lacs

hyderabad

Posted:1 day ago| Platform:

Apply

Skills Required

tensorflow pytorch large language model machine learning python deep learning model development

Work Mode

Hybrid

Job Type

Full Time

Job Description

Role Summary

We are seeking an experienced AI/ML Engineer to lead the development of our core intelligence engine. In this role, you will own the end-to-end lifecycle of our proprietary AI tool: selecting the optimal open-source Large Language Model (LLM), fine-tuning it for our specific domain, and architecting a RAG (Retrieval-Augmented Generation) pipeline.

Your primary mission is to build a system that can ingest complex PDF documents, comprehend historical and referential data, and provide accurate, context-aware answers to user queries.

Role & responsibilities

1. Model Selection & Strategy

Evaluate Open-Source Models:
Analyze and benchmark state-of-the-art open-source models (e.g., Llama 3, Mistral, Falcon, Mixtral) to identify the best balance of performance, inference cost, and license suitability for our specific use case.
Feasibility Analysis:
Determine when to use Retrieval-Augmented Generation (RAG) versus Fine-Tuning (or a hybrid approach) to ensure the highest accuracy in answering queries based on uploaded PDFs.

2. RAG Pipeline & Data Engineering

Document Ingestion:
Build robust pipelines to parse, clean, and OCR complex PDF files (handling tables, headers, and multi-column layouts) using tools like Unstructured, PyMuPDF, or LayoutLM.
Vector Database Management:
Design and implement vector search architectures (using Pinecone, Milvus, ChromaDB, or Weaviate) to store and retrieve high-dimensional embeddings efficiently.
Context Optimization:
Optimize "chunking" strategies and context window management to ensure the LLM receives the most relevant historical data without hallucinating.

3. Model Training & Fine-Tuning

Fine-Tuning:
Implement efficient fine-tuning techniques (PEFT, LoRA, QLoRA) on the selected LLM to adapt its tone and reasoning capabilities to our business domain.
Dataset Preparation:
Curate and format training datasets from internal data to improve the model's ability to understand domain-specific terminology found in the PDFs.

4. Deployment & Optimization

Inference Optimization:
Optimize model latency and throughput using quantization (GGML/GGUF/AWQ) or engines like vLLM and TGI.
API Development:
Wrap the AI engine in a robust API (FastAPI/Flask) for integration with our front-end application.

Required Skills & Qualifications

Core Tech:
Expert proficiency in
Python
and deep learning frameworks (
PyTorch
or TensorFlow).
LLM Ecosystem:
Deep familiarity with the
Hugging Face
ecosystem (Transformers, Accelerate, PEFT, Datasets).
GenAI Frameworks:
Hands-on experience with orchestration frameworks like
LangChain
or
LlamaIndex
specifically for building RAG applications.
Vector Search:
Experience working with Vector Databases (Pinecone, ChromaDB, Elasticsearch, or pgvector).
Document Processing:
Experience extracting clean text from unstructured files (PDFs) using open source OCR tools (Nougat, Surya, PaddleOCR) or Python libraries(pymupdf, pdfplumber) for native pdfs.
Deployments:
Experience with
Containerization & Orchestration
(Docker, Kubernetes) and serving LLMs in air-gapped or offline environments using tools like
vLLM, Ollama, or llama.cpp
Mathematics:
Solid understanding of linear algebra, probability, and how Transformer architectures (Attention mechanisms) work.

Nice-to-Have (Bonus Points)

Experience deploying LLMs on cloud GPUs (AWS SageMaker or RunPod).
Knowledge of prompt engineering techniques (Chain-of-Thought, ReAct).
Previous experience building "Chat with your Data" style applications.

Why Join Us?

High Impact:
You will be the primary architect of the intelligence behind our product, not just maintaining legacy code.
Cutting Edge:
You will work with the absolute latest developments in the Open Source LLM space.
Autonomy:
You will have the freedom to choose the tech stack (Models, DBs, Frameworks) that best solves the problem.

More Jobs at Jovetix Technologies

Database Engineer

Bengaluru

5 - 8 yrs

INR 20 - 25 Lacs

AI/ML Engineer (GenAI & LLM Specialist)

hyderabad

3.0 - 7.0 yrs

INR 15 - 18 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Machine Learning Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Jovetix Technologies

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

AI/ML Engineer (GenAI & LLM Specialist)

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

Role Summary

Role & responsibilities

1. Model Selection & Strategy

Evaluate Open-Source Models:

Feasibility Analysis:

2. RAG Pipeline & Data Engineering

Document Ingestion:

Vector Database Management:

Context Optimization:

3. Model Training & Fine-Tuning

Fine-Tuning:

Dataset Preparation:

4. Deployment & Optimization

Inference Optimization:

API Development:

Required Skills & Qualifications

Core Tech:

Python

PyTorch

LLM Ecosystem:

Hugging Face

GenAI Frameworks:

LangChain

LlamaIndex

Vector Search:

Document Processing:

Deployments:

Containerization & Orchestration

vLLM, Ollama, or llama.cpp

Mathematics:

Nice-to-Have (Bonus Points)

Why Join Us?

High Impact:

Cutting Edge:

Autonomy:

More Jobs at Jovetix Technologies