AI/GenAI Engineer

Experience: 0 - 5 years

Salary: 0 - 3 Lacs

Posted: 2 weeks ago | Platform: Naukri


Work Mode: Work from Office

Job Type: Full Time

Job Description

Position: AI/GenAI Engineer (LLM Integration Specialist)

About the Role

We're building a high-performance chat application and are looking for an AI/GenAI Engineer to lead the integration and optimization of Large Language Models (LLMs). You'll be responsible for integrating LLM APIs, implementing domain-specific fine-tuning strategies, driving prompt engineering, and ensuring optimal performance in production.

You'll work closely with our React and Python developers to create a seamless, intelligent chat experience that serves thousands of concurrent users.

Key Responsibilities

LLM Integration & Architecture

  • Integrate multiple LLM APIs (OpenAI, Anthropic Claude, Google Gemini, or open-source models)
  • Design and implement robust API wrapper services with retry logic, fallback mechanisms, and error handling
  • Implement streaming responses for real-time chat experience
  • Build rate limiting and quota management systems
  • Handle token counting, context window management, and cost optimization
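
To make the wrapper-service bullets above concrete, here is a minimal sketch, assuming the OpenAI Python SDK (v1+) and the tenacity retry library; the model names, retry limits, and fallback choice are placeholders rather than part of this role's actual stack.

```python
# Minimal sketch: streaming chat call with retries on the request and a
# secondary-model fallback. Model names and retry limits are placeholders.
from openai import OpenAI
from tenacity import retry, stop_after_attempt, wait_exponential

client = OpenAI()  # reads OPENAI_API_KEY from the environment

@retry(stop=stop_after_attempt(3), wait=wait_exponential(min=1, max=10))
def _open_stream(messages, model):
    # Retries cover only the request that opens the stream; mid-stream
    # network failures would need separate handling.
    return client.chat.completions.create(model=model, messages=messages, stream=True)

def stream_chat(messages, model="gpt-4o-mini"):
    """Yield response text chunks as they arrive, for a real-time chat UI."""
    for chunk in _open_stream(messages, model):
        if chunk.choices and chunk.choices[0].delta.content:
            yield chunk.choices[0].delta.content

def chat_with_fallback(messages, primary="gpt-4o-mini", fallback="gpt-4o"):
    """Route to a secondary model if the primary model keeps failing."""
    try:
        yield from stream_chat(messages, model=primary)
    except Exception:
        yield from stream_chat(messages, model=fallback)

if __name__ == "__main__":
    msgs = [{"role": "user", "content": "Say hello in one sentence."}]
    for piece in chat_with_fallback(msgs):
        print(piece, end="", flush=True)
    print()
```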

Domain Customization & Fine-tuning

  • Develop domain-specific prompt engineering strategies
  • Implement RAG (Retrieval Augmented Generation) pipelines using vector databases
  • Fine-tune or adapt models for specific use cases using techniques like LoRA, prompt tuning
  • Create and maintain knowledge bases for domain-specific responses
  • Implement few-shot learning and in-context learning strategies
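
For the RAG bullet above, a bare-bones retrieval step might look like the sketch below, using ChromaDB's in-memory client and default embedding function; the collection name, documents, and prompt wording are purely illustrative.

```python
# Minimal RAG sketch: retrieve top-k chunks from a vector store and build a
# grounded prompt. Documents and prompt wording are placeholders.
import chromadb

client = chromadb.Client()
collection = client.create_collection("domain_kb")

# In a real pipeline these would be chunked documents from the knowledge base.
collection.add(
    ids=["doc1", "doc2"],
    documents=[
        "Refunds are processed within 5 business days.",
        "Premium support is available 24/7 for enterprise plans.",
    ],
)

def build_rag_prompt(question: str, k: int = 2) -> str:
    """Retrieve the k most similar chunks and inline them as context."""
    results = collection.query(query_texts=[question], n_results=k)
    context = "\n".join(results["documents"][0])
    return (
        "Answer using only the context below. If the answer is not in the "
        f"context, say you don't know.\n\nContext:\n{context}\n\nQuestion: {question}"
    )

print(build_rag_prompt("How long do refunds take?"))
```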

Performance & Optimization

  • Optimize API response times and reduce latency
  • Implement caching strategies for common queries
  • Monitor and optimize token usage to control costs
  • A/B test different models and prompts for quality improvements
  • Build fallback chains (primary/secondary model routing)
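
As an example of the caching bullet above, here is a small sketch of a Redis-backed response cache keyed on a hash of the normalized prompt; the TTL and key scheme are arbitrary choices, not a prescribed design.

```python
# Sketch of a response cache for common queries, assuming a local Redis
# instance (Redis appears in the stack listed below); TTL and key scheme
# are arbitrary.
import hashlib
import redis

cache = redis.Redis(host="localhost", port=6379, decode_responses=True)
CACHE_TTL_SECONDS = 3600

def cache_key(model: str, prompt: str) -> str:
    # Normalize whitespace and case so trivially different phrasings hit
    # the same entry.
    normalized = " ".join(prompt.lower().split())
    return "llm:" + hashlib.sha256(f"{model}:{normalized}".encode()).hexdigest()

def cached_completion(model: str, prompt: str, generate) -> str:
    """Return a cached answer if present; otherwise call `generate` and store it."""
    key = cache_key(model, prompt)
    hit = cache.get(key)
    if hit is not None:
        return hit
    answer = generate(model, prompt)  # e.g. a wrapper around the LLM API
    cache.setex(key, CACHE_TTL_SECONDS, answer)
    return answer
```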

Safety & Quality

  • Implement content moderation and safety filters
  • Build guardrails to prevent prompt injection and jailbreaking
  • Develop evaluation frameworks to measure response quality
  • Monitor and handle hallucinations and inaccuracies
  • Implement user feedback loops for continuous improvement
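
One possible shape for the guardrail work above is sketched below: a heuristic deny-list check for common prompt-injection phrases plus a moderation call, assuming the OpenAI Python SDK; the phrase list is illustrative and by no means a complete defense.

```python
# Rough sketch of input guardrails: a heuristic prompt-injection check plus a
# moderation call. The phrase list is illustrative, not exhaustive.
import re
from openai import OpenAI

client = OpenAI()

INJECTION_PATTERNS = [
    r"ignore (all|any|previous) instructions",
    r"reveal (your|the) system prompt",
    r"you are now in developer mode",
]

def looks_like_injection(text: str) -> bool:
    lowered = text.lower()
    return any(re.search(pattern, lowered) for pattern in INJECTION_PATTERNS)

def is_safe_input(text: str) -> bool:
    """Reject obvious injection attempts and content flagged by moderation."""
    if looks_like_injection(text):
        return False
    result = client.moderations.create(input=text)
    return not result.results[0].flagged

# Usage: only forward user messages that pass the check to the LLM.
# if is_safe_input(user_message): ...
```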

Infrastructure

  • Design scalable architecture for handling concurrent LLM requests
  • Implement queue systems for managing high-volume API calls
  • Set up monitoring and logging for LLM interactions
  • Work with DevOps to deploy models (if self-hosted)
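
To illustrate the queueing bullet above, here is a minimal asyncio sketch that bounds concurrent LLM calls with a fixed worker pool; call_llm is a stand-in for a real async API wrapper, and a production system would more likely sit behind a broker such as Redis or RabbitMQ.

```python
# Minimal sketch of bounding concurrent LLM requests with an asyncio queue and
# a fixed worker pool; `call_llm` stands in for the real async API wrapper.
import asyncio

MAX_WORKERS = 5  # placeholder concurrency limit

async def call_llm(prompt: str) -> str:
    await asyncio.sleep(0.1)  # simulate API latency
    return f"answer to: {prompt}"

async def worker(queue: asyncio.Queue):
    while True:
        prompt, future = await queue.get()
        try:
            future.set_result(await call_llm(prompt))
        except Exception as exc:
            future.set_exception(exc)
        finally:
            queue.task_done()

async def main():
    queue: asyncio.Queue = asyncio.Queue()
    workers = [asyncio.create_task(worker(queue)) for _ in range(MAX_WORKERS)]
    loop = asyncio.get_running_loop()
    futures = []
    for i in range(20):
        fut = loop.create_future()
        await queue.put((f"question {i}", fut))
        futures.append(fut)
    print(await asyncio.gather(*futures))
    for w in workers:
        w.cancel()

asyncio.run(main())
```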

Required Skills & Experience

Must Have:

  • Experience working with LLMs and GenAI technologies
  • Strong experience with OpenAI API, Anthropic Claude, or similar LLM APIs
  • Proficiency in Python (FastAPI, LangChain, LlamaIndex preferred)
  • Strong understanding of prompt engineering techniques and best practices
  • Experience with vector databases (Pinecone, Weaviate, Qdrant, ChromaDB)
  • Knowledge of RAG (Retrieval Augmented Generation) implementation
  • Understanding of transformer architecture and attention mechanisms
  • Experience with API integration, webhooks, and streaming responses
  • Strong problem-solving skills and ability to debug complex AI systems
  • Experience with LangChain, LlamaIndex, or similar LLM frameworks
  • Knowledge of fine-tuning techniques (LoRA, QLoRA, PEFT)
  • Experience with embedding models and semantic search
  • Familiarity with the HuggingFace Transformers library
  • Experience deploying models using vLLM or TGI (Text Generation Inference)
  • Knowledge of function calling/tool use with LLMs
  • Experience with model evaluation metrics (BLEU, ROUGE, BERTScore)
  • Understanding of token economics and cost optimization
  • Experience with open-source models (Llama, Mistral, Falcon)
  • Knowledge of model quantization and optimization techniques
  • Experience with multi-modal models (vision, audio)
  • Familiarity with MLOps practices and experiment tracking (Weights & Biases, MLflow)
  • Experience with AWS SageMaker, Google Vertex AI, or Azure ML
  • Understanding of chain-of-thought prompting, ReAct, and agents
  • Experience building chatbots or conversational AI systems
  • Publications or contributions to the AI/ML community
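
Since token economics appears in the list above, here is a tiny token-counting sketch using tiktoken; the per-1K-token price is a placeholder, not an actual provider rate.

```python
# Tiny sketch of token counting for cost estimates, using tiktoken; the price
# below is a placeholder, not a real rate.
import tiktoken

PRICE_PER_1K_INPUT_TOKENS = 0.001  # placeholder, check the provider's pricing

def estimate_cost(prompt: str, encoding_name: str = "cl100k_base") -> tuple[int, float]:
    enc = tiktoken.get_encoding(encoding_name)
    n_tokens = len(enc.encode(prompt))
    return n_tokens, n_tokens / 1000 * PRICE_PER_1K_INPUT_TOKENS

tokens, cost = estimate_cost("Summarize our refund policy in two sentences.")
print(f"{tokens} tokens, ~${cost:.6f} input cost")
```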

Technical Stack You'll Work With

  • Languages: Python (primary), JavaScript/TypeScript (basic understanding)
  • LLM APIs: OpenAI, Anthropic, Google Gemini, Cohere
  • Frameworks: LangChain, LlamaIndex, FastAPI
  • Vector DBs: Pinecone, Weaviate, Qdrant, or ChromaDB
  • Infrastructure: Docker, Redis, PostgreSQL, Message Queues
  • Cloud: AWS/GCP/Azure (whatever your team uses)
  • Monitoring: Prometheus, Grafana, custom LLM analytics

Company: Smartncode
Industry: Information Technology
Location: Tech City
