Jobs
Interviews

29 Retrieval-Augmented Generation Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 8.0 years

19 - 20 Lacs

Chennai, Bengaluru

Work from Office

We are hiring AL/ML Engineers with 5–8 years of experience in AI/ML, Agentic AI, Gen AI, TensorFlow, RAG pipelines, and LLM flows. Required Candidate profile Experienced AI/ML Engineer with expertise in Agentic AI, Gen AI, TensorFlow, RAG pipelines, and LLM workflows. Seeking challenging lead-level opportunities. Ready to join within 30 days.

Posted 3 days ago

Apply

9.0 - 14.0 years

17 - 30 Lacs

Hyderabad

Work from Office

Role Details: Position: AI Lead Location: Hyderabad, Telangana, India Key Responsibilities: Spearhead the end-to-end development, deployment, and continuous optimization of cutting-edge AI/ML systemsGenerative AI, LLMs/MLLMs, RAG, autonomous agents, and custom MCP servers in production Architect and implement scalable Retrieval-Augmented Generation pipelines, multi-modal transformer systems, and autonomous multi-agent frameworks (LangChain, AutoGPT, BabyAGI, NVIDIA NeMo Agents) Lead deep-learning model design and delivery: ANN, CNN, RNN, LSTM/BiLSTM, attention-based Transformers (BERT, GPT, T5, Vision Transformers), GANs, VAEs, Diffusion models, and Mixture-of-Experts architectures Define and enforce best practices for prompt engineering, fine-tuning (LlamaIndex, Hugging Face), quantization (INT4, GGUF), pruning, distillation, and advanced inference optimizations Build and maintain robust MLOps pipelines: CI/CD (Git, Bitbucket, automated tests), model versioning (MLflow, Hugging Face Hub & LFS), Docker/GPU containerization, Kubernetes/OpenShift orchestration, Terraform infrastructure as code Design high-performance data engineering workflows: ETL optimization (Pandas, NumPy, Spark), real-time streaming (Kafka, RabbitMQ), and caching strategies (Redis) Ensure secure, compliant AI deployments by embedding Responsible AI principles (bias mitigation, transparency, privacy), governance frameworks, and regulatory controls. Monitor emerging research (self-supervised pre-training, federated learning, synthetic data, LLMOps) and integrate top innovations into production Collaborate closely with Product, DevOps, Data Engineering, and Security teams; mentor and guide engineers and data scientists in software engineering and model validation best practices Required Qualifications: Education: Masters or PhD in AI/ML, Computer Science (AI/ML specialization), Data Science, Mathematics, or related field Industry Experience: 15+ years in AI/ML roles, including 710 years hands-on in Generative AI and Deep Learning, plus 5+ years in senior or leadership positions Hands-On Coding & Deployment: Expert at architecting, writing, debugging, deploying, and optimizing production-grade AI/ML code in Python (primary), C++, R, Java/Spring Boot, with strong testing, code-review, and performance-profiling discipline Deep Learning & Generative AI Frameworks: Advanced usage of PyTorch, TensorFlow, ONNX, GGUF, Hugging Face Transformers & Diffusers; fine-tuning pipelines (LlamaIndex, custom scripts) Large Language & Multimodal Models: Production deployment of LLMs, Multimodal LLMs (vision-language models, cross-modal understanding), RAG systems, and custom MCP A2A servers; embedding management & vector search with Pinecone, Milvus, Chroma, Quadrant Advanced Model Optimization: Quantization, pruning, distillation, MoE routing, efficient inference strategies, prompt-engineering platforms (LangSmith, PromptFlow) Classical & Advanced ML: Proficiency in regression, decision trees, random forests, SVM; boosting (XGBoost, CatBoost, LightGBM); clustering (K-Means, DBSCAN, hierarchical); RL (Q-Learning, DDPG, PPO); statistical methods and optimization math Computer Vision & Document Processing: Expertise in OpenCV, PyMuPDF, python-docx/pptx, pytesseract for advanced text/image extraction and analysis Cloud & Infrastructure: GPU acceleration (CUDA, cuDNN); AWS (Bedrock, SageMaker), Azure AI Studio & GPU Containers, GCP (Vertex AI, Cloud Run), serverless AI, NVIDIA Triton Inference Server Backend & APIs: FastAPI, OpenAPI, Django REST Framework, Flask for building and scaling AI microservices Data Engineering & Storage: ETL pipelines, streaming, caching; relational (MySQL, Oracle SQL), NoSQL (MongoDB), vector DBs MLOps & Observability: MLflow, Kubeflow; Docker, Kubernetes, OpenShift; CI/CD (Git, Bitbucket); monitoring with Prometheus, Grafana, OpenTelemetry, Logstash, Kibana Security & Compliance: Secure coding, network security, and adherence to GDPR, HIPAA, GxP standards Industry Certifications & Research: Must hold at least one recognized AI/ML certification (e.g. AWS Certified Machine Learning – Specialty, Google Professional Machine Learning Engineer, Microsoft Certified: Azure AI Engineer Associate) and have published research papers in top-tier AI/ML conferences or journals (NeurIPS, ACM, arXiv, AAAI, JMLR). Good to Have Skills: PhD-level publications and advanced certifications (Stanford AI, MIT, AWS, Google) Experience with federated learning, synthetic data generation, privacy-preserving ML (differential privacy, homomorphic encryption), and edge/in-device inference (TensorFlow Lite, CoreML) Familiarity with Responsible AI toolkits, model cards/DataCard’s, and AI risk management frameworks Significant open-source contributions to major AI/ML projects Proven success integrating AI in regulated industries (Texting/Messaging, pharma, healthcare, finance) Industries and disciplines. How to Apply: If you are a passionate developer who loves coding and thrives in an innovative startup environment, apply now to join our journey in revolutionizing communication! Apply here or email to join@beetexting.com

Posted 1 week ago

Apply

5.0 - 8.0 years

15 - 18 Lacs

Mumbai Suburban, Navi Mumbai, Mumbai (All Areas)

Hybrid

Role & responsibilities Experimenting with new Generative AI models Developing proof of concepts and pilots for various Generative AI and Agentic AI use cases Collaborating with Solution Architects and Enterprise Architects Evangelizing Generative AI and AI adoption within the bank Creating and presenting demos to promote reference architectures Experience: 5+ years of relevant experience Skills: 1. Programming Languages: a. Python 2. Machine Learning Frameworks: a. PyTorch b. TensorFlow 3. Experience with Generative AI Models: a. GANs (Generative Adversarial Networks) b. VAEs (Variational Autoencoders) c. Transformers 4. Cloud Platforms: a. AWS (Amazon Web Services) b. GCP (Google Cloud Platform) c. Azure 5. Data Handling: a. Feature stores b. Vector stores c. Retrieval-Augmented Generation (RAG) d. Data visualization 6. Big Data Technologies: a. Spark 7. Workflow Orchestration: a. Airflow

Posted 1 week ago

Apply

4.0 - 9.0 years

25 - 35 Lacs

Bengaluru

Remote

AI/ML Development Leadership: Lead the implementation of machine learning models and automation pipelines for CPT/ICD code prediction and claims processing. Develop and optimize retrieval-augmented generation (RAG) workflows using LLMs, vector databases (e.g., FAISS), and custom prompts. Direct the design of structured training datasets derived from SOAP notes, payer files, and denial records. Team & Project Management: Manage day-to-day activities of India-based engineers and coding specialists. Coordinate closely with U.S.-based consultants to ensure AI solutions align with reimbursement policy and documentation standards. Track project milestones, guide model improvements, and ensure output quality. Technical Execution: Build, fine-tune, and deploy models using PyTorch, TensorFlow, HuggingFace Transformers , and scikit-learn . Integrate LLM APIs for code summarization and document understanding. Implement vector search and orchestration platforms for real-time AI assistance. Role & responsibilities Preferred candidate profile

Posted 1 week ago

Apply

3.0 - 6.0 years

5 - 10 Lacs

Chennai

Work from Office

Job Title: AI Engineer Location: Chennai Experience: 2-4 years Employment Type: Full-Time Job Summary: We are seeking an AI Engineer with strong proficiency in Python and hands-on experience in building AI-powered applications. The ideal candidate should have experience working with FastAPI, Langchain, and Pydantic, along with a solid understanding of Generative AI concepts, Large Language Models (LLMs), Prompt Engineering, and Retrieval-Augmented Generation (RAG). Key Responsibilities: Develop, integrate, and optimize AI-powered applications using FastAPI and Langchain. Design and deploy APIs using FastAPI with Pydantic for data validation. Implement and fine-tune solutions leveraging Large Language Models (LLMs). Build and optimize Prompt Engineering pipelines for LLM interactions. Design and implement Retrieval-Augmented Generation (RAG) solutions for enhanced contextual responses. Collaborate with data scientists, product managers, and engineers to deliver high-quality AI applications. Continuously research and experiment with advancements in Generative AI technologies. Key Skills & Technologies: Programming: Python (Intermediate to Advanced) Frameworks & Libraries: FastAPI (API development) Langchain (LLM integrations & workflows) Pydantic (Data validation and settings management) AI/ML Concepts: Generative AI (Basics) Large Language Models (LLM) Understanding architecture, capabilities, and limitations Prompt Engineering – Techniques for effective LLM interactions Retrieval-Augmented Generation (RAG) – Building hybrid retrieval-generation systems Version Control: Git, GitHub/GitLab CI/CD: Familiarity with deployment pipelines (optional) Preferred/Optional (Good to Have): Experience with AI Agent Frameworks such as: LangGraph CrewAI Knowledge of Vector Databases (e.g., FAISS, Chroma) Cloud Platforms (AWS, Azure, GCP) Containerization (Docker)

Posted 2 weeks ago

Apply

5.0 - 7.0 years

12 - 15 Lacs

Kolkata

Remote

Design and fine-tune AI apps using GPT, BERT, Azure OpenAI, and Cognitive Services. Build chatbots with Copilot Studio, RAG pipelines, vector DBs, Graph API, and M365 Plugins. Code in Python/Node.js with REST API integration. Required Candidate profile 5+ years of experience in AI/ML or Conversational AI Strong grip on prompt engineering and model fine-tuning Deep understanding of Microsoft’s AI ecosystem

Posted 2 weeks ago

Apply

7.0 - 9.0 years

21 - 24 Lacs

Bengaluru

Work from Office

Responsibilities: * Lead ML projects from ideation to deployment using Microsoft tools. * Collaborate with cross-functional teams on project requirements and deliverables. Work from home

Posted 2 weeks ago

Apply

4.0 - 6.0 years

5 - 11 Lacs

Bengaluru

Work from Office

Role: Artificial Intelligence Engineer Location Bangalore (Ashok Nagar) Experience : 3 to 5 years Education : Bachelor's/Masters Degree in Technology. Salary : Negotiable Job Type : Full Time (On Role) Mode of Work : Work from Office Job Description We are looking for an experienced AI Engineer to join our team and develop advanced AI solutions. The role involves designing, training, and deploying machine learning models to address complex business challenges. Proficiency in Python, TensorFlow, and experience working with large datasets are critical for success. Preferred Skills: Proficiency in Python and NLP libraries such as OpenAIs GPT, LangChain, etc. Knowledge of Large Language Models (LLMs) and fine-tuning techniques Retrieval-Augmented Generation (RAG) for improving AI response Experience in API development and hosting (IIS preferred) Basic understanding of SQL Expertise in training, tuning, deploying, and leveraging models with SQL databases Technical Skills: Proficiency in programming languages such as Python, R, or Java Ability to integrate AI models into existing systems and applications Strong knowledge of machine learning algorithms (e.g., regression, classification, clustering) Experience with deep learning frameworks such as TensorFlow and PyTorch Expertise in data manipulation and cleaning techniques Familiarity with cloud platforms (AWS, Azure, GCP) for AI deployment Interested candidates kindly share your CV and below details to usha.sundar@adecco.com 1) Present CTC (Fixed + VP) - 2) Expected CTC - 3) No. of years experience - 4) Notice Period - 5) Offer-in hand - 6) Reason of Change - 7) Present Location -

Posted 2 weeks ago

Apply

3.0 - 8.0 years

10 - 20 Lacs

Chennai

Work from Office

We are seeking a Software Engineer with expertise in AI/ML and Full stack development to contribute to the development of core platform features. This role involves designing, developing, and optimizing high-performance AI-driven applications, building scalable microservices, and ensuring seamless integration across AI, backend, and frontend systems. You will play a key role in developing workflow automation modules, AI-powered search engines, and scalable enterprise solutions. Skills: P1 (must to have Skills) Generative AI Expertise/ AI Agents, Agentic Workflows/ RAG/ Prompt Engineering / Advanced Python Programming / LangChain / LangGraph / Vector Databases. P2 (need to have Skills ) FastAPI, NodeJS / Intuitive UI/UX Design: Either React or Typescript or Next.js. P3 (nice to have Skills) LlamaIndex / Integration of SLM/LLMs. Required Skills Generative AI Expertise : Advanced knowledge in AI Agents, Agentic Workflows, Retrieval-Augmented Generation (RAG), and Prompt Engineering. Advanced Python Programming : Proficiency in Python, particularly for AI-driven applications, with experience in frameworks like LangChain and LangGraph. Vector Databases : Strong skills in managing and utilizing vector databases for AI solutions. FastAPI & NodeJS : Experience in building backend services using FastAPI or NodeJS. UI/UX Design : Ability to design and implement intuitive user interfaces, with beginner to intermediate proficiency in React. LlamaIndex & SLM/LLM Integration : Familiarity with LlamaIndex and expertise in integrating System Language Models/Large Language Models into applications. Preferred Education and Experience: Bachelors/masters degree in computer science, AI, Machine Learning, or related field. At least 3 years in full-stack development with a focus on AI. Proven track record leading small engineering teams and delivering complex AI-driven products. Direct client-facing experience, from requirements gathering to final delivery.

Posted 2 weeks ago

Apply

3.0 - 4.0 years

8 - 12 Lacs

Hyderabad

Remote

Python AI Developer (Work From Home ) Experience Required: 3 to 4 Years Job Type: Full-Time Department: Artificial Intelligence / Machine Learning Reports To: CEO Programming Languages : Python, JavaScript Key Responsibilities: Design and implement scalable AI applications using Python, with a strong focus on LLM-based architectures. Develop and fine-tune LLMs using frameworks such as Hugging Face Transformers, LangChain, or OpenAI APIs. Build and deploy Retrieval-Augmented Generation (RAG) systems combining LLMs with vector databases and search tools. Integrate LLMs into products and workflows via RESTful APIs and/or microservices. Work with unstructured and semi-structured data (e.g., text, PDFs, web content) to build NLP pipelines. Optimize model inference performance and reduce latency for real-time applications. Collaborate with data scientists, backend engineers, and product managers to deliver high-impact solutions. Stay up to date with the latest advancements in LLMs, RAG systems, and AI tooling. Implemented Retrieval-Augmented Generation (RAG) to enable dynamic and context-aware responses in the conversational AI system. Designed and developed Conversational AI chat functionality, leveraging Redis for session management and real-time performance. Built and optimize RESTful APIs, ensuring efficient communication between backend services and the frontend. Collaborate closely with the frontend team to integrate APIs and ensure seamless user experience. Actively participate in bug fixing, performance optimization, and code reviews to maintain code quality and application stability. Provid support for other projects within the organization, assisting with debugging, enhancements, and knowledge sharing. Required Skills & Qualifications: Bachelors or Masters in Computer Science, Artificial Intelligence, Data Science, or a related field with 3–4 years of experience with Python, focused on AI/ML development. Strong experience with LLMs (e.g., GPT, LLaMA, Claude, Mistral) and prompt engineering. Hands-on experience with LangChain, Haystack, Hugging Face, or similar libraries for building RAG-based systems. Familiarity with vector databases like FAISS, Pinecone, Weaviate, or Qdrant. Proficiency in NLP concepts such as tokenization, embeddings, named entity recognition, and summarization. Experience with data processing libraries (Pandas, NumPy) and ML frameworks (PyTorch, TensorFlow, scikit-learn). Strong understanding of RESTful API development using FastAPI, Flask, or Django. Familiar with Git, Docker, and model deployment best practices. Preferred Qualifications: Experience fine-tuning LLMs or using adapters/LoRA for domain-specific tasks. Exposure to cloud services (AWS Sagemaker, Azure ML, or GCP Vertex AI). Knowledge of MLOps and model lifecycle management tools. Familiarity with search/retrieval tools (Elasticsearch, OpenSearch). Understanding of document ingestion pipelines and OCR/NLP integration. Benefits: Competitive compensation package Remote work flexibility

Posted 3 weeks ago

Apply

3.0 - 5.0 years

4 - 9 Lacs

Jaipur

Work from Office

Job Title: AI Engineer Experience: 3 - 5 Years Location: Jagatpura, Jaipur Type: Full-Time Kindly share your CV at shubhanshu.mishra@dotsquares.com Job Summary: We are seeking a highly skilled AI Engineer (Senior Python Developer) with 3- 5 years of experience, including at least 3+ years of hands-on expertise in LLM implementation , Retrieval-Augmented Generation (RAG) architectures, and multi-agent systems . This role is ideal for someone who enjoys working across the stack from data engineering and scalable backend APIs to cutting-edge Generative AI and MLOps workflows. You will take full ownership of building production-grade AI applications and intelligent systems, collaborating cross-functionally with stakeholders and product teams. Key Responsibilities: Design and develop AI applications using Large Language Models (LLMs) , RAG , and multi-agent architectures . Build high-performance APIs using FastAPI , including scalable, secure, and well-documented endpoints. Lead development of Generative AI-based conversational agents (text and voice) with support for low-latency communication and WebSocket integrations. Architect end-to-end solutions integrating vector databases (FAISS, pgvector, Chroma, Pinecone) for semantic and embedding-based search. Develop efficient ETL and data pipelines using Pandas, PySpark, and SQL. Apply and optimize caching techniques (Redis, memory-based) and background task queues using Celery . Ensure robust MLOps workflows : CI/CD for ML, model versioning, monitoring, retraining, deployment, and registry. Perform database integration with relational and NoSQL systems ( MySQL, PostgreSQL, MongoDB ). Write optimized and complex SQL queries for reporting, analytics, and dynamic retrieval. Deploy scalable services and ML models in cloud environments such as AWS , Azure , or GCP . Communicate effectively with technical and non-technical stakeholders; gather requirements and translate them into actionable solutions. Must-Have Skills: Python (Expert) with strong fundamentals and extensive experience in Pandas and PySpark . Hands-on experience with LLMs , RAG , multi-agent systems , and GenAI applications. Strong knowledge of FastAPI for API development. Advanced proficiency in SQL capable of writing efficient, optimized, and analytical queries. Experience with vector databases : FAISS, pgvector, Chroma, Pinecone. Background task orchestration using Celery . Redis and memory-based caching for performance. Strong system design and problem-solving skills. Experience with MLOps practices and tools (model registry, CI/CD, retraining, serving, monitoring). Hands-on with MySQL , PostgreSQL , and NoSQL databases (e.g., MongoDB). Strong verbal and written communication; ability to interface directly with clients and stakeholders. Experience deploying in cloud environments AWS, Azure, or GCP. Good to Have: Experience with WebSocket for real-time, low-latency systems. Conversational AI involving voice + text interfaces . Fine-tuning or retraining open-source LLMs (e.g., LLaMA , Falcon , Mistral ). Familiarity with MCP architecture (Model-Context-Prompt) for GenAI systems. Exposure to Docker , Kubernetes , and scalable ML deployment patterns. Experience with ETL orchestration tools like Airflow, Prefect (if applicable). Qualifications: Bachelors or Masters degree in Computer Science , Artificial Intelligence , Data Science , or a related field. 3- 5 years of experience in backend development and AI systems with at least 3 years in GenAI / LLM-based projects . Proven ownership of production-grade AI solutions and real-world deployment.

Posted 3 weeks ago

Apply

5.0 - 10.0 years

5 - 10 Lacs

Hyderabad, Chennai, Bengaluru

Work from Office

Position : Solution Architect Location : PAN India (Onsite) Position Type : Permanent Job Description: 5-12 years experience in AI and machine learning, with a strong focus on solution architecture. Should have hands-on experience in deploying at least one end to end GenAI project. Proven experience with Azure OpenAI LLMs and its application in enterprise environments. Hands-on experience with Retrieval-Augmented Generation (RAG) systems. Expertise in Azure cloud services and architecture. Strong programming skills in Python and familiarity with AI development tools and libraries. Knowledge of AI model training, fine-tuning, and deployment processes. Excellent problem-solving and analytical skills. Strong leadership and project management abilities. Effective communication and interpersonal skills.

Posted 3 weeks ago

Apply

6.0 - 11.0 years

25 - 30 Lacs

Chennai

Work from Office

Seeking a GenAI Software Engineer with 5+ years’ experience to develop LLM-based applications, RAG, and AI agents. Must be skilled in LangChain, vector databases, and cloud-based AI deployment in agile teams. Required Candidate profile Experienced AI developer with expertise in GenAI, LLM apps, and RAG. Proficient in LangChain, vector DBs, and cloud AI services. Strong collaboration and agile development background preferred.

Posted 4 weeks ago

Apply

7.0 - 10.0 years

15 - 30 Lacs

Hyderabad

Work from Office

Job Title: Senior AI/ML Engineer Custom LLM & RAG Implementation Location: On-site Experience: 7+ years Industry: Cross-Domain (Finance, Healthcare, Logistics, Retail, LegalTech, etc.) About the Role We are seeking a highly skilled and hands-on Senior AI/ML Engineer with strong expertise in building and deploying custom Large Language Models (LLMs) . The ideal candidate will have demonstrable experience in Retrieval-Augmented Generation (RAG) implementations, fine-tuning foundation models , and solving complex problems across domains using applied machine learning and natural language processing. This role is strategic and technical, requiring a blend of research, solution engineering, MLOps maturity, and domain adaptability. Key Responsibilities LLM Development & Deployment Design, build, and deploy customized LLM pipelines tailored to enterprise use cases. Implement end-to-end LLMOps workflows including model packaging, CI/CD, and monitoring. RAG & Fine-Tuning Implement RAG pipelines using vector databases (e.g., FAISS, Pinecone, Weaviate) and document ingestion frameworks (e.g., LangChain, Haystack). Fine-tune open-source LLMs (e.g., LLaMA, Falcon, Mistral, MPT) on proprietary datasets using frameworks like Hugging Face Transformers and PEFT/LoRA. Solution Engineering Translate business problems into ML/LLM solutions with clear problem framing and data strategy . Collaborate with product, data engineering, and domain teams to prototype and deliver scalable solutions. Cross-Functional AI Applications Apply ML/LLM solutions across multiple verticals such as legal document analysis, customer support automation, compliance, supply chain optimization, or medical NLP. Build domain-agnostic prompt engineering strategies and apply zero-shot/few-shot learning where appropriate. Leadership & Mentorship Mentor junior engineers and contribute to AI/ML best practices . Act as a thought partner in innovation and experimentation within the team and with external stakeholders. Required Skills & Qualifications Bachelors or Master’s in Computer Science, AI/ML, Data Science, or related field. 5+ years of hands-on experience in ML/NLP , with a recent focus on LLMs and foundation models. Deep knowledge of Hugging Face ecosystem , PyTorch/TensorFlow, LangChain, OpenAI APIs, and popular model libraries. Experience deploying ML models to production using FastAPI, Docker, Kubernetes , or cloud-native tools (AWS/GCP/Azure). Familiarity with vector databases , embeddings, and search frameworks. Strong understanding of model evaluation metrics , bias/fairness in ML, and responsible AI practices. Ability to work cross-functionally with business, legal, and engineering teams. Preferred Qualifications Experience with RLHF (Reinforcement Learning from Human Feedback) . Published work in open-source communities or AI research conferences. Familiarity with multi-modal AI , autoML , or agentic workflows is a plus. Prior work in regulated domains (e.g., finance, healthcare, legal). Why Join Us Work on cutting-edge AI/LLM use cases that span industries and functions. Lead mission-critical AI initiatives from ideation to deployment. Be part of a collaborative, innovation-driven team shaping the next generation of enterprise AI solutions.

Posted 1 month ago

Apply

5.0 - 10.0 years

18 - 20 Lacs

Noida

Work from Office

Key Responsibilities Design and develop AI solutions that address real-world business challenges, ensuring alignment with strategic objectives and measurable outcomes. Work with large-scale structured and unstructured datasets, leveraging modern data frameworks, tools, and platforms. Establish and maintain robust standards for data security, privacy, and regulatory compliance across all AI and data workflows. Collaborate closely with cross-functional teams to gather requirements, share insights, and deliver high-impact solutions. Monitor and maintain production AI systems to ensure continued accuracy, scalability, and reliability over time. Stay up to date with the latest advancements in AI, machine learning, and data engineering, and apply them where relevant. Write clean, well-documented, and maintainable code, and actively contribute to team best practices and technical documentation. Required Skills & Qualifications Bachelors or Masters degree in Computer Science, Data Science, or a related field Strong programming skills in Python (preferred) and experience with AI/ML libraries such as TensorFlow, PyTorch, scikit-learn, or Hugging Face Experience designing and deploying machine learning models and AI systems in production environments Familiarity with modern data platforms and cloud services (e.g., Azure, AWS, GCP), including AutoML and MLflow Proficiency with data processing tools and frameworks (e.g., Spark, Pandas, SQL) and working with both structured and unstructured data Experience with Generative AI technologies, including prompt engineering, vector databases, and RAG (Retrieval- Augmented Generation) pipelines Solid understanding of data security, privacy, and compliance principles, with experiencng implementing these in real-world projects Strong problem-solving skills and ability to translate complex business problems into technical solutions Excellent communication and collaboration skills, with the ability to work effectively across technical and non- technical teams Experience with version control (e.g., Git) and agile development practices Enthusiasm for learning and applying emerging technologies in AI and machine learning

Posted 1 month ago

Apply

1.0 - 3.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Job Overview Huemn is seeking a talented Generative AI Engineer to join our dynamic team in Hyderabad. This full-time, junior-level position requires a professional with 1 to 3 years of experience in the field. As a Generative AI Engineer, you will be instrumental in advancing our AI capabilities, especially in leveraging natural language processing and large language models to revolutionize studio management for photographers around the world. Qualifications and Skills Proficiency in Python and experience with PyTorch for building and deploying machine learning models. Knowledge of deep learning techniques and their application in real-world scenarios to solve complex problems. Hands-on experience with TensorFlow for implementing machine intelligence in scalable systems. In-depth understanding of machine learning algorithms and neural networks for developing intelligent applications. Expertise in natural language processing (Mandatory skill) for effective communication and interaction with systems. Strong proficiency in working with large language models (LLM) (Mandatory skill) to enhance cognitive computing capabilities. Experience in retrieval-augmented generation (RAG) (Mandatory skill) for innovative AI-driven solutions. Ability to collaborate with cross-functional teams and contribute to innovative projects in a fast-paced environment. Roles and Responsibilities Develop and optimize natural language processing applications to enhance our studio management platform. Implement and maintain large language models to advance Huemn's AI-driven offerings. Utilize retrieval-augmented generation techniques to innovate new AI functionalities. Collaborate with team members to integrate AI solutions into existing systems and workflows. Conduct research and stay updated on the latest advancements in generative AI technologies. Ensure the scalability and efficiency of AI models in production environments. Participate in code reviews, feedback sessions, and contribute to engineering best practices. Document processes, algorithms, and code for future reference and knowledge sharing.

Posted 1 month ago

Apply

2.0 - 4.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Req ID: 312439 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Mid Data Science & AIML, GenAI Lead/Engineer to join our team in Bangalore, Karn?taka (IN-KA), India (IN). Job Duties: Job Title: Data Science & AIML, GenAI Lead/Engineer Responsibilities: Conduct data analysis using Python. Develop AI models leveraging Langchain, LlamaIndex, and Azure OpenAI Services. Utilize Generative AI models and Natural Language Processing techniques. Implement Retrieval-Augmented Generation (RAG) solutions. Apply machine learning algorithms, data pre-processing, and manage AI lifecycle (model training, validation, monitoring). Minimum Skills Required: Qualifications: 2+ years of experience with AI Python development Bachelor's degree in computer science, Information Technology, or related field. Strong analytical skills and attention to detail. Ability to work independently or as part of a team. #GenAINTT About NTT DATA NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at NTT DATA endeavors to make accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at . This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click . If you'd like more information on your EEO rights under the law, please click . For Pay Transparency information, please click.

Posted 1 month ago

Apply

2.0 - 7.0 years

8 - 18 Lacs

Hyderabad

Work from Office

Location: Hyderabad, India | Employment Type: Full-Time Experience Level: 2+Years Company: Covasant Contact Person: Ranjith Reddy 9703455109 | ranjith.palle@covasant.cm | linkedin.com/in/ranjith-r-75a766227 Build the Future of AI with Covasant At Covasant , we don't just work with AI we engineer the next era of it. We're hiring mid-level to senior developers and AI leads to help us build next-generation agentic AI systems that are intelligent, collaborative, and scalable. This is your chance to go beyond prompt engineering and shape the architecture of autonomous, multi-agent AI solutions using tools like LangGraph, AutoGen, CrewAI , and more. If youve got the skills and curiosity to work on what the AI world will be talking about next year , we want to hear from you. Your Role Design and develop multi-agent LLM systems using LangGraph, AutoGen, or CrewAI. Build and deploy MCP servers , LLM gateways , and design Agent-to-Agent collaboration flows. Fine-tune language models for verticals like healthcare, manufacturing, or finance. Architect retrieval-augmented generation (RAG) systems with vector stores like FAISS, Pinecone, or Weaviate. Integrate tools like LangSmith , GuardrailsAI , and knowledge graphs to ensure trust, safety, and observability. Collaborate cross-functionally with product, data science, and engineering teams. What You Bring 2+ years in software development, with strong Python skills. Proven expertise in one or more: LangGraph , AutoGen , CrewAI . Deep understanding of Agent-based AI , LLM orchestration , and RAG pipelines . Experience fine-tuning LLMs and applying prompt engineering and domain adaptation . Familiarity with tools like LangSmith , PromptGuard , or Guardrails frameworks. Bonus If You Have Cloud experience (AWS, Azure, GCP) Familiarity with Docker, Kubernetes Exposure to multi-modal models (LLaMA, Mistral, Falcon) Frontend tech: React, Angular, or Vue CI/CD, MLOps, or LLMOps knowledge Important We’re currently hiring mid-level, senior, and lead professionals with hands-on experience in AI/ML projects . These openings are not for freshers or professionals with less than 2 years of experience — but we are planning something exciting for early-career AI talent soon! Why Join Covasant Work on real-world agentic AI systems ahead of industry trends Collaborative and innovation-first work culture Competitive pay, benefits & performance incentives Hybrid/flexible work setup A chance to lead and influence the next chapter in AI Let’s Connect If this excites you — whether or not you're actively job hunting — don’t miss the chance to explore this game-changing opportunity. Ranjith Reddy – 9703455109 ranjith.palle@covasant.cm Connect with me on LinkedIn – I’d love to stay in touch, even if this isn’t the right time. Apply now or just start a conversation. The future of AI doesn’t wait — and neither should you.

Posted 1 month ago

Apply

2.0 - 4.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Req ID: 312429 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Mid Data Science & AIML, GenAI Lead/Engineer to join our team in Bangalore, Karn?taka (IN-KA), India (IN). Job Duties: Job Title: Data Science & AIML, GenAI Lead/Engineer Responsibilities: Conduct data analysis using Python. Develop AI models leveraging Langchain, LlamaIndex, and Azure OpenAI Services. Utilize Generative AI models and Natural Language Processing techniques. Implement Retrieval-Augmented Generation (RAG) solutions. Apply machine learning algorithms, data pre-processing, and manage AI lifecycle (model training, validation, monitoring). Minimum Skills Required: Qualifications: 2+ years of experience with AI Python development Bachelor's degree in computer science, Information Technology, or related field. Strong analytical skills and attention to detail. Ability to work independently or as part of a team. #GenAINTT About NTT DATA NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at NTT DATA endeavors to make accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at . This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click . If you'd like more information on your EEO rights under the law, please click . For Pay Transparency information, please click.

Posted 1 month ago

Apply

10.0 - 20.0 years

25 - 40 Lacs

Noida

Work from Office

About DigiLantern DigiLantern is a forward-thinking technology services company focused on delivering AI-powered digital transformation solutions . Our services include custom web and mobile application development , Salesforce and CRM integrations , AI agents and automation , SaaS product development , and e-commerce solutions . We help clients accelerate growth, enhance customer engagement, and optimize operations through intelligent and scalable technology. Role Overview We are seeking a highly skilled Technical Architect to lead solution architecture and system design across diverse digital and AI-first projects. You will be responsible for building scalable platforms with a strong focus on Agent AI frameworks , Retrieval-Augmented Generation (RAG) pipelines, and modern cloud-native infrastructure. This is a key leadership role that blends hands-on technical design with strategic planning and client engagement. Key Responsibilities Architect robust, scalable, and secure cloud-based solutions for enterprise-grade applications Design and integrate Agent AI workflows using frameworks like LangChain, AutoGen, or Semantic Kernel Build RAG pipelines for knowledge retrieval from vector databases (e.g., FAISS, Pinecone, Weaviate) Collaborate with product managers, developers, and clients to translate requirements into technical blueprints Lead architecture reviews, code quality standards, and technical governance across projects Guide engineering teams on architectural best practices, DevOps, and performance optimization Create technical documentation, architecture diagrams, and project-level design assets Stay current with advancements in AI/ML, cloud infrastructure, and modern development stacks Required Qualifications 10+ years of experience in software development and system architecture Strong backend expertise in Python, Java, Node.js and relevant frameworks (Django, Spring Boot, Express.js) Hands-on experience with cloud platforms : AWS (preferred), GCP, Azure Proven experience with containerization and orchestration : Docker, Kubernetes Deep understanding of Agent AI architectures and LLM orchestration (LangChain, AutoGen, CrewAI) Experience building or integrating RAG-based solutions using vector databases and embedding models Strong knowledge of APIs (REST, GraphQL), security protocols (OAuth2, JWT), and DevSecOps principles Excellent communication and problem-solving skills Bachelor's or Masters degree in Computer Science, Engineering, or a related discipline Preferred Experience Designing and deploying AI agents or automation tools with real-time decision-making capabilities Working with OpenAI APIs, Hugging Face, or custom LLMs Experience in regulated industries such as healthcare, finance, or nonprofit Collaborating with global, cross-functional engineering teams in agile environments Exposure to compliance standards such as HIPAA, GDPR Technology Stack AI & Data Agent AI: LangChain, AutoGen, CrewAI, Semantic Kernel, OpenAI APIs RAG: FAISS, Pinecone, Weaviate, ChromaDB, Haystack ML Tools: TensorFlow, PyTorch, Scikit-learn Databases: PostgreSQL, MongoDB, Redis, MySQL Analytics: Google Analytics, Power BI, Tableau Backend Python, Java, Node.js, PHP Django, Flask, Spring Boot, Express.js Frontend JavaScript / TypeScript React, Angular, Vue.js Material-UI, TailwindCSS Cloud & DevOps AWS, GCP, Azure Jenkins, GitHub Actions, Terraform, Ansible CI/CD with GitLab, CircleCI Security & Integration OAuth2, JWT, SAML SSL/TLS, AES encryption API tools: Swagger, Postman, MuleSoft Why Join DigiLantern? Architect real-world, high-impact solutions with cutting-edge AI technologies Be a key technical leader in a fast-growing digital innovation firm Opportunity to shape future-ready platforms across industries Competitive compensation, collaborative culture, and high ownership Thanks & Regards Pankhuri Agarwal Assistant Manager-HR pankhuri.agarwal@digilantern.co.in 9821486056 www.digilantern.com

Posted 1 month ago

Apply

0.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

________________________________________ Ready to shape the future of work At Genpact, we don&rsquot just adapt to change&mdashwe drive it. AI and digital innovation are redefining industries, and we&rsquore leading the charge. Genpact&rsquos AI Gigafactory, our industry-first accelerator, is an example of how we&rsquore scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that&rsquos shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions - we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook. Inviting applications for the role of Senior Principal Consultant- Senior Data Engineer - Snowflake, AWS, Cortex AI & Horizon Catalog Role Summary: We are seeking an experienced Senior Data Engineer with deep expertise in modernizing Data & Analytics platforms on Snowflake, leveraging AWS services, Cortex AI, and Horizon Catalog for high-performance, AI-driven data management. The role involves designing scalable data architectures, integrating AI-powered automation, and optimizing data governance, lineage, and analytics frameworks. Key Responsibilities: . Architect & modernize enterprise Data & Analytics platforms on Snowflake, utilizing AWS, Cortex AI, and Horizon Catalog. . Design and optimize Snowflake-based Lakehouse architectures, integrating AWS services (S3, Redshift, Glue, Lambda, EMR, etc.). . Leverage Cortex AI for AI-driven data automation, predictive analytics, and workflow orchestration. . Implement Horizon Catalog for enhanced data lineage, governance, metadata management, and security. . Develop high-performance ETL/ELT pipelines, integrating Snowflake with AWS and AI-powered automation frameworks. . Utilize Snowflake&rsquos native capabilities like Snowpark, Streams, Tasks, and Dynamic Tables for real-time data processing. . Establish data quality automation, lineage tracking, and AI-enhanced data governance strategies. . Collaborate with data scientists, ML engineers, and business stakeholders to drive AI-led data initiatives. . Continuously evaluate emerging AI and cloud-based data engineering technologies to improve efficiency and innovation. Qualifications we seek in you! Minimum Qualifications . experience in Data Engineering, AI-powered automation, and cloud-based analytics. . Expertise in Snowflake (Warehousing, Snowpark, Streams, Tasks, Dynamic Tables). . Strong experience with AWS services (S3, Redshift, Glue, Lambda, EMR). . Deep understanding of Cortex AI for AI-driven data engineering automation. . Proficiency in Horizon Catalog for metadata management, lineage tracking, and data governance. . Advanced knowledge of SQL, Python, and Scala for large-scale data processing. . Experience in modernizing Data & Analytics platforms and migrating on-premises solutions to Snowflake. . Strong expertise in Data Quality, AI-driven Observability, and ModelOps for data workflows. . Familiarity with Vector Databases & Retrieval-Augmented Generation (RAG) architectures for AI-powered analytics. . Excellent leadership, problem-solving, and stakeholder collaboration skills. Preferred Skills: . Experience with Knowledge Graphs (Neo4J, TigerGraph) for structured enterprise data systems. . Exposure to Kubernetes, Terraform, and CI/CD pipelines for scalable cloud deployments. . Background in streaming technologies (Kafka, Kinesis, AWS MSK, Snowflake Snowpipe). Why Join Us . Lead Data & AI platform modernization initiatives using Snowflake, AWS, Cortex AI, and Horizon Catalog. . Work on cutting-edge AI-driven automation for cloud-native data architectures. . Competitive salary, career progression, and an opportunity to shape next-gen AI-powered data solutions. ________________________________________Why join Genpact . Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation . Make an impact - Drive change for global enterprises and solve business challenges that matter . Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities . Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day . Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let&rsquos build tomorrow together. Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a %27starter kit,%27 paying to apply, or purchasing equipment or training.

Posted 1 month ago

Apply

0.0 years

0 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

Ready to shape the future of work At Genpact, we don&rsquot just adapt to change&mdashwe drive it. AI and digital innovation are redefining industries, and we&rsquore leading the charge. Genpact&rsquos AI Gigafactory, our industry-first accelerator, is an example of how we&rsquore scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that&rsquos shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions - we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook. Inviting applications for the role of Senior Principal Consultant- Senior Data Engineer - Databricks, Azure & Mosaic AI Role Summary: We are seeking a Senior Data Engineer with extensive expertise in Data & Analytics platform modernization using Databricks, Azure, and Mosaic AI. This role will focus on designing and optimizing cloud-based data architectures, leveraging AI-driven automation to enhance data pipelines, governance, and processing at scale. Key Responsibilities: . Architect & modernize Data & Analytics platforms using Databricks on Azure. . Design and optimize Lakehouse architectures integrating Azure Data Lake, Databricks Delta Lake, and Synapse Analytics. . Implement Mosaic AI for AI-driven automation, predictive analytics, and intelligent data engineering solutions. . Lead the migration of legacy data platforms to a modern cloud-native Data & AI ecosystem. . Develop high-performance ETL pipelines, integrating Databricks with Azure services such as Data Factory, Synapse, and Purview. . Utilize MLflow & Mosaic AI for AI-enhanced data processing and decision-making. . Establish data governance, security, lineage tracking, and metadata management across modern data platforms. . Work collaboratively with business leaders, data scientists, and engineers to drive innovation. . Stay at the forefront of emerging trends in AI-powered data engineering and modernization strategies. Qualifications we seek in you! Minimum Qualifications . experience in Data Engineering, Cloud Platforms, and AI-driven automation. . Expertise in Databricks (Apache Spark, Delta Lake, MLflow) and Azure (Data Lake, Synapse, ADF, Purview). . Strong experience with Mosaic AI for AI-powered data engineering and automation. . Advanced proficiency in SQL, Python, and Scala for big data processing. . Experience in modernizing Data & Analytics platforms, migrating from on-prem to cloud. . Knowledge of Data Lineage, Observability, and AI-driven Data Governance frameworks. . Familiarity with Vector Databases & Retrieval-Augmented Generation (RAG) architectures for AI-powered data analytics. . Strong leadership, problem-solving, and stakeholder management skills. Preferred Skills: . Experience with Knowledge Graphs (Neo4J, TigerGraph) for data structuring. . Exposure to Kubernetes, Terraform, and CI/CD for scalable cloud deployments. . Background in streaming technologies (Kafka, Spark Streaming, Kinesis). Why join Genpact . Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation . Make an impact - Drive change for global enterprises and solve business challenges that matter . Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities . Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day . Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let&rsquos build tomorrow together. Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a %27starter kit,%27 paying to apply, or purchasing equipment or training.

Posted 1 month ago

Apply

3.0 - 7.0 years

9 - 19 Lacs

Kolkata, Pune, Chennai

Work from Office

The JD for AI/Ml programmer is given below: Key Responsibilities: Design, develop, and deploy Generative AI models using state-of-the-art architectures (e.g., Transformers, Diffusion models). Build and fine-tune LLM-powered agents capable of multi-step reasoning, task planning, and tool use. Work with frameworks like LangChain, AutoGPT, BabyAGI, CrewAI , or similar agent orchestration tools. Integrate models with REST APIs, vector databases (e.g., Pinecone, FAISS, Chroma), and external systems. Optimize inference pipelines for performance, latency, and scalability. Collaborate with product managers and data scientists to prototype and productionize AI features. Stay updated on recent advancements in Generative AI and autonomous agents. Required Qualifications: 34 years of hands-on experience in Machine Learning / Deep Learning , with at least 1–2 years in Generative AI and/or AI Agents . Proficiency in Python and ML libraries such as PyTorch , TensorFlow , Transformers (Hugging Face) . Experience with LLM APIs (OpenAI, Claude, Mistral, etc.) and building LLM-based applications. Solid understanding of prompt engineering , fine-tuning , RAG (Retrieval-Augmented Generation) , and multi-modal learning . Familiarity with agent orchestration frameworks and LLM tool chaining . Strong problem-solving and communication skills. Preferred Qualifications: Experience with cloud platforms (AWS, GCP, Azure) and MLOps tools (MLflow, Weights & Biases). Knowledge of Reinforcement Learning or Meta-learning for agent training. Experience contributing to open-source projects or published papers in the field of AI.

Posted 1 month ago

Apply

7 - 12 years

0 - 0 Lacs

Mumbai, Pune, Bengaluru

Hybrid

Senior Software Engineer/ LLM Ops Engineer External Description Description - External JD - What You Will Do Design, implement, and maintain LLM operations workflows using tools like Langfuse to monitor performance, track usage, and create feedback loops for continuous improvement Develop and maintain infrastructure-as-code for AI deployments using Terraform and AWS services (Lambda, SQS, API Gateway, OpenSearch, CloudWatch) Build and enhance monitoring, logging, and alerting systems to ensure optimal performance and reliability of our LLM infrastructure Collaborate with AI engineers to design and implement evaluation frameworks (including LLM-as-judge systems) to measure and improve model performance Manage prompt versioning, testing, and deployment pipelines through CI/CD and custom tooling Implement and maintain security guardrails for LLM interactions, ensuring compliance with best practices Create comprehensive documentation for LLM operations, including runbooks for production incidents Participate in on-call rotations to support mission-critical AI systems Drive innovation in LLM operations by researching and implementing best practices and emerging tools in the rapidly evolving GenAI space Deep understanding of prompt engineering strategies What You Will Bring To succeed in this role, you will need a combination of experience, technology skills, personal qualities, and education. Required Qualifications 3+ years of experience in DevOps, SRE, or similar roles, with at least 1 year specifically working with LLMs or AI systems in production Strong hands-on experience with AWS cloud services, particularly Bedrock, Lambda, SQS, API Gateway, OpenSearch, and CloudWatch Experience with infrastructure-as-code using Terraform, CloudFormation, or similar tools Proficiency in Python and experience building automation tooling and pipelines Familiarity with LangOps platforms such as Langfuse for LLM observability and evaluation Experience with CI/CD pipelines Knowledge of logging, monitoring, and alerting systems Understanding of security best practices for AI systems, including prompt injection mitigation techniques Excellent troubleshooting and problem-solving skills Strong communication skills and ability to work effectively with cross-functional teams Must be legally entitled to work in the country where the role is located Preferred Qualifications Experience with prompt engineering and testing tools like Promptfoo Familiarity with vector databases and retrieval-augmented generation (RAG) systems Knowledge of serverless architectures and event-driven systems Experience with AWS Guardrails for LLM security Background in data engineering or machine learning operations Understanding of financial systems and data security requirements in the finance industry Familiarity with implementing technical solutions to meet compliance requirements outlined in SOC2, ISAE 3402, and ISO 27001

Posted 2 months ago

Apply

5 - 10 years

25 - 30 Lacs

Mumbai, Navi Mumbai, Chennai

Work from Office

We are looking for an AI Engineer (Senior Software Engineer). Interested candidates email me resumes on mayura.joshi@lionbridge.com OR WhatsApp on 9987538863 Responsibilities: Design, develop, and optimize AI solutions using LLMs (e.g., GPT-4, LLaMA, Falcon) and RAG frameworks. Implement and fine-tune models to improve response relevance and contextual accuracy. Develop pipelines for data retrieval, indexing, and augmentation to improve knowledge grounding. Work with vector databases (e.g., Pinecone, FAISS, Weaviate) to enhance retrieval capabilities. Integrate AI models with enterprise applications and APIs. Optimize model inference for performance and scalability. Collaborate with data scientists, ML engineers, and software developers to align AI models with business objectives. Ensure ethical AI implementation, addressing bias, explainability, and data security. Stay updated with the latest advancements in generative AI, deep learning, and RAG techniques. Requirements: 8+ years experience in software development according to development standards. Strong experience in training and deploying LLMs using frameworks like Hugging Face Transformers, OpenAI API, or LangChain. Proficiency in Retrieval-Augmented Generation (RAG) techniques and vector search methodologies. Hands-on experience with vector databases such as FAISS, Pinecone, ChromaDB, or Weaviate. Solid understanding of NLP, deep learning, and transformer architectures. Proficiency in Python and ML libraries (TensorFlow, PyTorch, LangChain, etc.). Experience with cloud platforms (AWS, GCP, Azure) and MLOps workflows. Familiarity with containerization (Docker, Kubernetes) for scalable AI deployments. Strong problem-solving and debugging skills. Excellent communication and teamwork abilities Bachelors or Masters degree in computer science, AI, Machine Learning, or a related field.

Posted 2 months ago

Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies