Mistral Jobs – Apply to Latest Mistral Job Vacancies

AI/ML Expert - LLM Training for Retail Operations & Customer Analytics MostEdge

5.0 years

10 - 27 Lacs

India

On-site

About MostEdge

At MostEdge, our purpose is clear: Accelerate commerce and build sustainable, trusted experiences. With every byte of data, we strive to Protect Every Penny. Power Every Possibility. We empower retailers to make real-time, profitable decisions using cutting-edge AI, smart infrastructure, and operational excellence.

Our platforms handle:
• hundreds of thousands of sales transactions/hour
• hundreds of vendor purchase invoices/hour
• a few hundred product updates/day

With systems built for 99.99999% uptime.

We are building an AI-native commerce engine, and language models are at the heart of this transformation.

Role Overview

We are looking for an AI/ML Expert with deep experience in training and deploying Large Language Models (LLMs) to power MostEdge's next-generation operations, cost intelligence, and customer analytics platform. You will be responsible for fine-tuning domain-specific models using internal structured and unstructured data (product catalogs, invoices, chats, documents), embedding real-time knowledge through RAG pipelines, and enabling AI-powered interfaces that drive search, reporting, insight generation, and operational recommendations.

Scope & Accountability

What You Will Own
• Fine-tune and deploy LLMs for product, vendor, and shopper-facing use cases.
• Design hybrid retrieval-augmented generation (RAG) pipelines with LangChain, FastAPI, and vector DBs (e.g., FAISS, Weaviate, Qdrant).
• Train models on internal datasets (sales, cost, product specs, invoices, support logs) using supervised fine-tuning and LoRA/QLoRA techniques.
• Orchestrate embedding pipelines, prompt tuning, and model evaluation across customer and field operations use cases.
• Deploy LLMs efficiently on RunPod, AWS, or GCP, optimizing for multi-GPU, low-latency inference.
• Collaborate with engineering and product teams to embed model outputs in dashboards, chat UIs, and retail systems.

What Success Looks Like
• 90%+ accuracy on retrieval and reasoning tasks for product/vendor cost and invoice queries.
• <3s inference time across operational prompts, running on GPU-optimized containers.
• Full integration of LLMs with backend APIs, sales dashboards, and product portals.
• 75% reduction in manual effort across selected operational workflows.

Skills & Experience

Must-Have
• 5+ years in AI/ML, with 2+ years working on LLMs or transformer architectures.
• Proven experience training or fine-tuning Mistral, LLaMA, Falcon, or similar open-source LLMs.
• Strong command over LoRA, QLoRA, PEFT, RAG, embeddings, and quantized inference.
• Familiarity with LangChain, HuggingFace Transformers, FAISS/Qdrant, and FastAPI for LLM orchestration.
• Experience deploying models on RunPod, AWS, or GCP using Docker + Kubernetes.
• Proficient in Python, PyTorch, and data preprocessing (structured and unstructured).
• Experience with ETL pipelines, multi-modal data, and real-time data integration.

Nice-to-Have
• Experience with retail, inventory, or customer analytics systems.
• Knowledge of semantic search, OCR post-processing, or auto-tagging pipelines.
• Exposure to multi-tenant environments and secure model isolation for enterprise use.

How You Reflect Our Values
• Lead with Purpose: You empower smarter decisions with AI-first operations.
• Build Trust: You make model behavior explainable, dependable, and fair.
• Own the Outcome: You train and optimize end-to-end pipelines from data to insights.
• Win Together: You partner across engineering, ops, and customer success teams.
• Keep It Simple: You design intuitive models, prompts, and outputs that drive action, not confusion.

Why Join MostEdge?
• Shape how AI transforms commerce and operations at scale.
• Be part of a mission-critical, high-velocity, AI-first company.
• Build LLMs with purpose, connecting frontline data to real-time results.

Job Types: Full-time, Permanent
Pay: ₹1,068,726.69 - ₹2,729,919.70 per year
Benefits: Health insurance, Life insurance, Paid sick time, Paid time off, Provident Fund
Schedule: Evening shift, Morning shift, US shift
Supplemental Pay: Performance bonus, Yearly bonus
Work Location: In person
Expected Start Date: 15/07/2025

Posted 2 hours ago


Senior Machine Learning Engineer Level AI

3.0 years

1 - 6 Lacs

Noida

On-site

Level AI was founded in 2019 and is a Series C startup headquartered in Mountain View, California. Level AI revolutionizes customer engagement by transforming contact centers into strategic assets. Our AI-native platform leverages advanced technologies such as Large Language Models to extract deep insights from customer interactions. By providing actionable intelligence, Level AI empowers organizations to enhance customer experience and drive growth. Consistently updated with the latest AI innovations, Level AI stands as the most adaptive and forward-thinking solution in the industry.

Empowering contact center stakeholders with real-time insights, our tech facilitates data-driven decision-making for contact centers, enhancing service levels and agent performance. As a vital team member, you will work with cutting-edge technologies and play a high-impact role in shaping the future of AI-driven enterprise applications. You will work directly with people who have worked at Amazon, Facebook, Google, and other technology companies around the world. With Level AI, you will get to have fun, learn new things, and grow along with us. Ready to redefine possibilities? Join us!

We'd love to learn more about you if you have:
• Qualification: B.E/B.Tech/M.E/M.Tech/PhD in computer science or mathematics-related fields from a tier 1 engineering institute, with relevant work experience at a top technology company and 3-5 years of experience in machine learning and NLP.
• Knowledge and practical experience in solving NLP problems in areas such as text classification, entity tagging, information retrieval, question-answering, natural language generation, clustering, etc.
• 3+ years of experience working with LLMs in large-scale environments.
• Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs.
• Knowledge and hands-on experience with Transformer-based Language Models like BERT, DeBERTa, Flan-T5, Mistral, Llama, etc.
• Deep familiarity with the internals of at least a few Machine Learning algorithms and concepts.
• Experience with Deep Learning frameworks like PyTorch and common machine learning libraries like scikit-learn, numpy, pandas, NLTK, etc.
• Experience with ML model deployments using REST API, Docker, Kubernetes, etc.
• Knowledge of cloud platforms (AWS/Azure/GCP) and their machine learning services is desirable.
• Knowledge of basic data structures and algorithms.
• Knowledge of real-time streaming tools/architectures like Kafka, Pub/Sub is a plus.

Your role at Level AI includes but is not limited to:
• Big picture: Understand customers’ needs, innovate and use cutting-edge Deep Learning techniques to build data-driven solutions.
• Work on NLP problems across areas such as text classification, entity extraction, summarization, generative AI, and others.
• Collaborate with cross-functional teams to integrate/upgrade AI solutions into the company’s products and services.
• Optimize existing deep learning models for performance, scalability, and efficiency.
• Build, deploy, and own scalable production NLP pipelines.
• Build post-deployment monitoring and continual learning capabilities.
• Propose suitable evaluation metrics and establish benchmarks.
• Keep abreast of SOTA techniques in your area and exchange knowledge with colleagues.
• Desire to learn, implement and work with the latest emerging model architectures, training and inference techniques, data curation pipelines, etc.

To learn more visit: https://thelevel.ai/
Funding: https://www.crunchbase.com/organization/level-ai
LinkedIn: https://www.linkedin.com/company/level-ai/

Posted 2 hours ago


Data Engineer(5) Ericsson

10.0 years

6 - 8 Lacs

Calcutta

On-site

Join our Team

About this opportunity:

We are seeking a highly skilled, hands-on AI Architect - GenAI to lead the design and implementation of production-grade, cloud-native AI and NLP solutions that drive business value and enhance decision-making processes. The ideal candidate will have a robust background in machine learning, generative AI, and the architecture of scalable production systems. As an AI Architect, you will play a key role in shaping the direction of advanced AI technologies and leading teams in the development of cutting-edge solutions.

What you will do:
• Architect and design AI and NLP solutions to address complex business challenges and support strategic decision-making.
• Lead the design and development of scalable machine learning models and applications using Python, Spark, NoSQL databases, and other advanced technologies.
• Spearhead the integration of Generative AI techniques in production systems to deliver innovative solutions such as chatbots, automated document generation, and workflow optimization.
• Guide teams in conducting comprehensive data analysis and exploration to extract actionable insights from large datasets, ensuring these findings are communicated effectively to stakeholders.
• Collaborate with cross-functional teams, including software engineers and data engineers, to integrate AI models into production environments, ensuring scalability, reliability, and performance.
• Stay at the forefront of advancements in AI, NLP, and Generative AI, incorporating emerging methodologies into existing models and developing new algorithms to solve complex challenges.
• Provide thought leadership on best practices for AI model architecture, deployment, and continuous optimization.
• Ensure that AI solutions are built with scalability, reliability, and compliance in mind.

The skills you bring:
• 10+ years of experience in AI, machine learning, or a similar role, with a proven track record of delivering AI-driven solutions.
• Hands-on experience in designing and implementing end-to-end GenAI-based solutions, particularly in chatbots, document generation, workflow automation, and other generative use cases.
• Expertise in Python programming and extensive experience with AI frameworks and libraries such as TensorFlow, PyTorch, scikit-learn, and vector databases.
• Deep understanding and experience with distributed data processing using Spark.
• Proven experience in architecting, deploying, and optimizing machine learning models in production environments at scale.
• Expertise in working with open-source Generative AI models (e.g., GPT-4, Mistral, Code-Llama, StarCoder) and applying them to real-world use cases.
• Expertise in designing cloud-native architectures and microservices for AI/ML applications.

Why join Ericsson?

At Ericsson, you'll have an outstanding opportunity. The chance to use your skills and imagination to push the boundaries of what's possible. To build solutions never seen before to some of the world’s toughest problems. You'll be challenged, but you won’t be alone. You'll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next.

What happens once you apply?

Click Here to find all you need to know about what our typical hiring process looks like.

Encouraging a diverse and inclusive organization is core to our values at Ericsson; that's why we champion it in everything we do. We truly believe that by collaborating with people with different experiences we drive innovation, which is essential for our future growth. We encourage people from all backgrounds to apply and realize their full potential as part of our Ericsson team. Ericsson is proud to be an Equal Opportunity Employer.

Primary country and city: India (IN) || Kolkata
Req ID: 763161

Posted 2 hours ago


Solutions Architect AI NTT DATA, Inc.

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Make an impact with NTT DATA

Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.

Your day at NTT DATA

Seeking a talented Solution Architect/BDM for On-Prem/Private AI. Requires deep open source LLM expertise to translate client needs into technical solutions. Responsibilities include assessing needs, recommending LLM tech, sizing opportunities and infrastructure, and collaborating on end-to-end solutions with costing. Needs strategic thinking, strong technical and business skills to drive innovation and client value.

What You'll Be Doing

Key Roles and Responsibilities:

Solution Architecture & Technical Leadership
• Demonstrate deep expertise in LLMs such as Phi-4, Mistral, Gemma, Llama and other foundation models
• Assess client business requirements and translate them into detailed technical specifications
• Recommend appropriate LLM solutions based on specific business outcomes and use cases
• Experience in sizing and architecting infrastructure for AI/ML workloads, particularly GPU-based systems
• Design scalable and secure On-Prem/Private AI architectures
• Create technical POCs and prototypes to demonstrate solution capabilities
• Hands-on experience with vector databases (open-source or proprietary), such as Weaviate, Milvus, or Vald
• Expertise in fine-tuning, query caching, and optimizing vector embeddings for efficient similarity searches

Business Development
• Size and qualify opportunities in the On-Prem/Private AI space
• Develop compelling proposals and solution presentations for clients
• Build and nurture client relationships at technical and executive levels
• Collaborate with sales teams to create competitive go-to-market strategies
• Identify new business opportunities through technical consultation

Project & Delivery Leadership
• Work with delivery teams to develop end-to-end solution approaches and accurate costing
• Lead technical discovery sessions with clients
• Guide implementation teams during solution delivery
• Ensure technical solutions meet client requirements and business outcomes
• Develop reusable solution components and frameworks to accelerate delivery

AI Agent Development
• Design, develop, and deploy AI-powered applications leveraging agentic AI frameworks such as LangChain, AutoGen, and CrewAI.
• Utilize the modular components of these frameworks (LLMs, Prompt Templates, Agents, Memory, Retrieval, Tools) to build sophisticated language model systems and multi-agent workflows.
• Implement Retrieval Augmented Generation (RAG) pipelines and other advanced techniques using these frameworks to enhance LLM responses with external data.
• Contribute to the development of reusable components and best practices for agentic AI implementations.

Knowledge, Skills, and Attributes:

Basic Qualifications:
• 8+ years of experience in solution architecture or technical consulting roles
• 3+ years of specialized experience working with LLMs and Private AI solutions
• Demonstrated expertise with models such as Phi-4, Mistral, Gemma, and other foundation models
• Strong understanding of GPU infrastructure sizing and optimization for AI workloads
• Proven experience converting business requirements into technical specifications
• Experience working with delivery teams to create end-to-end solutions with accurate costing
• Strong understanding of agentic AI systems and orchestration frameworks
• Bachelor’s degree in computer science, AI, or related field
• Ability to travel up to 25%

Preferred Qualifications:
• Master's degree or PhD in Computer Science or related technical field
• Experience with Private AI deployment and fine-tuning LLMs for specific use cases
• Knowledge of RAG (Retrieval Augmented Generation) and enterprise knowledge systems
• Hands-on experience with prompt engineering and LLM optimization techniques
• Understanding of AI governance, security, and compliance requirements
• Experience with major AI providers: OpenAI/Azure OpenAI, AWS, Google, Anthropic, etc.
• Prior experience in business development or pre-sales for AI solutions
• Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders
• Strong problem-solving abilities and analytical mindset

Location: Delhi or Bangalore
Workplace type: Hybrid Working

About NTT DATA

NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.

Equal Opportunity Employer

NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.

Posted 3 hours ago


Solutions Architect AI NTT DATA, Inc.

8.0 years

0 Lacs

Delhi Cantonment, Delhi, India

On-site

Make an impact with NTT DATA

Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.

Your day at NTT DATA

Seeking a talented Solution Architect/BDM for On-Prem/Private AI. Requires deep open source LLM expertise to translate client needs into technical solutions. Responsibilities include assessing needs, recommending LLM tech, sizing opportunities and infrastructure, and collaborating on end-to-end solutions with costing. Needs strategic thinking, strong technical and business skills to drive innovation and client value.

What You'll Be Doing

Key Roles and Responsibilities:

Solution Architecture & Technical Leadership
• Demonstrate deep expertise in LLMs such as Phi-4, Mistral, Gemma, Llama and other foundation models
• Assess client business requirements and translate them into detailed technical specifications
• Recommend appropriate LLM solutions based on specific business outcomes and use cases
• Experience in sizing and architecting infrastructure for AI/ML workloads, particularly GPU-based systems
• Design scalable and secure On-Prem/Private AI architectures
• Create technical POCs and prototypes to demonstrate solution capabilities
• Hands-on experience with vector databases (open-source or proprietary), such as Weaviate, Milvus, or Vald
• Expertise in fine-tuning, query caching, and optimizing vector embeddings for efficient similarity searches

Business Development
• Size and qualify opportunities in the On-Prem/Private AI space
• Develop compelling proposals and solution presentations for clients
• Build and nurture client relationships at technical and executive levels
• Collaborate with sales teams to create competitive go-to-market strategies
• Identify new business opportunities through technical consultation

Project & Delivery Leadership
• Work with delivery teams to develop end-to-end solution approaches and accurate costing
• Lead technical discovery sessions with clients
• Guide implementation teams during solution delivery
• Ensure technical solutions meet client requirements and business outcomes
• Develop reusable solution components and frameworks to accelerate delivery

AI Agent Development
• Design, develop, and deploy AI-powered applications leveraging agentic AI frameworks such as LangChain, AutoGen, and CrewAI.
• Utilize the modular components of these frameworks (LLMs, Prompt Templates, Agents, Memory, Retrieval, Tools) to build sophisticated language model systems and multi-agent workflows.
• Implement Retrieval Augmented Generation (RAG) pipelines and other advanced techniques using these frameworks to enhance LLM responses with external data.
• Contribute to the development of reusable components and best practices for agentic AI implementations.

Knowledge, Skills, and Attributes:

Basic Qualifications:
• 8+ years of experience in solution architecture or technical consulting roles
• 3+ years of specialized experience working with LLMs and Private AI solutions
• Demonstrated expertise with models such as Phi-4, Mistral, Gemma, and other foundation models
• Strong understanding of GPU infrastructure sizing and optimization for AI workloads
• Proven experience converting business requirements into technical specifications
• Experience working with delivery teams to create end-to-end solutions with accurate costing
• Strong understanding of agentic AI systems and orchestration frameworks
• Bachelor’s degree in computer science, AI, or related field
• Ability to travel up to 25%

Preferred Qualifications:
• Master's degree or PhD in Computer Science or related technical field
• Experience with Private AI deployment and fine-tuning LLMs for specific use cases
• Knowledge of RAG (Retrieval Augmented Generation) and enterprise knowledge systems
• Hands-on experience with prompt engineering and LLM optimization techniques
• Understanding of AI governance, security, and compliance requirements
• Experience with major AI providers: OpenAI/Azure OpenAI, AWS, Google, Anthropic, etc.
• Prior experience in business development or pre-sales for AI solutions
• Excellent verbal and written communication skills, with the ability to explain complex technical concepts to non-technical stakeholders
• Strong problem-solving abilities and analytical mindset

Location: Delhi or Bangalore
Workplace type: Hybrid Working

About NTT DATA

NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.

Equal Opportunity Employer

NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.

Posted 3 hours ago


Agentic AI Engineer FriskaAi

2.0 years

0 Lacs

Kerala, India

On-site

About FriskaAi:

FriskaAi is an intelligent health management platform, revolutionizing preventive care, chronic disease management, and population health through AI-driven insights. We are part of HFWL Company, committed to transforming healthcare through innovation, compassion, and cutting-edge technology. As we continue to expand, we are building a team of forward-thinking engineers passionate about the future of autonomous, agent-driven AI. If you're excited about designing systems that think, reason, and act independently in complex healthcare environments, we want you.

Role Overview:

We are looking for an experienced Agentic AI Engineer to design, build, and deploy AI agents that operate with autonomy, solve complex healthcare-related tasks, and collaborate across systems and datasets with minimal human intervention. You will work closely with our data science, product, and engineering teams to integrate agentic behavior into FriskaAi’s platform and services.

Key Responsibilities:
• Design and build intelligent AI agents capable of independent decision-making, task planning, and adaptive learning in healthcare scenarios.
• Develop frameworks for multi-agent collaboration and goal-directed workflows within FriskaAi’s systems.
• Implement memory, reasoning, and planning capabilities into agentic modules.
• Integrate agents with FriskaAi’s data pipelines, patient management systems, and healthcare APIs.
• Research and apply advanced techniques such as tool-use, self-reflection, long-term memory, task decomposition, and recursive improvement.
• Optimize for explainability, ethics, and compliance in autonomous agent behavior.
• Collaborate with cross-functional teams to prototype, iterate, and productionize agentic solutions.
• Stay current with the latest advances in LLMs, autonomous systems, and healthcare AI regulations.

Requirements:
• Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field. PhD preferred.
• 2+ years working on agentic or autonomous AI systems.
• Strong expertise with LLMs (e.g., GPT, Claude, Mistral) and frameworks like LangChain, AutoGen, CrewAI, or similar.
• Deep understanding of planning algorithms, decision-making models, and agentic architectures.
• Hands-on experience with Python, TensorFlow, PyTorch, or other relevant AI/ML libraries.
• Familiarity with healthcare data standards (e.g., HL7, FHIR, HIPAA compliance) is a strong plus.
• Excellent problem-solving skills and the ability to work independently in a fast-paced startup environment.
• Strong written and verbal communication skills.

Nice to Have:
• Experience developing multi-agent systems (MAS).
• Knowledge of health informatics, clinical decision support systems (CDSS), or population health AI.
• Contributions to open-source agentic AI projects.
• Familiarity with neurosymbolic AI or cognitive architectures.

Apply now and join us in shaping the future of personalized health!

Posted 4 hours ago


Generative AI Engineer Acesoft Labs

3.0 - 6.0 years

10 - 13 Lacs

Bengaluru

Hybrid

Hi all,

We are hiring for the role of Generative AI Engineer.

Experience: 3 - 6 Years
Location: Bangalore
Notice Period: Immediate - 15 Days
Skills: Generative AI Engineer

Position Overview:

We are looking for a Generative AI Engineer with expertise in Azure OpenAI and hands-on experience with models such as GPT-4o and GPT-o1, and with open-source LLMs like Llama and Mistral. You will work on GenAI solution development, RAG, fine-tuning, and deploying resources in an Azure environment. Proficiency in prompt engineering, Python, PostgreSQL, FastAPI, Streamlit, Django and Angular is essential. This role also requires strong skills in AI model orchestration using intent mapping, Semantic Kernel or function calling, along with proficiency in presentation and public speaking.

Key Responsibilities:
• Implement RAG on, fine-tune, and deploy Azure OpenAI models (e.g., GPT-4o, GPT-o1) and open-source large language models (LLMs).
• Build AI-powered applications using frameworks such as FastAPI, Streamlit, Django, and Angular.
• Design and execute AI workflows using tools like prompt flow and Semantic Kernel, and implement function calling for complex use cases.
• Conduct prompt engineering to improve model performance for specific business cases.
• Visualize data and create user interaction insights using Power BI.
• Ensure smooth deployment and maintenance of models on Azure cloud infrastructure, including scalability and optimization.
• Prepare and deliver presentations, demos, and technical documentation to internal and external stakeholders.
• Stay updated with advancements in generative AI, NLP, and machine learning to continuously improve models and methodologies.

Required Skills & Qualifications:
• Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
• 2+ years of hands-on experience working on generative AI projects.
• Strong expertise in Azure OpenAI models (GPT-4o, GPT-3.5, GPT-o1, etc.).
• Proficient in Prompt Engineering, Python, Streamlit, Django, FastAPI, and Angular.
• Basics of HTML, CSS, JavaScript, TypeScript, and Angular.
• Basic understanding of neural networks, machine learning, and transformer architectures.
• Experience in retrieval-augmented generation (RAG) and fine-tuning large language models.
• Familiarity with AI model orchestration tools such as Semantic Kernel, intent mapping and function calling techniques.
• Excellent public speaking and presentation skills to convey technical concepts to business stakeholders.
• Azure Certified AZ900 or AI900

Preferred Qualifications:
• Master's degree in Artificial Intelligence, Machine Learning, or a related field.
• 3+ years of experience working on generative AI, NLP, and machine learning projects.
• Strong understanding of neural networks, machine learning and transformer architectures.
• Implemented GenAI solutions in production.
• Familiarity with the automotive industry
• Hands-on experience in RAG, RAFT and optimized fine-tuning.
• Azure Certified AI-102, DP-100, AZ-204 or DP-203

If you are interested, drop your resume at mojesh.p@acesoftlabs.com
Call: 9701971793

Posted 5 hours ago


Job Requirement: AI Engineer (LLM, Multi-Agent Systems & Backend Specialist) Alpixn Technologies Private Limited

2.0 - 3.0 years

0 Lacs

Gurugram, Haryana, India

Remote

Position Type: Part-time / Contract / Remote
Location: Open
Experience Required: 2-3 years (or equivalent project experience)

Key Responsibilities:
• Work with Large Language Models (LLMs), both open-source (like LLaMA, Mistral, GPT-NeoX) and API-based (like OpenAI, Anthropic)
• Develop and manage multi-agent system architectures for AI-driven applications
• Conduct comparative evaluation of LLMs based on quality, speed, cost, and reliability
• Design, test, and optimize prompts and context management strategies, and resolve common issues like hallucinations and irrelevant outputs
• Understand the basics of model fine-tuning and customizing pre-trained models for domain-specific use-cases
• Build and integrate backend systems using Node.js, RESTful/GraphQL APIs, and database management (SQL / NoSQL)
• Deploy applications on cloud platforms like Firebase, AWS, or Azure, and manage resources effectively
• Implement and manage agentic AI systems, A2A workflows (agent-to-agent), and RAG (Retrieval-Augmented Generation) pipelines
• Handle hosting, scaling, and deployment of websites/web apps and maintain performance during high traffic loads
• Optimize infrastructure for cost-effectiveness and high availability
• Work with frameworks like LangChain, AutoGen, or equivalent LLM orchestration tools
• Lead or collaborate on AI-powered product development from concept to deployment
• Balance between rapid prototyping and building production-grade, reliable AI applications

Preferred Skills:
• Strong understanding of LLM evaluation frameworks and metrics
• Familiarity with LangChain, AutoGen, Haystack, or similar AI agent management libraries
• Working knowledge of AI deployment best practices
• Basic knowledge of Docker, Kubernetes, and scalable hosting setups
• Experience in managing cross-functional teams or AI development interns is a plus

Bonus Advantage:
• Prior experience working on AI-powered SaaS platforms
• Contribution to open-source AI projects or AI hackathons
• Familiarity with data privacy, security compliance, and cost management in AI applications

Posted 6 hours ago

Apply

Data Engineer(5) Ericsson-Worldwide

0.0 years

0 Lacs

Kolkata, West Bengal

On-site

Kolkata,West Bengal,India Job ID 763161 Join our Team About this opportunity: We are seeking a highly skilled, hands-on AI Architect - GenAI to lead the design and implementation of production-grade, cloud-native AI and NLP solutions that drive business value and enhance decision-making processes. The ideal candidate will have a robust background in machine learning, generative AI, and the architecture of scalable production systems. As an AI Architect, you will play a key role in shaping the direction of advanced AI technologies and leading teams in the development of cutting-edge solutions. What you will do: Architect and design AI and NLP solutions to address complex business challenges and support strategic decision-making. Lead the design and development of scalable machine learning models and applications using Python, Spark, NoSQL databases, and other advanced technologies. Spearhead the integration of Generative AI techniques in production systems to deliver innovative solutions such as chatbots, automated document generation, and workflow optimization. Guide teams in conducting comprehensive data analysis and exploration to extract actionable insights from large datasets, ensuring these findings are communicated effectively to stakeholders. Collaborate with cross-functional teams, including software engineers and data engineers, to integrate AI models into production environments, ensuring scalability, reliability, and performance. Stay at the forefront of advancements in AI, NLP, and Generative AI, incorporating emerging methodologies into existing models and developing new algorithms to solve complex challenges. Provide thought leadership on best practices for AI model architecture, deployment, and continuous optimization. Ensure that AI solutions are built with scalability, reliability, and compliance in mind. 
The skills you bring: Minimum of 10+ years of experience in AI, machine learning, or a similar role, with a proven track record of delivering AI-driven solutions. Hands-on experience in designing and implementing end-to-end GenAI-based solutions, particularly in chatbots, document generation, workflow automation, and other generative use cases. Expertise in Python programming and extensive experience with AI frameworks and libraries such as TensorFlow, PyTorch, scikit-learn, and vector databases. Deep understanding and experience with distributed data processing using Spark. Proven experience in architecting, deploying, and optimizing machine learning models in production environments at scale. Expertise in working with open-source Generative AI models (e.g., GPT-4, Mistral, Code-Llama, StarCoder) and applying them to real-world use cases. Expertise in designing cloud-native architectures and microservices for AI/ML applications. Why join Ericsson? At Ericsson, you’ll have an outstanding opportunity. The chance to use your skills and imagination to push the boundaries of what’s possible. To build solutions never seen before to some of the world’s toughest problems. You’ll be challenged, but you won’t be alone. You’ll be joining a team of diverse innovators, all driven to go beyond the status quo to craft what comes next. What happens once you apply?

Posted 10 hours ago

Apply

CogniCor Technologies - Senior Engineer - Artificial Intelligence/Machine Learning CogniCor Technologies

3.0 - 7.0 years

0 Lacs

Kochi, Kerala, India

On-site

Role: AI/ML Senior Engineer. Job Description: We are looking for an AI/ML Senior Engineer with expertise in Natural Language Processing (NLP), Generative AI, and Machine Learning (ML) to be a part of our AI Engineering team. The ideal candidate will work on cutting-edge AI models, including Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Agentic AI Systems, to develop and deploy AI-driven solutions. Roles & Responsibilities: Develop and optimize AI models leveraging LLMs, RAG, and Agentic Systems for real-world applications. Fine-tune and optimize LLMs for domain-specific tasks, ensuring scalability and efficiency. Design and implement Retrieval-Augmented Generation (RAG) pipelines to enhance information retrieval and reasoning. Apply Prompt Engineering techniques to refine and optimize LLM outputs. Integrate and deploy AI models into production environments, ensuring high performance and reliability. Support and maintain AI-powered client-facing applications, continuously improving model effectiveness. Stay updated with advancements in NLP, Generative AI, and Machine Learning to enhance AI capabilities. Must-Have Competencies: Strong Python programming skills, with experience in AI/ML frameworks (PyTorch, TensorFlow, Hugging Face, Haystack, LangChain, etc.). Good understanding of LLMs and Generative AI, including OpenAI, Anthropic, Mistral, Llama, etc. Experience with Retrieval-Augmented Generation (RAG) for improving AI-driven search and reasoning. Proficiency in Prompt Engineering to optimize AI-generated responses. Understanding of Agentic AI Systems and multi-agent workflows. Strong foundation in ML & NLP, including transformer models, embeddings, and vector search. Mathematical & Analytical skills for designing AI solutions. Ability to work effectively in an AI team and contribute to AI research and development. Qualifications: BTech/MTech/MSc in Computer Science, AI, Computational Linguistics, or a related field. 
3 to 7 years of experience in NLP, Machine Learning, or Generative AI projects. (ref:hirist.tech)

Posted 13 hours ago

Apply

Techvantage Analytics - Senior Artificial Intelligence Engineer techvantage.ai

3.0 years

0 Lacs

Trivandrum, Kerala, India

On-site

We are looking for a Senior AI Engineer with 3+ years of hands-on experience in Artificial Intelligence/ML and a passion for innovation. This role is ideal for someone who thrives in a startup environment: fast-paced, product-driven, and full of opportunities to make a real impact. You will contribute to building intelligent, scalable, and production-grade AI systems, with a strong focus on Generative AI and Agentic AI technologies. Roles And Responsibilities: Build and deploy AI-driven applications and services, focusing on Generative AI and Large Language Models (LLMs). Design and implement Agentic AI systems: autonomous agents capable of planning and executing multi-step tasks. Collaborate with cross-functional teams including product, design, and engineering to integrate AI capabilities into products. Write clean, scalable code and build robust APIs and services to support AI model deployment. Own feature delivery end-to-end, from research and experimentation to deployment and monitoring. Stay current with emerging AI frameworks, tools, and best practices and apply them in product development. Contribute to a high-performing team culture and mentor junior team members as needed. Skill Set: 3-6 years of overall software development experience, with 3+ years specifically in AI/ML engineering. Strong proficiency in Python, with hands-on experience in PyTorch, TensorFlow, and Transformers (Hugging Face). Proven experience working with LLMs (e.g., GPT, Claude, Mistral) and Generative AI models (text, image, or audio). Practical knowledge of Agentic AI frameworks (e.g., LangChain, AutoGPT, Semantic Kernel). Experience building and deploying ML models to production environments. Familiarity with vector databases (Pinecone, Weaviate, FAISS) and prompt engineering concepts. Comfortable working in a startup-like environment: self-motivated, adaptable, and willing to take ownership. Solid understanding of API development, version control, and modern DevOps/MLOps practices. 
(ref:hirist.tech)

Posted 13 hours ago

Apply

Techvantage Analytics - Generative AI Architect - Python/LLM techvantage.ai

10.0 years

0 Lacs

Trivandrum, Kerala, India

On-site

Our Company: Techvantage.ai is a next-generation technology and product engineering company at the forefront of innovation in Generative AI, Agentic AI, and autonomous intelligent systems. We design and build intelligent, secure, and scalable platforms that are reshaping the future across industries. Role Overview: We are looking for a visionary and hands-on Generative AI Architect with 10+ years of experience in building scalable AI and data platforms, including 3+ years of expertise in Generative AI technologies. The ideal candidate will lead the design, development, and deployment of intelligent systems that integrate large language models (LLMs), multimodal AI, and other cutting-edge GenAI capabilities into real-world applications. This is a strategic and high-impact role that combines deep technical leadership with product innovation to define the future of AI at Techvantage.ai. What we are looking for from an ideal candidate: Architect and lead end-to-end solutions involving LLMs, diffusion models, transformers, and multimodal AI across enterprise and consumer use cases. Design robust and scalable architectures for prompt engineering, model orchestration, embedding pipelines, vector databases, and fine-tuning/instruction tuning. Evaluate and integrate open-source and proprietary models (e.g., OpenAI, Anthropic, Mistral, LLaMA, Claude, Gemini, etc.) based on business requirements. Lead model selection, fine-tuning, RAG (Retrieval-Augmented Generation), prompt optimization, and latency vs. performance tradeoffs. Collaborate with product managers, data scientists, engineers, and clients to design GenAI-powered applications aligned with business goals. Ensure adherence to ethical AI practices, data privacy, model safety, and bias mitigation. Contribute to internal GenAI accelerators, reusable frameworks, and innovation strategies. Stay at the forefront of AI/ML advancements and drive PoCs and pilot implementations across industry verticals. 
What skills do you need? Required Skills & Qualifications: 10+ years of experience in technology roles, including AI/ML architecture, data engineering, or solution architecture. 3+ years of experience in designing and deploying Generative AI solutions. Deep knowledge of transformer architectures, LLMs, diffusion models, and related frameworks (e.g., Hugging Face, LangChain, LlamaIndex, Haystack, etc.). Experience with prompt engineering, RAG pipelines, and vector database technologies (FAISS, Weaviate, Pinecone, Chroma, etc.). Proficiency in Python, Docker, Kubernetes, and cloud platforms (AWS, Azure, or GCP). Strong understanding of MLOps, CI/CD for ML, and model lifecycle management. Excellent communication and stakeholder management skills, with the ability to explain complex AI concepts to non-technical audiences. Preferred Qualifications: Experience deploying GenAI systems in BFSI, healthcare, e-commerce, or enterprise SaaS platforms. Familiarity with agentic AI systems, autoML, RLHF, or autonomous agents. Knowledge of data governance, compliance, and responsible AI frameworks. Contributions to open-source AI projects or publications in AI/ML are a strong plus. Master's or PhD in Computer Science, AI/ML, Data Science, or a related field is preferred. What We Offer: Lead innovation at the edge of Generative AI and intelligent systems. Work with a talented team building future-ready AI infrastructure. Access to state-of-the-art tools, models, and cloud credits. Compensation is not a constraint for the right candidate. (ref:hirist.tech)

Posted 13 hours ago

Apply

Techvantage Analytics - Senior Software Engineer - Artificial Intelligence techvantage.ai

3.0 years

0 Lacs

Trivandrum, Kerala, India

On-site

We are looking for a Senior Software Engineer (AI) with 3+ years of hands-on experience in Artificial Intelligence/ML and a passion for innovation. This role is ideal for someone who thrives in a startup environment: fast-paced, product-driven, and full of opportunities to make a real impact. You will contribute to building intelligent, scalable, and production-grade AI systems, with a strong focus on Generative AI and Agentic AI technologies. Roles And Responsibilities: Build and deploy AI-driven applications and services, focusing on Generative AI and Large Language Models (LLMs). Design and implement Agentic AI systems: autonomous agents capable of planning and executing multi-step tasks. Collaborate with cross-functional teams including product, design, and engineering to integrate AI capabilities into products. Write clean, scalable code and build robust APIs and services to support AI model deployment. Own feature delivery end-to-end, from research and experimentation to deployment and monitoring. Stay current with emerging AI frameworks, tools, and best practices and apply them in product development. Contribute to a high-performing team culture and mentor junior team members as needed. Skill Set: 3 to 6 years of overall software development experience, with 3+ years specifically in AI/ML engineering. Strong proficiency in Python, with hands-on experience in PyTorch, TensorFlow, and Transformers (Hugging Face). Proven experience working with LLMs (e.g., GPT, Claude, Mistral) and Generative AI models (text, image, or audio). Practical knowledge of Agentic AI frameworks (e.g., LangChain, AutoGPT, Semantic Kernel). Experience building and deploying ML models to production environments. Familiarity with vector databases (Pinecone, Weaviate, FAISS) and prompt engineering concepts. Comfortable working in a startup-like environment: self-motivated, adaptable, and willing to take ownership. 
Solid understanding of API development, version control, and modern DevOps/MLOps practices. (ref:hirist.tech)

Posted 13 hours ago

Apply

FarmwiseAI - Senior Generative AI Engineer FarmwiseAI Pvt Ltd

3.0 - 5.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Role: Senior Generative AI Engineer. Location: Chennai (on-site). Exp: 3 - 5 years. FarmwiseAI is a leading Geospatial AI company based in Chennai, specializing in AI-driven agriculture solutions that enable data-driven decision-making for governments, lenders, and businesses. Founded in 2020, we deliver real-time advisory, automated land mapping, and crop-monitoring products at scale to foster sustainable development. As an AI-first organization, we embed AI assistance across our entire product lifecycle, from brainstorming and architecture to testing and deployment, to empower every team member to leverage AI in their day-to-day work. We are hiring a Generative AI Engineer to build, deploy, and optimize multimodal AI services across text, speech, and vision. You'll work on RAG, synthetic data generation, and agent workflows, and integrate STT/TTS/OCR with scalable backend systems. Generative Pipelines: Design applications for RAG, CAG, text classification, summarization, image/video generation, OCR, and synthetic data generation. Multimodal Integration: Work with STT, TTS, IVR, OCR, and vision inputs to enable seamless AI interactions. AI Agent Workflows: Develop modular, multi-step orchestrations for document, conversational, and data-based user journeys. Containerization & Deployment: Collaborate with DevOps to containerize services, manage Kubernetes orchestration, and implement CI/CD for agile delivery. Observability: Instrument services using OpenTelemetry, Prometheus, and logging tools to ensure SLO-driven production reliability. Collaboration: Work cross-functionally with product, data science, and frontend teams to define APIs (REST/GraphQL) and ensure smooth integration. Documentation & Mentorship: Participate in architecture reviews, write clear documentation, and mentor junior engineers and interns. Bachelor's/Master's in Computer Science, Data Science, IT, or related field. 2 - 3 years of experience building AI/ML products in Python. 
Must be proficient in AI-first coding tools like Claude Code, Cursor, Roo Code, etc. Proven experience in deploying GenAI applications and agents in production. Strong hands-on experience with vector search, embedding-based retrieval, STT, TTS, and OCR/vision. Familiarity with Docker, Kubernetes, frontend development, and CI/CD workflows. Strong debugging, performance tuning, and cost-optimization skills. Excellent communication, teamwork, and mentoring abilities. Languages & Tools (mandatory): Python (pandas, scikit-learn, PyTorch, TensorFlow, etc.), Git/GitHub, AWS or GCP. Generative AI stack (mandatory): LangChain, LlamaIndex, transformers, frontier LLMs (OpenAI, Anthropic, Gemini models) and open models (DeepSeek, Qwen, Llama and Phi models). Vector stores: FAISS, Pinecone, Qdrant, Weaviate, etc. Keyword Index: Elasticsearch, Apache Solr, Typesense, etc. Validation frameworks: Pydantic, Instructor, etc. LLM Abstraction libraries: LiteLLM. Asynchronous or parallel programming: asyncio, joblib, etc. API frameworks: FastAPI, Flask, etc. FE prototyping: Streamlit, Gradio, etc. Agentic AI Frameworks (mandatory, 1): Google Agent Development Kit, LangGraph, OpenAI Agents SDK, PydanticAI. Speech & Vision (nice-to-have): OpenAI Realtime Voice API/Whisper; ElevenLabs/Smallest.ai TTS; LlamaParse/JinaAI/Mistral OCR. Observability & Monitoring (nice-to-have): OpenTelemetry, Prometheus, LangSmith, Pydantic Logfire. Cloud & DevOps (nice-to-have): Docker, Kubernetes, GitHub Actions. Domain experience in AgriTech, FinTech, HRTech or EduTech. Experience and profound interest in reading and implementing research papers. Open-source contributions or published evaluation suites. Exposure to managed cloud AI services (Vertex AI, Bedrock, JumpStart). Familiarity with React/Next.js integration. (ref:hirist.tech)

Posted 13 hours ago

Apply

Avenir Digital - Generative AI Specialist/Senior AI Engineer Avenir Digital Inc

5.0 - 8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Job Title: Generative AI Specialist / Senior AI Engineer. Location: NCR (Gurugram) / Chennai, India. Experience: 5-8 years total, with minimum 1 year hands-on in Generative AI. About The Role: We are looking for a passionate and experienced AI professional to join our AI/ML team. As a Generative AI Specialist, you will work on cutting-edge projects that involve building, fine-tuning, and deploying LLMs and generative models for real-world applications in areas such as customer service, content generation, and intelligent automation. Key Responsibilities: Design and implement Generative AI solutions using LLMs (e.g., GPT, LLaMA, Mistral, Claude). Fine-tune and deploy foundation models using frameworks like Hugging Face, LangChain, or custom pipelines. Work closely with product, design, and engineering teams to integrate GenAI solutions into products. Optimize models for performance, scalability, and latency. Develop evaluation frameworks to measure output quality, relevance, and safety. Stay updated with the latest in GenAI, foundation models, and responsible AI practices. Collaborate in building reusable components, frameworks, and services for GenAI use cases. Required Qualifications: 5-8 years of total experience in software engineering / AI / data science. Minimum 1 year of hands-on experience with Generative AI (e.g., prompt engineering, LLM fine-tuning, RAG, embeddings). Strong experience with Python, PyTorch or TensorFlow, and GenAI libraries like Transformers, LangChain, LlamaIndex, or similar. Experience with cloud platforms (AWS, Azure, or GCP) for model deployment and MLOps. Exposure to NLP tasks like summarization, Q&A, chatbots, and semantic search. Familiarity with vector databases (e.g., FAISS, Pinecone, Weaviate). Preferred Skills: Experience with Retrieval-Augmented Generation (RAG) pipelines. Understanding of Responsible AI, privacy, and model governance. Background in finance, healthcare, or enterprise AI products is a plus. 
Contributions to open-source GenAI projects or published research papers. (ref:hirist.tech)

Posted 13 hours ago

Apply

Generative AI & Automation Expert Group Drishti

5.0 years

0 Lacs

Delhi, India

On-site

Job Title : Generative AI & Automation Expert. Location : Delhi, India. Department : Technology. Reports To : Chief Technology Officer (CTO). Employment Type : Drishti IAS : 'Drishti The Vision' was founded on 1st November 1999. Since then, we have been a beacon of excellence in the field of Civil Services Examination preparation and one of the top choices for those preparing for the UPSC Civil Services Examination (CSE). Drishti IAS is dedicated to providing unwavering support and expert guidance for UPSC and State Services aspirants across multiple States/Union Territories in India. We also offer comprehensive online programs accessible through our Drishti Learning App (Android, iOS & Windows), in addition to our well-established physical classroom centers. Job Overview We are seeking a skilled Generative AI & Automation Expert to lead the development and enhancement of our AI-driven educational tools and automation infrastructure. The ideal candidate will have deep expertise in generative AI, large language models (LLMs), and workflow automation technologies, with a passion for EdTech innovation. This role involves building and maintaining AI solutions that improve learner experiences, automating content generation processes, and ensuring operational excellence through automation. Key Responsibilities AI Infrastructure Development : Design, develop, and maintain scalable AI infrastructure to support educational and operational requirements. Deploy and fine-tune local large language models (LLMs), ensuring high performance, reliability, and cost efficiency. Generative AI Features Enhance educational applications by implementing generative AI capabilities such as chatbots for real-time doubt resolution, automated quiz generation, and summarization tools. Develop automated systems to efficiently generate educational content such as test series questions and learning materials. 
Automation & Workflow Optimization Deploy and manage automation tools to streamline organizational workflows, improving productivity across various teams. Implement AI-driven analytical tools to evaluate sales and support interactions, providing actionable insights to management. Technical Leadership Lead and collaborate with cross-functional teams in integrating advanced AI and automation solutions. Stay current with developments in AI, LLMs, and automation tools, and incorporate industry best practices into the technology strategy. Knowledge Transfer & Documentation Create comprehensive documentation of processes, workflows, and system architectures. Provide training and mentorship to team members to foster innovation and maintain technological proficiency. Education Qualifications & Skills : Bachelor's or Master's degree in Computer Science, Engineering, or a related discipline. Experience Minimum of 5 years experience specializing in generative AI, LLMs, and automation. Proven experience developing AI solutions within EdTech or related sectors. Hands-on expertise with frameworks such as LangChain, LlamaIndex, and Hugging Face Transformers. Practical experience in deploying and fine-tuning models such as Deepseek, Mistral, or Llama. Technical Skills Strong programming proficiency in Python, FastAPI, and containerization technologies like Docker. Experience with workflow automation tools (e.g., n8n, Zapier). Knowledge of cloud services (AWS, Azure, GCP) and vector databases (Weaviate, Qdrant, Pinecone). Familiarity with MLOps practices and CI/CD pipelines is advantageous. Soft Skills Strong analytical thinking and problem-solving abilities. Excellent communication and interpersonal skills. Capacity to independently manage projects and handle multiple priorities effectively. Preferred Prior experience in the EdTech industry, particularly developing AI-driven educational tools. Familiarity with UPSC examination content and preparation processes. 
(ref:hirist.tech)

Posted 13 hours ago

Apply

Data Scientist - NLP/LLM SAG InfoTech Pvt. Ltd.

3.0 years

0 Lacs

Jaipur, Rajasthan, India

On-site

Job Description: We are looking for an experienced Data Scientist / AI Developer with a strong foundation in classical machine learning, deep learning, natural language processing (NLP), and generative AI. You will be responsible for designing and implementing AI models, including fine-tuning large language models (LLMs), and developing innovative solutions to solve complex problems in a variety of domains. Key Responsibilities: Develop and implement machine learning models and deep learning algorithms for various use cases. Work on NLP projects involving text classification, language modelling, entity recognition, and sentiment analysis. Leverage generative AI techniques to create innovative solutions and models for content generation, summarization, and translation tasks. Fine-tune large language models (LLMs) to optimize performance for specific tasks or applications. Collaborate with cross-functional teams to design AI-driven solutions that address business problems. Analyse large-scale datasets, perform data pre-processing, feature engineering, and model evaluation. Stay updated with the latest advancements in AI, ML, NLP, and LLMs to continuously improve models and methodologies. Present findings and insights to stakeholders in a clear and actionable manner. Build and maintain end-to-end machine learning pipelines for scalable deployment. Required Skills: Strong expertise in supervised and unsupervised machine learning techniques. Proficiency in deep learning frameworks such as TensorFlow, PyTorch, or Keras. Solid experience in Natural Language Processing (NLP), including tokenization, embeddings, and sequence modelling. Hands-on experience with generative AI models and their practical applications. Proven ability to fine-tune large language models (LLMs) for specific tasks. Strong programming skills in Python and familiarity with libraries like Scikit-learn, NumPy, and pandas. Experience in handling large datasets and working with databases (SQL, NoSQL). 
Familiarity with cloud platforms (AWS, Azure, or GCP) and containerization tools (Docker, Kubernetes). Deep expertise in computer vision, including techniques for object detection, image segmentation, image classification, and feature extraction. Strong problem-solving skills, analytical thinking, and attention to detail. Preferred Skills: Proven experience in fine-tuning LLMs (like the Llama series, Mistral) for specific tasks and optimizing their performance. Expertise in computer vision techniques, including object detection, image segmentation, and classification. Proficiency with YOLO algorithms and other state-of-the-art computer vision models. Hands-on experience in building and deploying models in real-time applications or production environments. Qualifications: 3+ years of relevant experience in AI, ML, NLP, or related fields. Bachelor's or Master's degree in Computer Science, Statistics, or a related discipline. (ref:hirist.tech)

Posted 13 hours ago

Apply

AI Engineer Centre for Computational Technologies (CCTech)

2.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Job Information: Date Opened: 06/16/2025. Industry: IT Services. Job Type: Full time. City: Pune City. State/Province: Maharashtra. Country: India. Zip/Postal Code: 411001. About Us: CCTech's mission is to transform human life by the democratization of technology. We are a well-established digital transformation company building applications in the areas of CAD, CFD, Artificial Intelligence, Machine Learning, 3D Webapps, Augmented Reality, Digital Twin, and other enterprise applications. We have two business divisions: product and consulting. simulationHub is our flagship product and the manifestation of our vision. Currently, thousands of users use our CFD app in their upfront design process. Our consulting division, with its partners such as Autodesk Forge, AWS and Azure, is helping the world's leading engineering organizations, many of which are Fortune 500 companies, in achieving digital supremacy. Job Description: We are seeking a passionate and skilled AI Engineer with over 2 years of hands-on experience to join our growing team. The ideal candidate will have an engineering background and a strong grasp of modern AI technologies, especially in Prompt Engineering, Agentic AI models, and production-grade AI workflows. You’ll play a key role in building intelligent systems that augment and automate real-world business processes. Responsibility: Design, develop, and deploy AI-powered solutions using LLMs and agentic frameworks. Build and optimize prompt engineering strategies to ensure high-performance language model behavior. Create and maintain autonomous AI agents capable of executing complex multi-step tasks. Develop, test, and iterate on real-world AI workflows integrated into broader applications. Collaborate with product managers, designers, and engineers to translate business problems into scalable AI solutions. Monitor and fine-tune AI models in production for accuracy, performance, and cost-effectiveness. 
Stay current with emerging trends in generative AI, LLMs, agent-based architectures, and MLOps. Requirements: 2+ years of hands-on experience in AI/ML engineering or applied NLP. Proven experience with Prompt Engineering and customizing large language model behavior. Experience developing or integrating Agentic AI frameworks (e.g., LangChain, AutoGPT, CrewAI, etc.). Strong understanding of LLMs (e.g., GPT-4, Claude, Mistral, Gemini, etc.) and how to apply them in workflow automation. Demonstrated ability to deploy working AI solutions and pipelines in production environments. Proficient in Python and relevant AI libraries (Transformers, OpenAI SDK, LangChain, etc.). Familiarity with RESTful APIs, cloud platforms (e.g., Azure, AWS, GCP), and version control tools (e.g., Git). Benefits: Opportunity to work with a dynamic and fast-paced IT organization. Make a real impact on the company's success by shaping a positive and engaging work culture. Work with a talented and collaborative team. Be part of a company that is passionate about making a difference through technology.

Posted 20 hours ago

Apply

Principal AI Engineer Bison Global Search

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Bison Global Search is seeking a Principal AI Engineer for a leading product company in Chennai. They work on cutting-edge technologies in the BIOS industry. Details about the role: Location: Chennai (please do not apply if you are not willing to relocate to Chennai). Company: Product Company (Leader in BIOS Products). Designation: Principal AI Engineer. Skills Required: Python + RAG (Retrieval-Augmented Generation) + Agentic AI. Experience: 8+ years of experience as an AI engineer, plus guiding and mentoring a team of 3-5 AI engineers. Please find the complete JD below. If this interests you, please apply to this email. We are looking for a highly skilled Principal AI Engineer with deep expertise in Retrieval-Augmented Generation (RAG) and Agentic AI to lead our AI initiatives and drive innovation in FW and Data Center AI solutions. This role requires a strategic thinker who can design and deploy scalable AI architectures, integrate LLMs with retrieval-based techniques, and develop intelligent agentic systems that autonomously interact with data, APIs, and workflows. This role will lead the design and deployment of cutting-edge AI-driven solutions, focusing on LLMs for code synthesis, automated testing, and intelligent autonomous agents that enhance software development workflows. It calls for strong technical expertise, strategic vision, and leadership to build and deploy AI-driven products that align with business goals. Key Responsibilities: AI Strategy and Leadership: Define and execute AI strategies focused on RAG-based retrieval, code generation, and AI-assisted software engineering. Work with stakeholders to align AI capabilities with business objectives and software development needs. Research and integrate cutting-edge LLMs and autonomous AI agent architecture into development processes. RAG & Agentic AI Development: Develop RAG pipelines that enhance AI's ability to retrieve relevant knowledge and generate context-aware responses. 
Build and optimize agentic AI systems that can interact with APIs, databases, and development environments (such as LangChain, OpenAI APIs, etc.). Implement AI-powered search, chatbots, and decision-support tools for software engineers. Fine-tune LLMs (GPT, Llama, Mistral, Claude, Gemini, etc.) for domain-specific applications. Optimize retrieval mechanisms to enhance response accuracy, grounding AI outputs in real-world data. Code Generation & Test Case Automation: Leverage LLMs to generate high-quality, production-ready code. Develop AI-driven test case generation tools that automatically create and validate unit tests, integration tests, and regression tests. Integrate AI-driven code assistants and programming agents into IDE and CI/CD workflows. Optimize prompt engineering and fine-tuning strategies for LLMs to improve code quality and efficiency. MLOps & Scalable AI Systems: Architect and deploy scalable AI models and retrieval pipelines using cloud-based MLOps pipelines (AWS/GCP/Azure, Docker, Kubernetes). Optimize LLMs for real-time AI inferencing, ensuring low latency and high-performance AI solutions. Collaboration: Work cross-functionally with product teams, software engineers, and business stakeholders to integrate AI solutions into products. Mentorship: Guide and mentor a team of 3-5 AI engineers in LLM fine-tuning, retrieval augmentation, and autonomous AI agents. Establish best practices for AI-assisted software development, secure AI integration, and bias mitigation. Research & Innovation: Commitment to staying updated with the latest AI and machine learning research and advancements. Ability to think creatively and propose innovative solutions to complex problems. Model Development: Ability to design, train, and evaluate various AI models, including LLMs and standalone models; familiarity with model training tools and frameworks like Hugging Face Trainer, Fairseq, etc. Required Qualifications: Education: Master's or Ph.D.
in Computer Science, AI, Machine Learning, or a related field. Experience: 8+ years of experience in AI and machine learning, with at least 2 years of experience working on LLMs, code generation, RAG, or AI-powered automation. Technical skills: Proficiency in Python, TensorFlow, PyTorch, and LangChain. Experience with LLM fine-tuning for code generation. Strong expertise in vector databases (FAISS, Weaviate, Chroma, Pinecone, Milvus) and retrieval models. Hands-on experience with AI-powered code assistants (Copilot, Code Llama, Codex, GPT-4). Knowledge of automated software testing, AI-driven test case generation, and AI-assisted debugging. Experience with multi-agent AI systems (LangGraph, CrewAI, AutoGen, OpenAI Assistants API) for autonomous coding tasks. Knowledge of GoLang for building high-performance and scalable components and unit test case generation using CMocka is a plus. Hands-on model development, working with business stakeholders to define KPIs and develop and deliver multi-modal (Text and Images) and ensemble models. Develop novel approaches to solve firmware lifecycle management code generation and customer support issues. Implement advanced natural language processing and computer vision models to extract insights from diverse data sources, user-generated data, and images. Automate model lifecycle management. Stay updated with AI and machine learning technology advancements to drive Firmware Lifecycle Management. Analytical & Problem-Solving: Analytical Thinking: Strong analytical skills to interpret complex data and derive actionable insights. Problem-Solving: Ability to troubleshoot and resolve technical issues related to AI models and systems. Research & Innovation: Continuous Learning: Commitment to staying updated with the latest research and advancements in AI and machine learning. Innovation: Ability to think creatively and propose innovative solutions to complex problems.
Soft Skills: Communication: Excellent verbal and written communication skills. Adaptability: Ability to adapt to changing technologies and project requirements. Team Player: Strong interpersonal skills and the ability to work well in a team environment. Preferred Qualifications: Experience with deploying and maintaining AI models in production environments. Familiarity with RAG-specific techniques like knowledge distillation or multi-hop retrieval. Understanding of reinforcement learning and active learning techniques for model improvement. Previous experience with large-scale NLP systems and AI-powered search engines. Contribution to AI research, patents, or open-source development.

Posted 22 hours ago

AI Engineer Sprintpark solutions

3.0 - 7.0 years

7 - 16 Lacs

Hyderābād

On-site

AI Specialist / Machine Learning Engineer. Location: On-site (Hyderabad). Department: Data Science & AI Innovation. Experience Level: Mid–Senior. Reports To: Director of AI / CTO. Employment Type: Full-time. Job Summary: We are seeking a skilled and forward-thinking AI Specialist to join our advanced technology team. In this role, you will lead the design, development, and deployment of cutting-edge AI/ML solutions, including large language models (LLMs), multimodal systems, and generative AI. You will collaborate with cross-functional teams to develop intelligent systems, automate complex workflows, and unlock insights from data at scale. Key Responsibilities: Design and implement machine learning models for natural language processing (NLP), computer vision, predictive analytics, and generative AI. Fine-tune and deploy LLMs using frameworks such as Hugging Face Transformers, OpenAI APIs, and Anthropic Claude. Develop Retrieval-Augmented Generation (RAG) pipelines using tools like LangChain, LlamaIndex, and vector databases (e.g., Pinecone, Weaviate, Qdrant). Productionize ML workflows using MLflow, TensorFlow Extended (TFX), or AWS SageMaker Pipelines. Integrate generative AI with business applications, including Copilot-style features, chat interfaces, and workflow automation. Collaborate with data scientists, software engineers, and product managers to build and scale AI-powered products. Monitor, evaluate, and optimize model performance, focusing on fairness, explainability (e.g., SHAP, LIME), and data/model drift. Stay informed on cutting-edge AI research (e.g., NeurIPS, ICLR, arXiv) and evaluate its applicability to business challenges.
Tools & Technologies. Languages & Frameworks: Python, PyTorch, TensorFlow, JAX, FastAPI, LangChain, LlamaIndex. ML & AI Platforms: OpenAI (GPT-4/4o), Anthropic Claude, Mistral, Cohere, Hugging Face Hub & Transformers, Google Vertex AI, AWS SageMaker, Azure ML. Data & Deployment: MLflow, DVC, Apache Airflow, Ray, Docker, Kubernetes, RESTful APIs, GraphQL, Snowflake, BigQuery, Delta Lake. Vector Databases & RAG Tools: Pinecone, Weaviate, Qdrant, FAISS, ChromaDB, Milvus. Generative & Multimodal AI: DALL·E, Sora, Midjourney, Runway, Whisper, CLIP, SAM (Segment Anything Model). Qualifications: Bachelor’s or Master’s in Computer Science, AI, Data Science, or related discipline. 3–7 years of experience in machine learning or applied AI. Hands-on experience deploying ML models to production environments. Familiarity with LLM prompt engineering and fine-tuning. Strong analytical thinking, problem-solving ability, and communication skills. Preferred Qualifications: Contributions to open-source AI projects or academic publications. Experience with multi-agent frameworks (e.g., AutoGPT, OpenDevin). Knowledge of synthetic data generation and augmentation techniques. Job Type: Permanent. Pay: ₹734,802.74 - ₹1,663,085.14 per year. Benefits: Health insurance, Provident Fund. Schedule: Day shift. Work Location: In person.

Posted 1 day ago

Senior Machine Learning Engineer Level AI

3.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Level AI was founded in 2019 and is a Series C startup headquartered in Mountain View, California. Level AI revolutionizes customer engagement by transforming contact centers into strategic assets. Our AI-native platform leverages advanced technologies such as Large Language Models to extract deep insights from customer interactions. By providing actionable intelligence, Level AI empowers organizations to enhance customer experience and drive growth. Consistently updated with the latest AI innovations, Level AI stands as the most adaptive and forward-thinking solution in the industry. Empowering contact center stakeholders with real-time insights, our tech facilitates data-driven decision-making for contact centers, enhancing service levels and agent performance. As a vital team member, you will work with cutting-edge technologies and play a high-impact role in shaping the future of AI-driven enterprise applications. You will work directly with people who have worked at Amazon, Facebook, Google, and other technology companies around the world. With Level AI, you will get to have fun, learn new things, and grow along with us. Ready to redefine possibilities? Join us! We'd love to learn more about you if you have: Qualification: B.E/B.Tech/M.E/M.Tech/PhD from tier 1 engineering institutes with relevant work experience with a top technology company in computer science or mathematics-related fields, with 3-5 years of experience in machine learning and NLP. Knowledge and practical experience in solving NLP problems in areas such as text classification, entity tagging, information retrieval, question-answering, natural language generation, clustering, etc. 3+ years of experience working with LLMs in large-scale environments.
Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs. Knowledge and hands-on experience with Transformer-based Language Models like BERT, DeBERTa, Flan-T5, Mistral, Llama, etc. Deep familiarity with internals of at least a few Machine Learning algorithms and concepts. Experience with Deep Learning frameworks like PyTorch and common machine learning libraries like scikit-learn, numpy, pandas, NLTK, etc. Experience with ML model deployments using REST API, Docker, Kubernetes, etc. Knowledge of cloud platforms (AWS/Azure/GCP) and their machine learning services is desirable. Knowledge of basic data structures and algorithms. Knowledge of real-time streaming tools/architectures like Kafka, Pub/Sub is a plus. Your role at Level AI includes but is not limited to: Big picture: Understand customers’ needs, innovate and use cutting-edge Deep Learning techniques to build data-driven solutions. Work on NLP problems across areas such as text classification, entity extraction, summarization, generative AI, and others. Collaborate with cross-functional teams to integrate/upgrade AI solutions into the company’s products and services. Optimize existing deep learning models for performance, scalability, and efficiency. Build, deploy, and own scalable production NLP pipelines. Build post-deployment monitoring and continual learning capabilities. Propose suitable evaluation metrics and establish benchmarks. Keep abreast of SOTA techniques in your area and exchange knowledge with colleagues. Desire to learn, implement and work with the latest emerging model architectures, training and inference techniques, data curation pipelines, etc. To learn more visit: https://thelevel.ai/ Funding: https://www.crunchbase.com/organization/level-ai LinkedIn: https://www.linkedin.com/company/level-ai/

Posted 1 day ago

Senior Machine Learning Engineer Level AI

3.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Level AI was founded in 2019 and is a Series C startup headquartered in Mountain View, California. Level AI revolutionizes customer engagement by transforming contact centers into strategic assets. Our AI-native platform leverages advanced technologies such as Large Language Models to extract deep insights from customer interactions. By providing actionable intelligence, Level AI empowers organizations to enhance customer experience and drive growth. Consistently updated with the latest AI innovations, Level AI stands as the most adaptive and forward-thinking solution in the industry. Empowering contact center stakeholders with real-time insights, our tech facilitates data-driven decision-making for contact centers, enhancing service levels and agent performance. As a vital team member, you will work with cutting-edge technologies and play a high-impact role in shaping the future of AI-driven enterprise applications. You will work directly with people who have worked at Amazon, Facebook, Google, and other technology companies around the world. With Level AI, you will get to have fun, learn new things, and grow along with us. Ready to redefine possibilities? Join us! We'd love to learn more about you if you have: Qualification: B.E/B.Tech/M.E/M.Tech/PhD from tier 1 engineering institutes with relevant work experience with a top technology company in computer science or mathematics-related fields, with 3-5 years of experience in machine learning and NLP. Knowledge and practical experience in solving NLP problems in areas such as text classification, entity tagging, information retrieval, question-answering, natural language generation, clustering, etc. 3+ years of experience working with LLMs in large-scale environments.
Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs. Knowledge and hands-on experience with Transformer-based Language Models like BERT, DeBERTa, Flan-T5, Mistral, Llama, etc. Deep familiarity with internals of at least a few Machine Learning algorithms and concepts. Experience with Deep Learning frameworks like PyTorch and common machine learning libraries like scikit-learn, numpy, pandas, NLTK, etc. Experience with ML model deployments using REST API, Docker, Kubernetes, etc. Knowledge of cloud platforms (AWS/Azure/GCP) and their machine learning services is desirable. Knowledge of basic data structures and algorithms. Knowledge of real-time streaming tools/architectures like Kafka, Pub/Sub is a plus. Your role at Level AI includes but is not limited to: Big picture: Understand customers’ needs, innovate and use cutting-edge Deep Learning techniques to build data-driven solutions. Work on NLP problems across areas such as text classification, entity extraction, summarization, generative AI, and others. Collaborate with cross-functional teams to integrate/upgrade AI solutions into the company’s products and services. Optimize existing deep learning models for performance, scalability, and efficiency. Build, deploy, and own scalable production NLP pipelines. Build post-deployment monitoring and continual learning capabilities. Propose suitable evaluation metrics and establish benchmarks. Keep abreast of SOTA techniques in your area and exchange knowledge with colleagues. Desire to learn, implement and work with the latest emerging model architectures, training and inference techniques, data curation pipelines, etc. To learn more visit: https://thelevel.ai/ Funding: https://www.crunchbase.com/organization/level-ai LinkedIn: https://www.linkedin.com/company/level-ai/

Posted 1 day ago

Artificial Intelligence Engineer HGS

6.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Job Title: AI Engineer. Want to join a startup, but with the stability of a larger organization? Join our innovation team at HGS that's focused on building SaaS products. If you are a highly driven and passionate person who'd like to build highly scalable SaaS products in a startup-style environment, you're welcome to apply. The HGS Digital Innovation Team is designed to create products and solutions relevant for enterprises, to discover innovations, and to contextualize and experiment with them within a specific industry. This unit provides an environment for the exploration, development, and testing of cloud-based Digital AI solutions. It also looks at rapid deployment at scale and the sustainability of these solutions for target business impacts. Job Overview: We are seeking an agile AI Engineer with a strong focus on both AI engineering and SaaS product development in a 0-1 product environment. This role is perfect for a candidate skilled in building and iterating quickly, embracing a fail-fast approach to bring innovative AI solutions to market rapidly. You will be responsible for designing, developing, and deploying SaaS products using advanced Large Language Models (LLMs) such as Meta, Azure OpenAI, Claude, and Mistral, while ensuring secure, scalable, and high-performance architecture. Your ability to adapt, iterate, and deliver in fast-paced environments is critical. Responsibilities: Lead the design, development, and deployment of SaaS products leveraging LLMs, including platforms like Meta, Azure OpenAI, Claude, and Mistral. Support the product lifecycle, from conceptualization to deployment, ensuring seamless integration of AI models with business requirements and user needs. Build secure, scalable, and efficient SaaS products that embody robust data management and comply with security and governance standards.
Collaborate closely with product management, and other stakeholders to align AI-driven SaaS solutions with business strategies and customer expectations. Fine-tune AI models using custom instructions to tailor them to specific use cases and optimize performance through techniques like quantization and model tuning. Architect AI deployment strategies using cloud-agnostic platforms (AWS, Azure, Google Cloud), ensuring cost optimization while maintaining performance and scalability. Apply retrieval-augmented generation (RAG) techniques to build AI models that provide contextually accurate and relevant outputs. Build the integration of APIs and third-party services into the SaaS ecosystem, ensuring robust and flexible product architecture. Monitor product performance post-launch, iterating and improving models and infrastructure to enhance user experience and scalability. Stay current with AI advancements, SaaS development trends, and cloud technology to apply innovative solutions in product development. Qualifications Bachelor’s degree or equivalent in Information Systems, Computer Science, or related fields. 6+ years of experience in product development, with at least 2 years focused on AI-based SaaS products. Demonstrated experience in leading the development of SaaS products, from ideation to deployment, with a focus on AI-driven features. Hands-on experience with LLMs (Meta, Azure OpenAI, Claude, Mistral) and SaaS platforms. Proven ability to build secure, scalable, and compliant SaaS solutions, integrating AI with cloud-based services (AWS, Azure, Google Cloud). Strong experience with RAG model techniques and fine-tuning AI models for business-specific needs. Proficiency in AI engineering, including machine learning algorithms, deep learning architectures (e.g., CNNs, RNNs, Transformers), and integrating models into SaaS environments. 
Solid understanding of SaaS product lifecycle management, including customer-focused design, product-market fit, and post-launch optimization. Excellent communication and collaboration skills, with the ability to work cross-functionally and drive SaaS product success. Knowledge of cost-optimized AI deployment and cloud infrastructure, focusing on scalability and performance.

Posted 1 day ago

AI Engineer Centre for Computational Technologies

0.0 - 2.0 years

0 Lacs

Pune, Maharashtra

On-site

Job Information: Date Opened: 06/16/2025. Industry: IT Services. Job Type: Full time. City: Pune City. State/Province: Maharashtra. Country: India. Zip/Postal Code: 411001. About Us: CCTech's mission is to transform human life by the democratization of technology. We are a well-established digital transformation company building applications in the areas of CAD, CFD, Artificial Intelligence, Machine Learning, 3D Webapps, Augmented Reality, Digital Twin, and other enterprise applications. We have two business divisions: product and consulting. simulationHub is our flagship product and the manifestation of our vision. Currently, thousands of users use our CFD app in their upfront design process. Our consulting division, with its partners such as Autodesk Forge, AWS and Azure, is helping the world's leading engineering organizations, many of which are Fortune 500 companies, in achieving digital supremacy. Job Description: We are seeking a passionate and skilled AI Engineer with over 2 years of hands-on experience to join our growing team. The ideal candidate will have an engineering background and a strong grasp of modern AI technologies, especially in Prompt Engineering, Agentic AI models, and production-grade AI workflows. You’ll play a key role in building intelligent systems that augment and automate real-world business processes. Responsibilities: Design, develop, and deploy AI-powered solutions using LLMs and agentic frameworks. Build and optimize prompt engineering strategies to ensure high-performance language model behavior. Create and maintain autonomous AI agents capable of executing complex multi-step tasks. Develop, test, and iterate on real-world AI workflows integrated into broader applications. Collaborate with product managers, designers, and engineers to translate business problems into scalable AI solutions. Monitor and fine-tune AI models in production for accuracy, performance, and cost-effectiveness.
Stay current with emerging trends in generative AI, LLMs, agent-based architectures, and MLOps. Requirements 2+ years of hands-on experience in AI/ML engineering or applied NLP. Proven experience with Prompt Engineering and customizing large language model behavior. Experience developing or integrating Agentic AI frameworks (e.g., LangChain, AutoGPT, CrewAI, etc.). Strong understanding of LLMs (e.g., GPT-4, Claude, Mistral, Gemini, etc.) and how to apply them in workflow automation. Demonstrated ability to deploy working AI solutions and pipelines in production environments. Proficient in Python and relevant AI libraries (Transformers, OpenAI SDK, LangChain, etc.). Familiarity with RESTful APIs, cloud platforms (e.g., Azure, AWS, GCP), and version control tools (e.g., Git) Benefits Opportunity to work with a dynamic and fast-paced IT organization. Make a real impact on the company's success by shaping a positive and engaging work culture. Work with a talented and collaborative team. Be part of a company that is passionate about making a difference through technology.

Posted 1 day ago

Senior Data Scientist Birdeye

0 years

0 Lacs

Gurugram, Haryana, India

On-site

Organization Snapshot: Birdeye is the leading all-in-one Experience Marketing platform, trusted by more than 100,000 businesses worldwide to power customer acquisition, engagement, and retention through AI-driven automation and reputation intelligence. From local businesses to global enterprises, Birdeye enables brands to deliver exceptional customer experiences across every digital touchpoint. As we enter our next phase of global scale and product-led growth, AI is no longer an add-on; it’s at the very heart of our innovation strategy. Our future is being built on Large Language Models (LLMs), Generative AI, Conversational AI, and intelligent automation that can personalize and enhance every customer interaction in real time. Job Overview: Birdeye is seeking a Senior Data Scientist – NLP & Generative AI to help reimagine how businesses interact with customers at scale through production-grade, LLM-powered AI systems. If you’re passionate about building autonomous, intelligent, and conversational systems, this role offers the perfect platform to shape the next generation of agentic AI technologies. As part of our core AI/ML team, you'll design, deploy, and optimize end-to-end intelligent systems spanning LLM fine-tuning, Conversational AI, Natural Language Understanding (NLU), Retrieval-Augmented Generation (RAG), and Autonomous Agent frameworks. This is a high-impact IC role ideal for technologists who thrive at the intersection of deep NLP research and scalable engineering. Key Responsibilities: LLM, GenAI & Agentic AI Systems: Architect and deploy LLM-based frameworks using GPT, LLaMA, Claude, Mistral, and open-source models. Implement fine-tuning, LoRA, PEFT, instruction tuning, and prompt tuning strategies for production-grade performance. Build autonomous AI agents with tool use, short/long-term memory, planning, and multi-agent orchestration (using LangChain Agents, Semantic Kernel, Haystack, or custom frameworks).
Design RAG pipelines with vector databases (Pinecone, FAISS, Weaviate) for domain-specific contextualization. Conversational AI & NLP Engineering: Build Transformer-based Conversational AI systems for dynamic, goal-oriented dialog, leveraging orchestration tools like LangChain, Rasa, and LLMFlow. Implement NLP solutions for semantic search, NER, summarization, intent detection, text classification, and knowledge extraction. Integrate modern NLP toolkits: SpaCy, BERT/RoBERTa, GloVe, Word2Vec, NLTK, and HuggingFace Transformers. Handle multilingual NLP, contextual embeddings, and dialogue state tracking for real-time systems. Scalable AI/ML Engineering: Build and serve models using Python, FastAPI, gRPC, and REST APIs. Containerize applications with Docker, deploy using Kubernetes, and orchestrate with CI/CD workflows. Ensure production-grade reliability, latency optimization, observability, and failover mechanisms. Cloud & MLOps Infrastructure: Deploy on AWS SageMaker, Azure ML Studio, or Google Vertex AI, integrating with serverless and auto-scaling services. Own end-to-end MLOps pipelines: model training, versioning, monitoring, and retraining using MLflow, Kubeflow, or TFX. Cross-Functional Collaboration: Partner with Product, Engineering, and Design teams to define AI-first experiences. Translate ambiguous business problems into structured ML/AI projects with measurable ROI. Contribute to roadmap planning, POCs, technical whitepapers, and architectural reviews. Technical Skillset Required: Programming: Expert in Python, with strong OOP and data structure fundamentals. Frameworks: Proficient in PyTorch, TensorFlow, Hugging Face Transformers, LangChain, OpenAI/Anthropic APIs. NLP/LLM: Strong grasp of Transformer architecture, Attention mechanisms, self-supervised learning, and LLM evaluation techniques. MLOps: Skilled in CI/CD tools, FastAPI, Docker, Kubernetes, and deployment automation on AWS/Azure/GCP.
Databases: Hands-on with SQL/NoSQL databases, Vector DBs, and retrieval systems. Tooling: Familiarity with Haystack, Rasa, Semantic Kernel, LangChain Agents, and memory-based orchestration for agents. Applied Research: Experience integrating recent GenAI research (AutoGPT-style agents, Toolformer, etc.) into production systems. Bonus Points: Contributions to open-source NLP or LLM projects. Publications in AI/NLP/ML conferences or journals. Experience in Online Reputation Management (ORM), martech, or CX platforms. Familiarity with reinforcement learning, multi-modal AI, or few-shot learning at scale.

Posted 1 day ago


285 Mistral Jobs
