Jobs
Interviews

103 Rag Pipelines Jobs - Page 2

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 7.0 years

0 Lacs

thiruvananthapuram, kerala

On-site

As a Generative AI Architect at Techvantage.ai, your primary responsibility will be to design and lead end-to-end solutions incorporating Large Language Models (LLMs), diffusion models, transformers, and multimodal AI across various enterprise and consumer use cases. You will play a crucial role in developing robust and scalable architectures for prompt engineering, model orchestration, embedding pipelines, vector databases, and fine-tuning/instruction tuning. Your expertise will be essential in evaluating and integrating both open-source and proprietary models based on specific business requirements. In this strategic and high-impact role, you will be expected to collaborate closely with product managers, data scientists, engineers, and clients to design and implement cutting-edge GenAI-powered applications aligned with business objectives. Ethical considerations such as data privacy, model safety, and bias mitigation will be integral parts of your responsibilities. Moreover, your contribution to internal GenAI accelerators, reusable frameworks, and innovation strategies will be crucial to the continuous growth and development of the organization. To excel in this position, you should possess a minimum of 10 years of experience in technology roles, specifically in AI/ML architecture, data engineering, or solution architecture, with at least 3 years dedicated to designing and deploying Generative AI solutions. Deep knowledge of transformer architectures, LLMs, diffusion models, and related frameworks is essential, as well as proficiency in Python, Docker, Kubernetes, and cloud platforms such as AWS, Azure, or GCP. Strong communication and stakeholder management skills will also be key in effectively conveying complex AI concepts to non-technical audiences. While not a mandatory requirement, experience deploying GenAI systems in BFSI, healthcare, e-commerce, or enterprise SaaS platforms, familiarity with agentic AI systems, autoML, RLHF, or autonomous agents, and knowledge of data governance, compliance, and responsible AI frameworks will be advantageous. Additionally, contributions to open-source AI projects or publications in the AI/ML domain would be considered a strong plus. A Master's or PhD in Computer Science, AI/ML, Data Science, or related field is preferred. Join us at Techvantage.ai and lead innovation at the forefront of Generative AI and intelligent systems. You will have the opportunity to work with a highly talented team, leverage state-of-the-art tools and models, and receive generous compensation that reflects your expertise and contributions.,

Posted 2 weeks ago

Apply

12.0 - 16.0 years

0 Lacs

karnataka

On-site

The position is based at Altimetrik with base locations in Bangalore, Chennai, Pune, Jaipur, Hyderabad, Gurugram. The ideal candidate should be able to join immediately or within 10 days. As a Machine Learning Engineer, your primary responsibility will be to design and implement ML solutions while architecting scalable and efficient systems. You should be proficient in Machine Learning Algorithms, Data Engineering, ETL/ELT processes, data cleaning, preprocessing, EDA, feature engineering, data splitting, and encoding. Additionally, you should have experience in MLOps, including model versioning, training, experimenting, deployment, and monitoring using tools such as Python, Pandas, TensorFlow, PyTorch, Scikit-learn, Keras, XGBoost, LightGBM, Matplotlib, R, Scala, Java, Git, DVC, MLFlow, Kubernetes, Kubeflow, Docker, Containers, CI/CD deployments, Apache Airflow, Databricks, Snowflake, Salesforce, SAP, AWS/Azure/GCP Data Cloud Platforms, AWS SageMaker, Google AI Platform, Azure Machine Learning, model design and optimization, LLMs models (OpenAI, BERT, LLaMA, Gemini, etc.), RDBMS, NoSQL databases, Vector DB, RAG Pipelines, AI Agent Frameworks, AI agent authentication, deployment, AI security and compliance, and Prompt Engineering. Your secondary skills should include cloud computing, data engineering, and DevOps. You will be responsible for designing and developing AI/ML models and algorithms. Collaboration with data scientists and engineers to ensure the scalability and performance of AI/ML systems will be a key part of your role. To be considered for this position, you should have 12-15 years of experience in AI/ML development, strong expertise in AI/ML frameworks and tools, and excellent problem-solving and technical skills.,

Posted 2 weeks ago

Apply

2.0 - 6.0 years

0 - 0 Lacs

hyderabad, telangana

On-site

As a Backend Development Engineer at Shoshin Tech, you will play a key role in building and maintaining the backend logic of AI-driven web applications. Your primary responsibility will be to develop scalable infrastructure for large language models (LLMs), agents, and AI workflows. Your expertise in Python, LangChain, LangGraph, and server-side logic will be crucial in creating robust and intelligent backend systems. You will collaborate with front-end developers and UI/UX designers to integrate AI-driven backend logic with the user interface. Additionally, you will be responsible for developing and managing APIs and microservices using Python, FastAPI, or Flask, as well as designing, implementing, and optimizing workflows leveraging various AI frameworks. Your role will involve building and deploying scalable AI agents and workflows using LLMs, integrating AI models, fine-tuning them, and optimizing their performance. You will also develop and maintain vector databases and retrieval-augmented generation (RAG) pipelines to enhance AI responses. Furthermore, you will implement robust security measures, authentication, and data protection for AI applications, optimize application performance to ensure minimal latency in AI-driven workflows, and work with cloud platforms for model deployment and scalable infrastructure management. You will also mentor junior developers, stay updated with emerging trends in AI and backend development, and collaborate with the product team to translate business requirements into technical solutions. To be successful in this role, you should have a Bachelor's degree or higher in Computer Science or related field, a minimum of 2 years of experience as a Backend Developer, proficiency in Python and frameworks such as FastAPI, Flask, or Django, and familiarity with embeddings, LLMs, and vector databases. Strong problem-solving skills, experience in a startup environment, and excellent communication skills are also essential. At Shoshin Tech, we offer a supportive and flexible workplace that promotes work-life balance, autonomy to explore your ideas, and an inclusive environment that values diversity. You will have the opportunity for cross-functional exposure, engaging social events, and collaboration with a problem-solving team dedicated to creating innovative solutions. If you are passionate about cutting-edge technology, have an entrepreneurial spirit, and a commitment to promoting well-being, Shoshin Tech is the place for you. We are an Equal Opportunity Employer dedicated to creating an inclusive environment for all individuals. Join us at Shoshin Tech and be part of a high-performance team that is enthusiastic about operational excellence and transforming the future of work.,

Posted 2 weeks ago

Apply

0.0 - 4.0 years

0 Lacs

ahmedabad, gujarat

On-site

Design and build smart automation workflows using n8n, Zapier, and Make.com. Integrate APIs and connect third-party apps to streamline business processes. Utilize LLMs (e.g., OpenAI, Cohere) for tasks like summarization, data extraction, and decision logic. Create RAG pipelines utilizing vector databases such as Pinecone, ChromaDB, or Weaviate. Develop and test autonomous agents using LangChain, AutoGen, or similar frameworks. Write clean, modular code in Python or JavaScript to support custom workflow logic. Prototype ideas quickly and deploy real features used in production environments. Document workflows thoroughly and collaborate with developers, consultants, and product teams. Key Skills: Final-year students in Computer Science, AI/ML, Data Science, Information Systems, or related fields, who are willing to work full time after the internship. Curiosity & Initiative are essential you enjoy experimenting with new tools/technologies and are unafraid to break things to learn. Possess Basic to Intermediate Coding Skills and are comfortable with writing Python or JavaScript/TypeScript. Able to read API docs and write modular code. Familiarity (or willingness to learn) with Workflow Platforms such as n8n, Zapier, Make.com, or similar; if you haven't used n8n yet, assistance will be provided for onboarding. Knowledge of APIs, including understanding of RESTful APIs, JSON, and authentication mechanisms. Interest in AI/LLMs familiarity with LLMs basics or eagerness to dive into prompt engineering, embeddings, and RAG concepts. Problem-Solving Mindset ability to break down complex tasks into smaller steps, map flows, and anticipate edge cases. Strong Communication & Documentation skills capable of explaining workflows, documenting steps, and writing clean README/instructions. A Team Player open to feedback, collaboration in agile/scrum-like setups, and assisting peers in troubleshooting.,

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

NTT DATA is looking for an AI Data Scientist - Digital Engineering Sr. Engineer to join their team in Hyderabad, Telangana (IN-TG), India. As an AI Data Scientist, you will be focusing on Generative AI & LLMs. Your responsibilities will include designing and developing AI/ML models with a specific focus on LLMs such as GPT, LLaMA, Mistral, Falcon, and Claude. You will be applying prompt engineering, fine-tuning, and transfer learning techniques to tailor models for enterprise use cases. Additionally, you will work with vector databases and retrieval-augmented generation (RAG) pipelines for contextual response generation. Collaboration is key in this role, as you will be working closely with data engineers, AI Engineers, and MLOps teams to deploy models in production environments. If you are an exceptional, innovative, and passionate individual looking to be part of an inclusive, adaptable, and forward-thinking organization, apply now to join NTT DATA. NTT DATA is a trusted global innovator of business and technology services with a commitment to helping clients innovate, optimize, and transform for long-term success. With a presence in more than 50 countries, NTT DATA serves 75% of the Fortune Global 100 and has a diverse team of experts. The company offers services ranging from business and technology consulting to data and artificial intelligence solutions, industry-specific offerings, and the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is a leading provider of digital and AI infrastructure globally and is part of the NTT Group, which invests significantly in R&D to support organizations and society in confidently transitioning into the digital future. For more information, visit us at us.nttdata.com.,

Posted 2 weeks ago

Apply

4.0 - 7.0 years

0 Lacs

mumbai, maharashtra, india

On-site

Senior AI Engineer ???? Mumbai | Hybrid About Quantanite Were a global CX and digital solutions partner that blends cutting-edge AI with the human touch. Headquartered in London and operating across 4 continents, our 2,000+ people help some of the worlds fastest-growing brands scale smarter, work faster, and deliver better serviceevery time. Were not your typical outsourcing company. At Quantanite, great service is built on two things: smart tech and smarter people. From proprietary AI platforms like MBIUS to our collaborative, people-first culture, we equip our teams with the freedom and tools to build game-changing solutions. If youre looking for a place where your AI engineering skills can shape real enterprise impact (not just POCs), youll feel at home here. The Role As a Senior AI Engineer , youll own the full lifecycle of AI solutionsfrom design to production. Youll be hands-on with multi-agent workflows, advanced knowledge retrieval, and LLM optimization to solve enterprise-scale problems. Expect to lead projects such as AI Copilots, Enterprise Search Systems, Summarization Engines, and Intelligent Automation Tools building robust, scalable solutions that move beyond prototypes into production. Youll also play a key role in mentoring junior engineers, setting engineering standards, and collaborating cross-functionally to ensure delivery at scale. What Youll Do Build and deploy agentic systems and multi-agent workflows using LangChain, LlamaIndex, OpenAI SDK (and beyond). Design RAG pipelines with vector DBs like FAISS, Pinecone, Weaviate, or Qdrant. Fine-tune and optimize LLMs (GPT-4, Claude, LLaMA, Mistral, Gemini) with LoRA/QLoRA, quantization, and other techniques. Implement guardrails, safe API usage, and PII protection for secure deployments. Monitor/evaluate agent performance with LangSmith, AgentOps , and similar tools. Ship real-world enterprise AI use casesCopilots, orchestration engines, intelligent search, and more. Mentor and review the work of junior AI engineers, spreading best practices. Work with product, data, and DevOps teams to bring AI solutions into production at scale. What Youll Bring Solid grounding in LLMs and GenAI (fine-tuning, prompt engineering, practical deployment). Proven track record with RAG pipelines and vector databases. Hands-on with agent frameworks (LangChain, LlamaIndex, etc.) and orchestration. Experience deploying/scaling models on clouds (AWS, Azure, HuggingFace). Strong coding ability in Python or similar. Familiarity with LLM Ops : monitoring, evaluation, cost optimisation. Delivered enterprise-grade AI solutions (not just demos). Experience mentoring engineers and building technical excellence in a team. Bonus: contributed to open-source LLM projects or worked on autonomous agent use cases (research copilots, orchestration systems). Qualifications B.Tech (required). 4.57 years of relevant experience. Strong knowledge of GenAI, agentic systems, and multi-agent architectures. Attributes: clear communicator, strong leader, highly tech-savvy. Preferred: exposure to BPO/KPO environments or systems experience in enterprise AI/agentic tools. Why Quantanite ???? Hybrid work model. ???? Ongoing training, mentorship & career growth. ???? Inclusive, people-first culture. ???? Health insurance & provident fund. Quantanite is an equal opportunity employer. We celebrate diversity and are committed to building an inclusive environment for all. ???? Ready to push agentic AI beyond theory into enterprise-scale impact Lets build the future together. Apply today to discover how we can build better, together. Show more Show less

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

india

On-site

The Data and Artificial Intelligence team in Microsoft's Developer Division, part of newly formed CoreAI group, is working on the future of AI for Developers! We are a diverse, entrepreneurial and multi-disciplinary group of scientists, researchers and engineers with passion for using AI to improve the productivity of millions of developers around the world. We have released AI for code advancements in Github Copilot, VS Code, Visual Studio and other Microsoft Copilots. We are seeking a Principal Data & Applied Science Manager to lead high-impact projects at the intersection of AI and software engineering. In this role, you will drive the development and application of advanced data science techniques-especially those involving LLMs-to real-world developer workflows. You'll collaborate closely with data scientists, engineers, researchers, and product teams across Microsoft and GitHub to shape the next generation of AI-powered developer workflows . You will have the opportunity to design and lead experiments, build and evaluate state-of-the-art models (including RAG pipelines , finetuning and evaluation frameworks), and guide your team in turning insights into scalable product features. Most importantly, you'll play a key role in bridging cutting-edge research and production, using data to inform decisions, drive iteration, and deliver measurable impact at scale. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

india

On-site

The Data and Artificial Intelligence team in Microsoft's Developer Division, part of newly formed CoreAI group, is working on the future of AI for Developers! We are a diverse, entrepreneurial and multi-disciplinary group of scientists, researchers and engineers with passion for using AI to improve the productivity of millions of developers around the world. We have released AI for code advancements in Github Copilot, VS Code, Visual Studio and other Microsoft Copilots. We are seeking a Principal Data & Applied Scien tist to work on groundbreaking and high-impact projects at the intersection of AI and software engineering. In this role, you will drive the development and application of advanced data science techniques-especially those involving LLMs-to real-world developer workflows. You'll collaborate closely with data scientists, engineers, researchers, and product teams across Microsoft and GitHub to shape the next generation of AI-powered developer workflows . You will have the opportunity to design and lead experiments, build and evaluate state-of-the-art models (including RAG pipelines , finetuning and evaluation frameworks), and turn insights into scalable product features. Most importantly, you'll play a key role in bridging cutting-edge research and production, using data to inform decisions, drive iteration, and deliver measurable impact at scale. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond .

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

india

On-site

The Data and Artificial Intelligence team in Microsoft's Developer Division, part of newly formed CoreAI group, is working on the future of AI for Developers! We are a diverse, entrepreneurial and multi-disciplinary group of scientists, researchers and engineers with passion for using AI to improve the productivity of millions of developers around the world. We have released AI for code advancements in Github Copilot, VS Code, Visual Studio and other Microsoft Copilots. We are looking for a passionate and growth-oriented Senior Applied Scientist to join our team working on high-impact projects at the intersection of AI and software engineering . As an Applied Scientist, you will play a pivotal role in development and application of advanced data science techniques-especially those involving LLMs-to real-world developer workflows. You'll collaborate closely with data scientists, engineers, researchers, and product teams across Microsoft and GitHub to shape the next generation of AI-powered developer workflows . You will have the opportunity to design and lead experiments, build and evaluate state-of-the-art models (including RAG pipelines , finetuning and evaluation frameworks), turning insights into scalable product features. Most importantly, you'll play a key role in bridging cutting-edge research and production, using data to inform decisions, drive iteration, and deliver measurable impact at scale. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

india

On-site

The Data and Artificial Intelligence team in Microsoft's Developer Division, part of newly formed CoreAI group, is working on the future of AI for Developers! We are a diverse, entrepreneurial and multi-disciplinary group of scientists, researchers and engineers with passion for using AI to improve the productivity of millions of developers around the world. We have released AI for code advancements in Github Copilot, VS Code, Visual Studio and other Microsoft Copilots. We are looking for a passionate and growth-oriented Applied Scientist II to join our team working on high-impact projects at the intersection of AI and software engineering . As an Applied Scientist, you will play a pivotal role in development and application of advanced data science techniques-especially those involving LLMs-to real-world developer workflows. You'll collaborate closely with data scientists, engineers, researchers, and product teams across Microsoft and GitHub to shape the next generation of AI-powered developer workflows . You will have the opportunity to design and lead experiments, build and evaluate state-of-the-art models (including RAG pipelines , finetuning and evaluation frameworks), turning insights into scalable product features. Most importantly, you'll play a key role in bridging cutting-edge research and production, using data to inform decisions, drive iteration, and deliver measurable impact at scale. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

NTT DATA is looking for an AI Data Scientist - Digital Engineering Sr. Engineer to join their team in Hyderabad, Telangana (IN-TG), India. As an AI Data Scientist, your primary focus will be on Generative AI & LLMs. You will be responsible for designing and developing AI/ML models, specifically concentrating on LLMs such as GPT, LLaMA, Mistral, Falcon, and Claude. Your role will involve applying engineering techniques like fine-tuning and transfer learning to customize models for enterprise use cases. Additionally, you will work with vector databases and retrieval-augmented generation (RAG) pipelines for contextual response generation. Collaboration is key in this role, as you will work closely with data engineers, AI Engineers, and MLOps teams to deploy models in production environments. NTT DATA is a trusted global innovator of business and technology services, serving 75% of the Fortune Global 100. They are committed to helping clients innovate, optimize, and transform for long-term success. With experts in over 50 countries and a robust partner ecosystem, NTT DATA provides a wide range of services including business and technology consulting, data and artificial intelligence, industry solutions, and the development, implementation, and management of applications, infrastructure, and connectivity. As one of the leading providers of digital and AI infrastructure globally, NTT DATA is part of the NTT Group, investing over $3.6 billion annually in R&D to support organizations and society in confidently transitioning into the digital future. If you are an exceptional, innovative, and passionate individual looking to be part of an inclusive and forward-thinking organization, apply now to join NTT DATA.,

Posted 2 weeks ago

Apply

3.0 - 12.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Line of Service Advisory Industry/Sector Not Applicable Specialism Data, Analytics & AI Management Level Senior Associate Job Description & Summary At PwC, our people in data and analytics focus on leveraging data to drive insights and make informed business decisions. They utilise advanced analytics techniques to help clients optimise their operations and achieve their strategic goals. In business intelligence at PwC, you will focus on leveraging data and analytics to provide strategic insights and drive informed decision-making for clients. You will develop and implement innovative solutions to optimise business performance and enhance competitive advantage. Why PWC At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes forour clients and communities. This purpose-led and values-driven work, powered by technology in an environment that drives innovation, will enable you to make a tangible impact in the real world. We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life. Together, we grow, learn, care, collaborate, and create a future of infinite experiences foreach other. Learn more about us. At PwC, we believe in providing equal employment opportunities, without any discrimination on the grounds of gender, ethnic background, age, disability, marital status, sexual orientation, pregnancy, gender identity or expression, religion or other beliefs, perceived differences and status protected by law. We strive to create an environment where each one of our people can bring their true selves and contribute to their personal growth and the firms growth. To enable this, we have zero tolerance for any discrimination and harassment based on the above considerations. Job Description & Summary: A career within Data and Analytics services will provide you with the opportunity to help organisations uncover enterprise insights and drive business results using smarter data analytics. We focus on a collection of organisational technology capabilities, including business intelligence, data management, and data assurance that help our clients drive innovation, growth, and change within their organisations in order to keep up with the changing nature of customers and technology. We make impactful decisions by mixing mind and machine to leverage data, understand and navigate risk, and help our clients gain a competitive edge. Responsibilities: Build and deploy machine learning models for classification, regression, NLP, and computer vision use cases Conduct data exploration, feature engineering, model training, tuning, and performance evaluation Collaborate with data engineers to develop robust data pipelines and integrate models into production systems Work closely with client and deliver projects in the AI/ML space Contribute to reusable assets, accelerators, and internal capability building Develop and deploy AI solutions on cloud platforms (Azure/ AWS/GCP) using MLOps and containerization Mandatory skill sets: 3+ years of experience in AI/ML engineering, with experience in working in a cloud environment Proficiency in Python, PyTorch/TensorFlow, and GenAI/Agentic frameworks Hands-on experience with cloud-native AI services (Azure ML, AWS SageMaker, GCP Vertex AI) Familiarity with vector databases (e.g., FAISS, Pinecone), RAG pipelines, and prompt engineering Strong understanding of autonomous agents, orchestration, and multi-agent systems Excellent communication and stakeholder management skills across geographies Preferred skill sets: Certifications in cloud platforms and AI/ML technologies Exposure to enterprise-grade AI use cases in BFSI, healthcare, retail, or manufacturing Knowledge of Responsible AI, data governance, and compliance frameworks Exposure to Agentic AI, Gen AI Years of experience required: 3 to 12 years Education qualification: BE, B.Tech, ME, M,Tech, MBA, MCA (60% above) Education (if blank, degree and/or field of study not specified) Degrees/Field of Study required: Master Degree, Bachelor Degree Degrees/Field of Study preferred: Certifications (if blank, certifications not specified) Required Skills Data Science Optional Skills Accepting Feedback, Accepting Feedback, Active Listening, Analytical Thinking, Business Case Development, Business Data Analytics, Business Intelligence and Reporting Tools (BIRT), Business Intelligence Development Studio, Communication, Competitive Advantage, Continuous Process Improvement, Creativity, Data Analysis and Interpretation, Data Architecture, Database Management System (DBMS), Data Collection, Data Pipeline, Data Quality, Data Science, Data Visualization, Embracing Change, Emotional Regulation, Empathy, Inclusion, Industry Trend Analysis + 16 more Desired Languages (If blank, desired languages not specified) Travel Requirements Available for Work Visa Sponsorship Government Clearance Required Job Posting End Date Show more Show less

Posted 2 weeks ago

Apply

0.0 years

0 Lacs

noida, uttar pradesh, india

On-site

Technology & Architecture Design, develop, and implement scalable, secure, and resilient backend systems using FastAPI, Flask, Django, and microservices. Write clean, efficient, and maintainable code, with strong emphasis on hands-on development and troubleshooting. Implement cloud-native solutions using Azure and/or AWS, with focus on CI/CD pipelines, containerization, monitoring, and observability. Build and integrate Generative AI and Agentic AI capabilities into backend services (RAG pipelines, vector databases, LangChain, HuggingFace, Azure OpenAI). Ensure adherence to security, compliance, and data integrity standards. Problem Solving & Hands-On Contribution Take ownership of technical problems and deliver practical, scalable solutions. Perform code reviews, debugging, performance tuning, and system optimization. Collaborate with senior engineers and architects to translate high-level designs into production-ready implementations. Continuously evaluate and adopt emerging technologies for improving backend and AI systems. Leadership & Collaboration Mentor junior engineers by providing technical guidance, pair programming, and knowledge sharing. Work closely with product managers, architects, and cross-functional teams to align on requirements and deliverables. AI & Innovation Build and maintain AI-enabled applications using Python and Node.js. Experiment with and implement prompt engineering, fine-tuning, and AI automation workflows. Stay up to date on latest trends in Agentic AI, autonomous systems, and data pipelines to propose innovative solutions. Delivery & Operational Excellence Ensure timely development of backend and AI solutions with high code quality and reliability. Follow and enforce engineering best practices: code reviews, testing, documentation, and design principles. Monitor and improve system KPIs such as uptime, latency, throughput, and model accuracy.

Posted 2 weeks ago

Apply

6.0 - 12.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About the Role We are looking for a passionate and experienced Software Engineer (E5 / E6 level) to join our Enterprise Search team, which is at the core of redefining how users discover and interact with information across Whatfix's digital adoption platform. This is a unique opportunity to solve deep information retrieval and search relevance challenges using scalable infrastructure, cutting-edge NLP, and Generative AI. As an engineer at this level, you'll be expected to operate with strong ownership, lead cross-team technical initiatives, and influence design choices that directly impact user experience and business outcomes. What You'll Do As a senior engineer, you will: Build a 0-to-1 Enterprise Search product with a strong focus on scalability, performance, and relevance. Lead proof-of-concept efforts to validate ideas quickly and align with business goals. Architect and implement robust, maintainable, and scalable systems for indexing, querying, and ranking. Develop data pipelines, implement automation for reliability, and ensure strong observability and monitoring. Work closely with Product Managers and Designers to translate user needs into data-driven, intuitive search experiences. Guide and support junior engineers through code reviews, technical direction, and best practices. Collaborate with cross-functional teams (data, platform, infra) to deliver cohesive and high-impact solutions. What We're Looking For Must-Have Skills: Familiarity with LLMs, RAG pipelines, or knowledge graph integrations. Deep expertise in information retrieval, search engines (Lucene, Elasticsearch, Solr). Experience with vector search, embeddings, and/or neural ranking models (e.g., BERT, Sentence Transformers). Strong programming skills in Java, Python, or Go. Familiarity with scalable data processing frameworks (e.g., Spark, Kafka, Flink). Good understanding of system design, APIs, caching, and performance tuning. Nice-to-Have: Experience with enterprise content connectors (SharePoint, Confluence, Jira, etc.). Experience working in a SaaS, B2B, or product-first environment. Qualifications 6-10+ years of experience building backend systems, infrastructure, or AI platforms at scale. Proven ability to own and deliver complex features independently, collaborate across teams, and mentor peers in a fast-paced environment. Demonstrated experience leading initiatives with significant technical and organizational impact - from setting direction to aligning stakeholders and driving execution.

Posted 2 weeks ago

Apply

3.0 - 4.0 years

6 - 7 Lacs

noida

Work from Office

* Design develop deploy full stack AI solns using RAG pipelines, LLMS, Python, ML frameworks, TensorFlow, PyTorch, Hugging Face, APIs, LangChain, or LlamaIndex.Experience with OpenAI APIs, LangChain, or PhD in AI/ML /researcher in AI/ML MNC/IIT

Posted 2 weeks ago

Apply

5.0 - 8.0 years

18 - 27 Lacs

bengaluru

Work from Office

Job Title: Senior / Lead DevOps Engineer (GenAI) Level: A4 & A5 (Experience: 5.1 9 Years) Key Responsibilities Design, build, and maintain CI/CD pipelines for GenAI model training and deployment. Automate infrastructure provisioning & scaling using Terraform, Ansible, Pulumi . Optimize GPU/TPU utilization and monitor model performance in production. Integrate GenAI workflows (LLM fine-tuning, embedding generation, RAG pipelines) into production environments. Collaborate with ML Engineers, Data Scientists, and Platform teams to ensure reproducibility & traceability of models/datasets. Implement observability for GenAI workloads – logging, monitoring, alerting . Ensure security, compliance, and cost optimization across cloud/hybrid environments. Required Skills & Qualifications 5+ years of DevOps / MLOps experience, with 1–2 years in GenAI-focused environments. Strong proficiency in Docker, Kubernetes, Terraform, Jenkins/GitLab CI, Helm . Hands-on experience with LLMs, transformers, Hugging Face, LangChain, OpenAI (or similar) . Expertise in cloud platforms (AWS/GCP/Azure) – especially AI/ML services like SageMaker, Vertex AI, Azure ML. Experience with monitoring tools (Prometheus, Grafana, ELK). Familiar with ML lifecycle tools – MLflow, Weights & Biases, DVC. Strong scripting skills in Python, Bash, or Go . Preferred Qualifications Experience with RAG pipelines , LLMOps, or orchestration platforms (LangServe, BentoML, FastAPI). Knowledge of enterprise ML security standards . Exposure to prompt management tools and GenAI deployment practices. Soft Skills Strong problem-solving and ownership mindset. Ability to lead architecture & design discussions for GenAI infrastructure. Excellent communication and collaboration across AI, data, and DevOps teams. What You’ll Get Opportunity to shape and scale GenAI infrastructure in production. Exposure to state-of-the-art models and tools . Competitive compensation, flexible working environment & growth opportunities. Role & responsibilities

Posted 3 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

Join us at Barclays in the role of Strategic Adoption AI Engineer, where you will be tasked with enhancing existing processes, reporting, and controls to ensure flawless execution of BAU. Your responsibilities will include driving efficiencies, implementing process improvements, and standardizing processes across SBUs wherever feasible. At Barclays, we not only anticipate the future but also actively participate in creating it. To excel in this role, you should possess the following skills: - Proficient programming abilities in Python and hands-on experience with ML libraries such as scikit-learn, TensorFlow, and PyTorch. - Familiarity with automation tools like Jenkins, GitHub Actions, or GitLab CI/CD for streamlining ML pipelines. - Strong knowledge of Docker and Kubernetes for facilitating scalable deployments. - Extensive experience with various AWS services like SageMaker, Bedrock, Lambda, CloudFormation, Step Functions, S3, and IAM. - Managing infrastructure for training and inference using AWS S3, EC2, EKS, and Step Functions. - Expertise in Infrastructure as Code tools like Terraform, AWS CDK. - Familiarity with model lifecycle management tools such as MLflow, SageMaker Model Registry. - Solid understanding of applying DevOps principles to ML workflows. Additionally, valuable skills may include: - Experience with Snowflake and Databricks for collaborative ML development and scalable data processing. - Knowledge of data engineering tools like Apache Airflow, Kafka, Spark. - Understanding of model interpretability, responsible AI, and governance. - Contributions to open-source MLOps tools or communities. - Strong leadership, communication, and cross-functional collaboration skills. - Understanding of data privacy, model governance, and regulatory compliance in AI systems. - Exposure to LangChain, Vector DBs like FAISS, Pinecone, and retrieval-augmented generation (RAG) pipelines. Your assessment may focus on critical skills essential for success in this role, including risk and controls, change and transformation, business acumen, strategic thinking, and digital and technology expertise. This position is located at our Pune office. **Purpose of the role:** To design, develop, and enhance software solutions using various engineering methodologies to provide business, platform, and technology capabilities for our customers and colleagues. **Accountabilities:** - Develop and deliver high-quality software solutions using industry-aligned programming languages, frameworks, and tools. Ensure scalability, maintainability, and performance optimization of the code. - Collaborate with product managers, designers, and engineers to define software requirements, devise solution strategies, and integrate them seamlessly with business objectives. - Engage in peer collaboration, participate in code reviews, and promote a culture of code quality and knowledge sharing. - Stay updated on industry technology trends, contribute to organizational technology communities, and foster technical excellence and growth. - Adhere to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions. - Implement effective unit testing practices to ensure proper code design, readability, and reliability. **Assistant Vice President Expectations:** - Provide advice and influence decision-making, contribute to policy development, and take responsibility for operational effectiveness. Collaborate closely with other functions/business divisions. - Lead a team in performing complex tasks, using professional knowledge and skills to impact the entire business function. Set objectives, coach employees, appraise performance, and determine reward outcomes. - Demonstrate clear leadership behaviours to create an environment for colleagues to thrive and deliver excellent results. The four LEAD behaviours are: Listen and be authentic, Energise and inspire, Align across the enterprise, Develop others.,

Posted 3 weeks ago

Apply

2.0 - 6.0 years

0 Lacs

noida, uttar pradesh

On-site

As an AI Full Stack Developer at CodeSpire Solutions India Pvt. Ltd., you will play a crucial role in designing, developing, and maintaining scalable full-stack applications using the MERN stack, which includes React.js, Node.js, MongoDB, and Express.js. Your responsibilities will also involve integrating and optimizing AI/LLM solutions such as OpenAI SDK, LLaMA, and RAG for enhanced performance. You will be tasked with building and managing cloud-native applications on AWS/Azure, implementing CI/CD pipelines, and ensuring backend performance optimization, API efficiency, and AI token usage. Additionally, you will participate in client-facing sales calls and technical discussions with international clients, primarily from the US and EU, while collaborating with cross-functional teams to develop AI-powered solutions for various sectors including finance, ed-tech, and enterprise SaaS. To excel in this role, you should possess strong technical skills in frontend technologies like React.js, TypeScript, Tailwind, and Bootstrap, as well as backend technologies such as Node.js, Express.js, RESTful APIs, and JWT authentication. Proficiency in databases like MongoDB and Mongoose, along with experience in AI/ML tools like OpenAI SDK, LLM fine-tuning, RAG pipelines, and Prompt Engineering, will be highly beneficial. You are expected to leverage your expertise in cloud and DevOps platforms like AWS (EC2, S3, API Gateway), Azure, GitHub Actions, and Terraform, while utilizing tools like JIRA, Confluence, GitHub, and Agile workflow for efficient project management. Strong communication skills are essential for engaging with clients and conducting technical demos effectively. The ideal candidate for this position should have a minimum of 2 years of full-stack development experience with exposure to AI/LLM-based projects, a solid understanding of integrating AI models into real-world applications, and prior experience in handling international clients, preferably from the US. A bachelor's degree in Computer Science, IT, or a related field is required, along with a willingness to relocate to Noida and work from the office at least 4 days a week. Independent work capability and the ability to lead client discussions are also valued traits. At CodeSpire, we offer a competitive salary package with performance-based bonuses, an opportunity to work on cutting-edge AI projects for global clients, exposure to US/EU client interactions and international SaaS markets, and a fast-track career growth in a scaling AI startup. If you are ready to take on this exciting challenge, apply now by sending your resume to hr@codespiresolutions.com.,

Posted 3 weeks ago

Apply

2.0 - 6.0 years

0 Lacs

pune, maharashtra

On-site

At NiCE, we believe in pushing boundaries and challenging ourselves constantly. We are a team of ambitious individuals who are dedicated to being game changers and always strive to emerge victorious. If you are someone who shares our passion for excellence, we have an exciting career opportunity that will ignite a spark within you. The role revolves around the development of the next-generation advanced analytical cloud platform within Actimize's AI and Analytics Team. This platform aims to leverage data to enhance the accuracy of our clients" Financial Crime programs. As a part of the PaaS/SaaS development group, your responsibilities will include crafting this platform for Actimize's cloud-based solutions and working with cutting-edge cloud technologies. NICE Actimize stands as the leading provider of financial crime, risk, and compliance solutions for financial institutions globally. We value the contributions of every employee as crucial to our company's growth and triumph. To attract top talent worldwide, we offer a dynamic work environment, competitive compensation, benefits, and promising career prospects. Join us to share, learn, and grow in a challenging yet enjoyable setting within a rapidly expanding and esteemed organization. The primary objective of this role is to develop and execute advanced analytics projects comprehensively, covering aspects such as data collection, preprocessing, model development, evaluation, and deployment. You will be tasked with designing and implementing predictive and generative models to derive actionable insights from complex datasets. Your expertise in statistical techniques and quantitative analysis will be pivotal in identifying trends, patterns, and correlations in the data. Collaborating with Product, Engineering, and domain SMEs, you will translate business issues into analytical solutions. Moreover, mentoring junior data scientists, advocating best practices in code quality, experimentation, and documentation, is an integral part of this role. To qualify for this position, you should hold a Bachelor's degree in Computer Science, Statistics, Mathematics, or a related field, with a preference for an advanced degree (Master's or Ph.D.). You must possess 2-4 years of hands-on experience in data science and machine learning, including a minimum of 2 years in Generative AI development. Proficiency in programming languages like Python or R, along with experience in data manipulation and analysis libraries, is essential. A strong grasp of machine learning techniques, algorithms, LLMs, NLP techniques, and evaluation methods for generative outputs is required. Your problem-solving skills, ability to work independently and collaboratively, excellent communication, and leadership qualities are key attributes we seek. Preferred qualifications include prior experience in finance or banking industries, familiarity with cloud computing platforms, knowledge of data visualization tools, and a track record of contributions to the data science community. Hands-on experience with vector databases and embedding techniques would be advantageous. Join NiCE as we disrupt the market with our innovative solutions and global presence. Be part of a team that thrives in a fast-paced, collaborative, and creative environment, where learning and growth opportunities are endless. If you are passionate, innovative, and eager to excel, you might just be the perfect addition to our team. Experience NiCE-FLEX, our hybrid work model that offers maximum flexibility - 2 days in the office and 3 days of remote work each week. Office days focus on face-to-face interactions, fostering teamwork, collaborative thinking, and a vibrant atmosphere that sparks innovation and new ideas. If you are ready to embark on a rewarding journey with NiCE, apply now and seize the opportunity to work with the best of the best in a company that leads the market in AI, cloud, and digital domains. Requisition ID: 8296 Reporting into: Tech Manager Role Type: Data Scientist About NiCE: NICELtd. (NASDAQ: NICE) is a global leader in software products, serving over 25,000 businesses worldwide, including 85 of the Fortune 100 companies. Our solutions deliver exceptional customer experiences, combat financial crime, and ensure public safety. NiCE software manages more than 120 million customer interactions and monitors over 3 billion financial transactions daily. Renowned for our innovation in AI, cloud, and digital technologies, NiCE has a workforce of over 8,500 employees across 30+ countries and is consistently acknowledged as the market leader in its domains.,

Posted 3 weeks ago

Apply

6.0 - 11.0 years

25 - 30 Lacs

hyderabad, pune

Work from Office

Preferred candidate profile We are seeking a highly skilled and motivated Senior Data Scientist with deep expertise in Generative AI , Machine Learning , Deep Learning , and advanced Data Analytics . The ideal candidate will have hands-on experience in building, deploying, and maintaining end-to-end ML solutions at scale, preferably within the Telecom domain. You will be part of our AI & Data Science team, working on high-impact projects ranging from customer analytics, network intelligence, churn prediction, to generative AI applications in telco automation and customer experience. Role & responsibilities Key Responsibilities: Design, develop, and deploy advanced machine learning and deep learning models for Telco use cases such as: Network optimization Customer churn prediction Usage pattern modelling Fraud detection GenAI applications (e.g., personalized recommendations, customer service automation) Lead the design and implementation of Generative AI solutions (LLMs, transformers, text-to-text/image models) using tools like OpenAI, Hugging Face, LangChain, etc. Collaborate with cross-functional teams including network, marketing, IT, and business to define AI-driven solutions. Perform exploratory data analysis , feature engineering, model selection, and evaluation using real-world telecom datasets (structured and unstructured). Drive end-to-end ML solution deployment into production (CI/CD pipelines, model monitoring, scalability). Optimize model performance and latency in production, especially for real-time and edge applications. Evaluate and integrate new tools, platforms, and AI frameworks to advance Vis data science capabilities. Provide technical mentorship to junior data scientists and data engineers. Required Qualifications & Skills: 6 to 9 years of industry experience in Machine Learning, Deep Learning, and Advanced Analytics. Strong hands-on experience with GenAI models and frameworks (e.g., GPT, BERT, Llama, LangChain, RAG pipelines). Proficiency in Python , and libraries such as scikit-learn, TensorFlow, PyTorch, Hugging Face Transformers , etc. Experience in end-to-end model lifecycle management , from data pre-processing to production deployment (MLOps). Familiarity with cloud platforms like AWS, GCP, or Azure; and ML deployment tools (Docker, Kubernetes, MLflow, FastAPI, etc.). Strong understanding of SQL , big data tools (Spark, Hive), and data pipelines. Excellent problem-solving skills with a strong analytical mindset and business acumen. Prior experience working on Telecom datasets or use cases is a strong plus. Preferred Skills: Experience with vector databases , embeddings , and retrieval-augmented generation (RAG) pipelines. Exposure to real-time ML inference and streaming data platforms (Kafka, Flink). Knowledge of network analytics , geo-spatial modelling , or customer behaviour modelling in a Telco environment. Experience mentoring teams or leading small AI/ML projects.

Posted 3 weeks ago

Apply

3.0 - 5.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About Cognite Embark on a transformative journey with Cognite, a global SaaS forerunner in leveraging AI and data to unravel complex business challenges through our cutting-edge offerings including Cognite Atlas AI, an industrial agent workbench, and the Cognite Data Fusion (CDF) platform. We were awarded the 2022 Technology Innovation Leader for Global Digital Industrial Platforms & Cognite was recognized as 2024 Microsoft Energy and Resources Partner of the Year. In the realm of industrial digital transformation, we stand at the forefront, reshaping the future of Oil & Gas, Chemicals, Pharma and other Manufacturing and Energy sectors. Join us in this venture where AI and data meet ingenuity, and together, we forge the path to a smarter, more connected industrial future. Learn More About Cognite Here Cognite Product Tour 2024 Cognite Product Tour 2023 Data Contextualization Masterclass 2023 Our values Impact : Cogniters strive to make an impact in all that they do. We are result-oriented, always asking ourselves. Ownership : Cogniters embrace a culture of ownership. We go beyond our comfort zones to contribute to the greater good, fostering inclusivity and sharing responsibilities for challenges and success. Relentless : Cogniters are relentless in their pursuit of innovation. We are determined and deliverable (never ruthless or reckless), facing challenges head-on and viewing setbacks as opportunities for growth. The Role As an AI/ML Engineer in the ATLAS AI Co-Innovation team, you will help push the technical boundaries of whats possible with industrial GenAI. Youll design and optimize advanced AI models and agent architectures that interact with complex, real-world industrial data. Youll operate at the technical core of customer-facing coinnovation, working closely with solution engineers, product teams, and customer data to build smart, scalable AI components that power next-generation industrial workflows This role demands strong AI/ML engineering skills, deep curiosity, and the ability to adapt cutting-edge research into usable, high-impact solutions. Responsibilities End-to-End Prototyping Build cross-stack prototypes using ATLAS AI, CDF, and open-source AI frameworks to solve real customer challenges. Agent Workflow Design Design and implement multi-agent workflows that combine LLMs, tool use, and reasoning over industrial data. Tech Exploration & Integration Evaluate and integrate new GenAI tools, open-source frameworks, and APIs into ATLAS AI workflows. System Optimization Benchmark performance, tune retrieval and reasoning pipelines, and ensure scalability in real-world industrial deployments. Collaboration & Co-Innovation Work with solution engineers and customer teams to align models and agent behaviors with business value and industrial constraints. What Were Looking For - Must-Have Skills 3+ years of experience in AI/ML engineering, with hands-on delivery of models. Proficiency in working with foundation models (LLMs), including :Prompt engineering, evaluation, and (when relevant) fine-tuning. RAG pipelines and integration with knowledge bases or vector databases. Strong Python skills with experience using frameworks such as LangChain, Transformers, or similar. Understanding of cloud-native development, model training workflows, and ML pipeline orchestration (e.g., data labeling, feature selection, model retraining). Proven ability to write clean, maintainable, and scalable code, following engineering best practices for testing, version control, and review. A maker mindset with bias toward rapid iteration, showing rather than telling, and learning by doing. Bonus Skills Experience with Cognite Data Fusion (CDF). Experience integrating AI workflows with time series, asset hierarchies, or knowledge graphs. Deep learning or traditional ML background (e.g., model architecture selection, hyperparameter tuning, evaluation pipelines). Understanding of industrial data types (e.g., time series, contextual events, industrial knowledge graphs). Experience labeling industrial datasets, including annotation strategies and working with imperfect or sparse labels. Join the global Cognite community! ???? Join an organization of 70 different nationalities ???? with Diversity, Equality and Inclusion (DEI) in focus ???? Office location Rathi Legacy (Rohan Tech Park ) Hoodi (Bengaluru) A highly modern and fun working environment with sublime culture across the organization, follow us on Instagram @ cognitedata ???? to know more Flat structure with direct access to decision-makers, with minimal amount of bureaucracy Opportunity to work with and learn from some of the best people on some of the most ambitious projects found anywhere, across industries Join our HUB ????? to be part of the conversation directly with Cogniters and our partners. Hybrid work environment globally Why choose Cognite ???? ???? Join us in making a real and lasting impact in one of the most exciting and fastest-growing new software companies in the world. We have repeatedly demonstrated that digital transformation, when anchored on strong DataOps, drives business value and sustainability for clients and allows front-line workers, as well as domain experts, to make better decisions every single day. We were recognized as one of CNBC&aposs top global enterprise technology startups powering digital transformation ! And just recently, Frost & Sullivan named Cognite a Technology Innovation Leader ! ???? Most recently Cognite Data Fusion Achieved Industry First DNV Compliance for Digital Twins ???? Apply today! If you&aposre excited about the opportunity to work at Cognite and make a difference in the tech industry, we encourage you to apply today! We welcome candidates of all backgrounds and identities to join our team. We encourage you to follow us on Cognite LinkedIn ; we post all our openings there. Show more Show less

Posted 3 weeks ago

Apply

3.0 - 6.0 years

6 - 12 Lacs

gurugram

Hybrid

Build Gen AI applications using LLMs (OpenAI, LLaMA,Falcon) Develop (RAG) pipelines with vector databases such as ChromaDB, Pinecone, LanceDB. Implement & optimize prompt engineering, embeddings, & semantic search Fine-tune and adapt pre-trained LLMs Required Candidate profile Strong programming skills in Python LangChain, Transformers (Hugging Face, BERT, GPT models) Vector databases (Chroma, Pinecone, LanceDB, Weaviate, FAISS) Deep Learning, Neural Networks, NLP technique

Posted 3 weeks ago

Apply

9.0 - 14.0 years

30 - 45 Lacs

noida, south goa, hyderabad

Hybrid

Job Title: GenAI Architect Experience Required: 10+ years in software architecture or machine learning roles, including at least 2+ years focused on Generative AI or Large Language Models (LLMs). Role Overview We are seeking a GenAI Architect to lead the design, development, and deployment of cutting-edge Generative AI solutions. This role requires deep expertise in LLMs, advanced AI architectures, and the ability to build scalable and production-ready systems leveraging the latest in AI technologies. Key Responsibilities Architect and implement GenAI solutions, including LLMs, RAG pipelines, embedding models, and vector search . Design and optimize prompt engineering strategies, fine-tuning approaches , and mitigate hallucinations in AI outputs. Develop robust AI workflows using Python, LangChain, Hugging Face Transformers , and vector databases such as Pinecone or Weaviate . Integrate and leverage GenAI APIs (OpenAI, Azure OpenAI, Anthropic, Google Vertex AI). Deploy AI applications in cloud-native environments (AWS, GCP, Azure) using Kubernetes and containerized architectures . Lead model evaluation, performance tuning, and ensure scalable architecture for enterprise use cases. Collaborate with business stakeholders to translate complex AI concepts into actionable solutions and value-driven outcomes. Required Skills & Qualifications 10+ years of experience in software architecture or machine learning, with 2+ years in GenAI/LLM-focused roles . Strong expertise in: LLMs and RAG pipelines Prompt engineering, fine-tuning, and hallucination mitigation LangChain, Hugging Face Transformers, Python Vector databases (Pinecone, Weaviate, etc.) Proficiency in deploying solutions on AWS/GCP/Azure with Kubernetes. Experience with model evaluation frameworks and industry best practices. Ability to communicate AI strategies and outcomes to non-technical stakeholders. Preferred Qualifications Hands-on experience with LLMOps tools (Weights & Biases, MLflow, Trulens, PromptLayer). Knowledge of domain-specific AI applications (real estate, finance, risk modeling). Exposure to multi-modal GenAI (text, image, document, voice). Contributions to open-source projects, research publications, or patents in AI/ML.

Posted 3 weeks ago

Apply

9.0 - 14.0 years

30 - 45 Lacs

pune, bengaluru, mumbai (all areas)

Hybrid

Job Title: GenAI Architect Experience Required: 10+ years in software architecture or machine learning roles, including at least 2+ years focused on Generative AI or Large Language Models (LLMs). Role Overview We are seeking a GenAI Architect to lead the design, development, and deployment of cutting-edge Generative AI solutions. This role requires deep expertise in LLMs, advanced AI architectures, and the ability to build scalable and production-ready systems leveraging the latest in AI technologies. Key Responsibilities Architect and implement GenAI solutions, including LLMs, RAG pipelines, embedding models, and vector search . Design and optimize prompt engineering strategies, fine-tuning approaches , and mitigate hallucinations in AI outputs. Develop robust AI workflows using Python, LangChain, Hugging Face Transformers , and vector databases such as Pinecone or Weaviate . Integrate and leverage GenAI APIs (OpenAI, Azure OpenAI, Anthropic, Google Vertex AI). Deploy AI applications in cloud-native environments (AWS, GCP, Azure) using Kubernetes and containerized architectures . Lead model evaluation, performance tuning, and ensure scalable architecture for enterprise use cases. Collaborate with business stakeholders to translate complex AI concepts into actionable solutions and value-driven outcomes. Required Skills & Qualifications 10+ years of experience in software architecture or machine learning, with 2+ years in GenAI/LLM-focused roles . Strong expertise in: LLMs and RAG pipelines Prompt engineering, fine-tuning, and hallucination mitigation LangChain, Hugging Face Transformers, Python Vector databases (Pinecone, Weaviate, etc.) Proficiency in deploying solutions on AWS/GCP/Azure with Kubernetes. Experience with model evaluation frameworks and industry best practices. Ability to communicate AI strategies and outcomes to non-technical stakeholders. Preferred Qualifications Hands-on experience with LLMOps tools (Weights & Biases, MLflow, Trulens, PromptLayer). Knowledge of domain-specific AI applications (real estate, finance, risk modeling). Exposure to multi-modal GenAI (text, image, document, voice). Contributions to open-source projects, research publications, or patents in AI/ML.

Posted 3 weeks ago

Apply

3.0 - 5.0 years

0 Lacs

mumbai, maharashtra, india

On-site

About the Role We are seeking a Agentic AI Developer with 35 years of total software/AI experience and proven hands-on work in Agentic AI . The ideal candidate has built LLM-powered agents using frameworks like LangChain, AutoGen, CrewAI, or Semantic Kernel, and can design, deploy, and optimize autonomous AI systems for real-world business use cases. Key Responsibilities Architect, build, and deploy LLM-driven agents that can plan, reason, and execute multi-step workflows. Work with agent orchestration frameworks (LangChain, AutoGen, CrewAI, Semantic Kernel, Haystack, etc.). Develop and maintain tools, APIs, and connectors for extending agent capabilities. Implement RAG pipelines with vector databases (Pinecone, Weaviate, FAISS, Chroma, etc.). Optimize prompts, workflows, and decision-making for accuracy, cost, and reliability . Collaborate with product and engineering teams to design use-casespecific agents (e.g., copilots, data analysts, support agents). Ensure monitoring, security, and ethical compliance of deployed agents. Stay ahead of emerging trends in multi-agent systems and autonomous AI research . Required Skills 35 years of professional experience in AI/ML, software engineering, or backend development . Demonstrated hands-on experience in building agentic AI solutions (not just chatbots). Proficiency in Python (TypeScript/JavaScript is a plus). Direct experience with LLM APIs (OpenAI, Anthropic, Hugging Face, Cohere, etc.). Strong knowledge of vector databases and embeddings . Experience integrating APIs, external tools, and enterprise data sources into agents. Solid understanding of prompt engineering and workflow optimization . Strong problem-solving, debugging, and system design skills. Nice to Have Experience with multi-agent systems (agents collaborating on tasks). Prior contributions to open-source agentic AI projects . Cloud deployment knowledge ( AWS/GCP/Azure ) and MLOps practices. Background in reinforcement learning or agent evaluation . Familiarity with AI safety, monitoring, and guardrails . What We Offer Work on cutting-edge AI agent projects with direct real-world impact. Collaborative environment with strong emphasis on innovation & experimentation . Competitive salary and growth opportunities. Opportunity to specialize in one of the fastest-growing areas of AI . Show more Show less

Posted 3 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies