Technology Architect

5 - 9 years

0 Lacs

Posted: 9 hours ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:
You will be responsible for designing scalable GenAI systems such as RAG pipelines and multi-agent systems. You will choose between hosted APIs and open-source models, and architect hybrid systems that combine LLMs with traditional software.

Key Responsibilities:
- Benchmark models such as GPT-4, Claude, Mistral, and LLaMA.
- Understand trade-offs among latency, cost, accuracy, and context length.
- Design enterprise-grade RAG systems using vector databases such as Pinecone, Weaviate, and Qdrant with LangChain or LlamaIndex.
- Implement security, privacy, and governance measures, including data privacy, access control, and audit logging.
- Optimize and monitor GenAI inference costs using observability tools such as Arize, WhyLabs, and PromptLayer.
- Fine-tune open-source models using PEFT, LoRA, and QLoRA, and perform knowledge distillation to produce smaller, faster models.
- Design multi-agent systems and agent workflows using frameworks such as AutoGen, CrewAI, and LangGraph.
- Integrate LLMs with external tools, APIs, and databases, and manage tool routing effectively.
- Identify use cases with high ROI, balance feasibility, desirability, and viability, and lead GenAI proofs of concept and MVPs.
- Mentor and upskill teams by training developers on prompt engineering and LangChain, and establish GenAI best practices through code reviews and internal hackathons or innovation sprints.

Qualifications Required:
- Strong understanding of AI solution architecture and of model evaluation and selection.
- Proficiency in designing scalable GenAI systems and enterprise-grade RAG systems.
- Experience implementing security, privacy, and governance measures.
- Ability to optimize and monitor GenAI inference costs effectively.
- Advanced technical skills in model fine-tuning, distillation, and multi-agent system design.
- Experience in team and product leadership, including mentoring and upskilling teams.

Cognizant

IT Services and IT Consulting

Teaneck, New Jersey
