Baner, Pune, Maharashtra, India
Department
CoE- AI & Data
Job posted on
Oct 13, 2025
Employment type
Permanent
Position: Architect AI & ML
Experience- 8+ Years
Job Location- Pune
We are seeking a dynamic and experienced leader to drive the growth of our AI practice with a focus on Generative AI, advanced NLP solutions, and Large Language Models (LLMs). This role is ideal for a seasoned professional who combines technical expertise with exceptional leadership skills, customer-facing experience, and a vision for scaling teams and capabilities.
Skills & Qualifications
Technical Expertise:
- Extensive experience with NLP techniques and multi-class/multi-label text classification.
- Hands-on experience fine-tuning private LLMs (e.g., LLaMA, Gemma).
- Proficiency in PyTorch, Hugging Face, LangChain, Haystack, and related frameworks.
- Strong knowledge of model orchestration tools like MLFlow or KubeFlow.
- Familiarity with RAG techniques and vector stores (e.g., Pinecone, ChromaDB).
- Strategic Mindset:
- Visionary thinking with the ability to translate emerging AI trends into actionable business strategies.
Good to have:
- Experience with multi-modal AI models (e.g., image-text, text-audio models).
- Expertise in Databricks for scalable model development and deployment.
- Knowledge of Explainability AI tools (e.g., Captum) for interpretable models.
Key Responsibilities
Leadership & Growth:
- Lead, mentor, and grow a high-performing AI team, fostering innovation and collaboration.
- Develop and execute strategies to expand the practice and deliver measurable business value to clients.
Customer Engagement:
- Serve as a confident and articulate interface with clients, ensuring clear communication of AI strategies, solutions, and outcomes.
- Build trusted partnerships with clients, understanding their needs and aligning solutions to their goals.
AI Solution Development:
- Design and implement state-of-the-art NLP solutions, focusing on multi-class and multi-label text classification.
- Fine-tune and deploy private LLMs (e.g., LLaMA, Gemma) for tailored business applications.
- Develop Retrieval-Augmented Generation (RAG) pipelines leveraging vector databases like Pinecone or ChromaDB for high-performance solutions.
Operational Excellence:
- Oversee end-to-end model lifecycle management, including training, deployment, and monitoring.
- Integrate explainability into AI models, ensuring transparency and trust in decision-making.
- Collaborate with external LLM providers (e.g., OpenAI, Claude) to enhance and integrate AI capabilities.
Leadership & Communication:
- Proven ability to lead and inspire teams, with a track record of scaling AI practices.
- Exceptional communication and presentation skills, capable of engaging diverse stakeholders confidently.
Additional Skills
Experience with multi-modal AI models (e.g., image-text, text-audio models).
Expertise in Databricks for scalable model development and deployment.
Knowledge of Explainability AI tools (e.g., Captum) for interpretable models.