Job
Description
Job Title: Sr Data Scientist Organization: Living Things Pvt. Ltd Location: IIT Bombay, Powai, Mumbai Job Type: Full-Time Experience Level: Mid-Level (3+ years experience) About Us: Living Things is a pioneering IoT platform by iCapotech Pvt Ltd, dedicated to accelerating the net zero journey towards a sustainable future. We bring mindfulness in energy usage by our platform. Our solution seamlessly integrates with existing air conditioners, empowering businesses & organisations to optimise & reduce energy usage, enhance operational efficiency, reduce carbon footprints, and drive sustainable practices. Analysis of Electricity consumption across all locations from Electricity Bills. By harnessing the power of real-time data analytics and intelligent insights, our energy saving algorithm helps in saving a minimum of 15% on Air Conditioner’s energy consumption. About the Role: We are seeking a highly skilled and passionate Data Scientist to join our team who will play a pivotal role in developing and deploying cutting-edge solutions, particularly in the domain of document extraction using State of the Art Large Language Models (LLMs) and Retrieval-augmented generation (RAG). Job Responsibilities: Document Extraction using LLMs: Design and develop robust document entity extraction models using State of the Art LLMs. Fine-tune LLMs for specific document extraction tasks. Evaluate model performance and optimize for accuracy, efficiency, and scalability. RAG Development: Development of RAG pipelines for Question answering based on documents Agentic RAG pipelines for communicating with the database Apply advanced machine learning and deep learning algorithms to solve complex data-driven problems. LLM Ops: Implement and maintain robust LLM Ops pipelines for finetuning training and monitoring. Develop and implement strategies for continuous model improvement and retraining. Collaboration & Communication: Effectively communicate technical concepts to both technical and non-technical audiences. Collaborate with cross-functional teams to ensure successful project delivery. Skills and Qualifications: Essential: 3+ years of hands-on experience in developing and deploying machine learning models. Experience with fine-tuning and deploying LLMs. 3+ years of experience with building RAG pipelines. Strong proficiency in Python and SQL. Deep understanding of machine learning and deep learning concepts. Experience with Docker and Git. Preferred: Master's degree or PhD in Computer Science, Data Science, Statistics, or a related field. Proficiency with using open state-of-the-art technologies for RAG components, including: Efficient vector databases: Faiss, Milvus, Qdrant, Weaviate Dense retrieval methods: DPR (Dense Passage Retriever), ColBERT, ANCE Knowledge graph embeddings: RDF2Vec, TransE, RotatE LLM integrations: Hugging Face Transformers, OpenAI API Search engines: Elasticsearch, Solr Experience with cloud computing platforms (e.g., Google Cloud Platform, AWS, Azure). Strong understanding of natural language processing (NLP) techniques. Excellent communication and presentation skills. Show more Show less