Jobs
Interviews

1 Llama3 Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

8.0 - 12.0 years

0 Lacs

karnataka

On-site

You are a highly skilled and motivated AI Tech Lead sought by Rakuten, a global internet service company offering a wide range of services to approximately 1.4 billion customers worldwide. Your primary responsibility will be to develop and scale next-generation AI-driven platforms, particularly e-commerce catalog systems, by integrating cutting-edge AI technologies such as Large Language Models (LLMs). To excel in this role, you should possess a strong background in backend engineering, Artificial Intelligence (AI), Data Science, and Machine Learning (ML), with a focus on designing, developing, and deploying end-to-end ML pipelines and AI-driven solutions. This position requires a combination of technical leadership, backend development, and AI/ML expertise to create data-intensive solutions for various business use cases. As the AI Tech Lead, your key responsibilities will include: Leading the design, development, and deployment of scalable, production-grade platforms for AI-driven applications, particularly e-commerce catalog operations. Defining architectural standards for backend systems to ensure scalability, fault tolerance, and maintainability. Providing mentorship to the engineering team to foster technical excellence and best practices. Driving cross-functional collaboration to align technical solutions with business objectives. Designing and implementing solutions utilizing Large Language Models (LLMs) such as OpenAI GPT APIs, LLAMA2/LLAMA3, or similar frameworks. Fine-tuning and optimizing LLMs for performance, accuracy, and relevance in applications like conversational AI, catalog optimization, and personalization. Building and maintaining Retrieval-Augmented Generation (RAG) systems and vectorized databases for enhanced AI capabilities. Collaborating with data science teams to deploy AI/ML models for predictive analytics, intent classification, Named Entity Recognition (NER), and other NLP tasks. Developing and maintaining robust backend systems using Java, Python, and frameworks like Spring Boot, FastAPI, Django, and Flask. Creating APIs and microservices to handle high volumes of data in real-time with low latency. Optimizing platform performance to support large-scale catalogs and global user traffic. Ensuring high availability and fault tolerance for critical operations. Working with data engineers to design and optimize data pipelines, ensuring the availability and quality of training datasets. Developing scalable systems to process and store large volumes of data using big data technologies like Hadoop, Spark, or Kafka. Leveraging advanced tools like MLflow, Prefect, and AWS Sagemaker for efficient model training, deployment, and monitoring. Deploying and managing backend services and ML models on cloud platforms like AWS, GCP, or Azure. Setting up and optimizing CI/CD pipelines for seamless deployments and updates. Monitoring system and model performance to ensure smooth operations and rapid troubleshooting. Utilizing containerization tools like Docker and Kubernetes for scalable deployments. Partnering with product management, business development, and engineering teams to define platform requirements and AI/ML solutions. Communicating complex technical concepts to non-technical stakeholders effectively. Staying updated on industry trends in AI, ML, and backend development to drive innovation. To qualify for this role, you should have: Minimum 8 years of experience in backend development, with expertise in Java, Python, Spring Boot, microservices, and RESTful APIs. Experience in designing and scaling production-grade platforms for high-volume applications. Proficiency in integrating LLMs like OpenAI GPT, LLAMA2/LLAMA3, and frameworks such as Hugging Face Transformers. Hands-on experience with TensorFlow, PyTorch, or similar tools. Strong knowledge of big data technologies like Hadoop, Spark, or Kafka. Familiarity with RAG systems, vectorized databases, and retrieval-based AI. Proven experience with cloud platforms (AWS, Azure, GCP) and containerization tools like Docker and Kubernetes. Expertise in CI/CD pipelines and system monitoring.,

Posted 2 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies