Jobs
Interviews

2 Pretraining Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

7.0 - 11.0 years

0 Lacs

karnataka

On-site

As a Generative AI Solution Architect at NVIDIA, you will leverage your expertise in training Large Language Models (LLMs) and implementing workflows based on Pretraining, Finetuning LLMs & Retrieval-Augmented Generation (RAG). Your role will involve architecting cutting-edge solutions that harness the power of NVIDIA's generative AI technologies. You must possess a deep understanding of language models, especially open source LLMs, and excel in designing and implementing RAG-based workflows. Your responsibilities will include collaborating with customers to identify language-related business challenges and tailor solutions, supporting pre-sales activities by delivering technical presentations and demonstrations, engaging with NVIDIA engineering teams to provide feedback, and working directly with customers/partners to understand their requirements and challenges. You will lead workshops and design sessions to define generative AI solutions focused on LLMs and RAG workflows, and train and optimize Large Language Models using NVIDIA's hardware and software platforms. To qualify for this role, you should hold a Master's or Ph.D. in Computer Science, Artificial Intelligence, or have equivalent experience. Additionally, you must have at least 7 years of hands-on experience in a technical AI role, with a strong emphasis on training LLMs and a proven track record of deploying and optimizing LLM models for production environments. Your expertise should cover state-of-the-art language models like GPT-3, BERT, or similar architectures, and you should be proficient in training and fine-tuning LLMs using frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers. Moreover, you should possess proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, particularly GPUs. Your knowledge of GPU cluster architecture and parallel processing will be crucial for accelerated model training and inference. Strong communication and collaboration skills are essential for articulating complex technical concepts to diverse audiences and leading workshops and training sessions. To further distinguish yourself, experience in deploying LLM models in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure, optimizing LLM models for inference speed and resource efficiency, familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes), and a deep understanding of GPU cluster architecture and distributed computing concepts will be advantageous. Join NVIDIA, a technology leader offering competitive salaries, a comprehensive benefits package, and a dynamic work environment. If you are a creative engineer passionate about technology and seeking an opportunity to work with some of the brightest minds in the industry, we invite you to apply and be part of our growing engineering teams.,

Posted 1 week ago

Apply

5.0 - 9.0 years

0 Lacs

noida, uttar pradesh

On-site

As a Senior Applied Researcher at Genloop, you will lead cutting-edge work in small language model (SLM) training, LLM customization, and domain adaptation. We believe that domain intelligence is crucial to bring GenAI into enterprise production, especially in cases where a 1-year employee outperforms a 1st-day hire significantly. Your responsibilities will include designing and conducting experiments, customizing LLMs for specific enterprise domains, evaluating model performance, collaborating with engineering and product teams, contributing to research documentation and open-source releases, and staying updated on LLM architectures and pretraining objectives. Additionally, you will mentor junior researchers and provide guidance to the broader team based on insights from the frontier. To qualify for this role, you should have at least 5 years of experience in ML research or applied deep learning, with a focus on NLP, foundation models, or multi-modal systems. A deep understanding of transformers, language modeling, and sequence generation is required, along with hands-on experience in pretraining, SFT, RLHF, and efficient fine-tuning methods. Strong publications or open-source contributions would be advantageous. Graduation from a Tier 1 institution or demonstrating exceptional skills are preferred. Genloop is a research-first AI company that specializes in building customized, continuously learning AI systems. Our team comprises researchers and engineers from prestigious institutions and tech companies, working on models with real-world, production-grade impact. In terms of compensation and benefits, we offer a competitive salary, meaningful startup equity, and industry-leading benefits. The exact compensation will be based on your experience, expertise, and location. Genloop is an Equal Opportunity Employer that values diversity and is dedicated to creating an inclusive and respectful workplace for all.,

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies