Home
Jobs

2 Distributed Training Jobs

Filter
Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0.0 - 5.0 years

0 - 12 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

Foundit logo

IBM Research is the innovation and growth engine of the IBM corporation. It is the largest industrial research organization in the world with 12 labs on 6 continents. IBM Research produces more breakthroughsmore than 9 patents are produced every daythan any other organization in the world. IBM employs over 3200 researchers worldwide. IBM Research India (IRL) is the leading industrial research lab in India, shaping the future of computing across AI, Hybrid Cloud and Quantum Computing. IRL has a long legacy of ground-breaking innovation in the areas of computer science and its applications to a wide variety of disciplines and offerings for IBM. IRL researchers are working on projects that are pushing the state of the art across Foundation Models, optimized runtime stacks for FM workloads such as tuning, large scale data engineering and pre-training, multi-accelerator model optimization, agentic workflows and modalities across language, code, time series, IT automation and geospatial. We are strong proponents of open-source community-driven software and model development, and our work spans a wide spectrum from research collaborations with academia to developing enterprise-grade commercial software. Your role and responsibilities Research Engineer position at IBM India Research Lab is a challenging, dynamic and highly innovative role. Some of our current areas of work where we are actively looking for top talent are: Optimized runtime stacks for foundation model workloads including fine-tuning, inference serving and large-scale data engineering, with a focus on multi-stage tuning including reinforcement learning, inference-time compute, and data preparation needs for complex AI systems. Optimizing models to run on multiple accelerators including IBM's AIU accelerator leveraging compiler optimizations, specialized kernels, libraries and tools. Developing use cases that effectively leverage the infrastructure and models to deliver value Pre-training language and multi-modal foundation models working with large scale distributed training procedures, model alignment, creating specialized pipelines for various tasks including effective LLM-generated data pipelines, creating frameworks for collecting human data and deploying models in user-centric platforms. Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise You should have one or more of the following: A master's degree in computer science, AI or related fields from a top institution 0-8 years of experience working with modern ML techniques including but not limited to model architectures, data processing, fine-tuning techniques, reinforcement learning, distributed training, inference optimizations Experience with big data platforms like Ray and Spar Experience working with Pytorch FSDP and HuggingFace libraries Programming experience in one of the following: Python, web development technologies Growth mindset and a pragmatic attitude Preferred technical and professional experience Peer-reviewed research at top machine learning or systems conferences Experience working with pytorch.compile, CUDA, triton kernels, GPU scheduling, memory management Experience working with open-source communities

Posted 1 week ago

Apply

10.0 - 12.0 years

0 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

Foundit logo

The Oracle Global Business Unit (GBU) Generative AI team is responsible for leading Generative AI and Agent needs of business applications serving variety of markets including Finance, Hospitality, Construction and Engineering, Energy & Water etc. Our goal is to enable customers to apply AI to solve their business problems with Oracle's assistance and expertise in Generative AI. In this role, you will have an opportunity to work with teams of applied scientists and engineers to deliver high quality generative ai and agent features that delights our customers with the confidence that their data are safe and protected. Your Opportunity We are seeking a Principal Applied Scientist (IC4) to spearhead Generative AI and Agent use cases that support GBU business applications as well as GBU consulting. As an applied scientist, you will be responsible for driving the development and implementation of cutting-edge technologies.We are building a core talented team specialized in Generative AI. We are looking for candidates who are passionate about building state-of-the-art technologies to solve real-world problems and have a solid technical background in deep learning, especially natural language processing (NLP) and multimodal models, to join this team. You will collaborate with a team of world-class scientists, engineers and product managers.We're looking for a person who will bring a passion for innovative products, strong collaboration skills and the ability to work closely with both development and consulting teams. You'll be a Generative AI expert who is hands-on as well as be adept at evangelizing and influencing multiple stakeholders without direct authority on best practices and to get things done efficiently. Most importantly - we believe in a people-first approach. Our team consists of people from a wide variety of backgrounds, with different professional and life experiences, who support each other to build things the right way and enjoy ourselves while doing it. What we offer Being part of one of the most visionary and mission-driven organizations in Oracle, cooperating with talented peers with diverse backgrounds worldwide. High visibility to senior leadership, as well as technical leaders and partners. Opportunity to build state-of-the-art technologies in large language models and generative AI at scale. Close partnership with product managers and software engineers to deploy Generative AI features into products in various business-critical scenarios. Building performance evaluations of Generative AI systems for continuous improvement of alignment with stakeholders growing expectations. What You'll Do Develop, implement, and optimize large language models and generative AI technologies, including training/finetuning and computation optimizations. Collaborate with software engineers to deploy LLM / Generative AI models and Agents into production environments. Stay up-to-date with the latest advancements in the field of generative AI. Collaborate with cross-functional teams to drive the development and adoption of LLM and generative AI solutions across various organizations in the company. Work directly with key customers and accompany them on their AI journey - understanding their requirements, help them envision and design the right solutions and work together with their engineering and data science team to remove blockers and translate the feedback into actionable items for individual service owners. Design and build solutions and help GBU development teams reach successful pilots, PoCs and feature releases with our AI/Gen AI and DS technologies. Bring back learnings from these engagements to standardize Generative AI and Agent implementations for efficiency, scale and ease of maintenance. Support GBU consulting with re-usable solution patterns and reference solutions / showcases that can apply across multiple customers. Being enthusiastic, self-motivated, and a great collaborator. Lead patent filings and author papers to show innovative enterprise grade developments. Be our product evangelist - engage directly with customers and partners, participate and present in external events and conferences, etc. Qualifications: PhD, MS in computer science, engineering, mathematics or a field related to deep learning. Strong knowledge of ML fundamentals - supervised vs unsupervised modeling, time series, highly unbalanced and noisy data sets, complex feature engineering, recommendation systems, using and optimizing gradient boosting models, NLP, deep learning on all kinds of unstructured data. 5+ (for Senior), 7+ (for Principal), 10+ (for Sr Principal) years of work experience including a minimum of 2-year experience in developing large-scale ML solutions, and in particular deep learning solutions in the NLP field. Proficiency with deep learning frameworks (such as PyTorch or TensorFlow) and deep learning architectures (especially Transformers). Hands-on experience with distributed training of large language models. Strong development experience of deep learning modeling in Python. Familiarity with the latest advancements in LLM and generative AI technologies. Familiarity with engineering best practices, including shared codebase, version control, containerization, etc. Passionate about being a builder and working with talented peers to solve hard problems at scale. Good communication skills to convey technical concepts in straightforward terms with product managers and various stakeholders. Preferred Skills Publications in top-tier deep learning conferences or significant contributions to prominent deep learning repositories Industrial experience in system design, software development, and production deployment Excel in transforming ambiguous requirements into actionable plans with deep learning techniques for problem-solving. First-hand experience with deep reinforcement learning First-hand experience with the latest technologies in LLM and generative AI such as parameter-efficient finetuning and instruction finetuning is a plus Familiarity with the latest advancements in computer vision and multimodal models is a plus Top-tier performance in prestigious deep learning leaderboards or large model-related competitions is a plus. Career Level - IC5 Drives and plans implementation of company policy for achieving business goals. Defines the bar for science practices, and helps teams achieve those goals. Identifies and mitigates risks across full set of systems, particularly at the intersection of business and engineering. Innovate AI and ML powered solutions (rich APIs, ML models and end to end services) with strategic ISVs and customers. Develop deep product intuition to influence future product roadmaps and drive decision making. Clearly articulate technical work to audiences of all levels and across multiple functional areas in both internal and external settings. Engage in forward looking research both internal and with academic institutions globally. Hires and mentors across the org. Perform an active role in team planning, review and retrospective events. Ensures experiments are ready for hand-off to Software Developers ship into production. May perform other duties as assigned.

Posted 3 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies