Lead AI Compiler Engineer

3 - 8 years

3 - 8 Lacs

Posted: 3 days ago | Platform: Foundit

Work Mode

On-site

Job Type

Full Time

Job Description

We are looking for an AI Compiler Engineer to join this high-impact team working in the growing field of on-device AI inference acceleration, as either an individual contributor or a technical lead in the AI Group (AIG).

As an AI Compiler Engineer, you will design and optimize the AI compiler stack and tools that enable efficient execution of state-of-the-art open-source and proprietary AI models, such as LLMs and other transformer models, on AMD NPUs for on-device AI inference use cases.

You will work on transforming high-level AI models into efficient, low-level code that can run on the NPU.

Your work will directly impact the performance, efficiency, and scalability of our AI solutions.

SCOPE OF WORK:

  • Operator Fusion:

    Identify and implement performance optimizations that reduce memory traffic through operator fusion at different levels of the memory hierarchy, e.g., within an attention block.
  • Problem Partitioning and Dataflow Orchestration:

    Design algorithms to optimally map a given AI operation onto the NPU, which comprises an interconnected array of AI engines.
  • Design and implement algorithms to orchestrate dataflow through the multi-level memory hierarchy.
  • Kernel Design and Development:

    Design and implement highly optimized C++/intrinsic-based kernels for AI-related operators.
  • Develop vectorized code that leverages SIMD (Single Instruction, Multiple Data) and VLIW (Very Long Instruction Word) for optimal performance.
  • Evaluate tradeoffs among performance, program memory, and accuracy.
  • Testing and Validation:

    Develop CPU models for the ML operators in C++/Python to validate accuracy.
  • Write unit tests and integration tests to ensure correctness and reliability.
  • Performance Profiling and Tuning:

    Profile and analyze the performance of model layers.
  • Identify and alleviate performance and accuracy bottlenecks.
  • Documentation and Collaboration:

    Communicate day-to-day technical work effectively and document design specs.
  • Follow good coding practices, including use of a version control system.
  • Collaborate with cross-functional teams spanning AI research, core architecture, and software engineering.

REQUIRED SKILLS:

  • Excellent C/C++ and Python coding skills.
  • Good understanding of SIMD and VLIW processor architectures.
  • Experience with vectorized programming (SIMD).
  • Thorough understanding of fixed- and floating-point arithmetic.
  • Good understanding of various operators in state-of-the-art AI models.
  • Knowledge of low-level hardware details (cache hierarchy, DMA programming).
  • Excellent problem-solving skills, especially in debugging, and a passion for on-device AI.
  • Past experience with AI compiler design is preferred.
