AI Engineer intern

DeepLure Research

0 years

0 Lacs

india

Posted:1 day ago| Platform:

Apply

Skills Required

ai engineering ml design model inference research prototype test preprocessing rest tooling deployment coding testing linting pytorch cuda budgeting linux docker sharding profiling profiler tuning learning writing code constraints compensation optimization portfolio

Work Mode

Remote

Job Type

Contractual

Job Description

🚀 Role Overview

AI Engineer Intern

🧠 What You’ll Do

Prototype and evaluate ML models
(Diffusion, VAE, Transformers, RLHF and related architectures).
Build and test end-to-end model pipelines
from preprocessing to inference and postprocessing.
Develop an Inference API
around models (REST/gRPC) and deploy to cloud/servers.
Implement input-batching and dynamic batching strategies
to maximize throughput and reduce latency.
Optimize memory usage and GPU performance
, including careful tensor placement, mixed precision, and memory profiling.
Integrate model-serving frameworks
and production tooling; optionally work with NVIDIA Triton for high-performance serving.
Write reliable tests and monitoring
for inference correctness, performance, and resource consumption.
Collaborate with engineers and researchers
to iterate on model improvements and deployment strategies.

🔧 Required Skills

Proficient in Python
with production-grade coding practices (testing, linting, packaging).
Strong PyTorch expertise
, including an understanding of its execution model, autograd, tensors, CUDA contexts, and memory behavior.
Experience building inference pipelines
and deploying model-backed APIs.
Practical GPU knowledge
: memory budgeting, mixed-precision (AMP), CUDA streams, and utilization trade-offs.
Comfort with Linux command line, Docker, and basic cloud concepts
.

✨ Nice-to-Have (Optional)

NVIDIA Triton experience
for model serving.
Familiarity with
HuggingFace Transformers, Diffusers, Accelerate, xformers
.
Experience with
distributed inference, model sharding, or tensor parallelism
.
Background in
profiling tools
(Nsight, nvprof, PyTorch profiler) and memory tracing.
Knowledge of
RLHF
, fine-tuning workflows, or advanced generative modeling tricks.

🎯 Who Should Apply

College students in
1st, 2nd, 3rd, or 4th year
eager to build production machine learning systems.
People who enjoy bridging research and engineering, writing clean code, and iterating quickly.
Problem-solvers who can balance correctness, speed, and resource constraints in real systems.

📍 Location and Time

Remote
role with flexible hours. Must be able to collaborate during reasonable overlap with the team.

💸 Compensation

INR 10,000 – 15,000 per month
.

✅ What We Offer

Real responsibility and a chance to ship end-to-end features.
Mentorship from senior ML engineers and researchers.
Exposure to production-grade model serving, GPU optimization, and cloud deployment.
A portfolio-building internship with measurable impact.

📩 How to Apply

Fill the following Google form https://forms.gle/k3kT98CNDu8urMSK8 before 12th Nov 2025.

More Jobs at DeepLure Research

AI Engineer intern

india

Experience: Not specified

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.