Posted: 12 hours ago | Platform: LinkedIn

Work Mode: On-site

Job Type: Contractual

Job Description

AI Engineer — Image‑to‑Video (Mid‑Level)
Location: Mumbai (on‑site/hybrid)
Contract: 6 months, extendable
Start: ASAP
What you’ll do
  • Build, fine‑tune, and ship image‑to‑video generation pipelines (prompt‑to‑video, storyboard‑to‑video, identity‑preserving headshots).
  • Integrate and iterate on SOTA components (Stable Video Diffusion, AnimateDiff, LTX‑Video/13B variants, CogVideo‑X, ControlNet‑style conditioning).
  • Optimize inference for throughput and latency (TorchScript/ONNX, TensorRT, CUDA kernels, xFormers/Flash‑Attention, mixed precision).
  • Handle multi‑GPU training/inference (DDP, gradient checkpointing, sharded weights, efficient sampling).
  • Own dataset curation and augmentation for faces/motion; enforce consent, licensing, and privacy.
  • Build evaluation loops and dashboards (FVD, CLIP/ID‑similarity, temporal consistency, face‑ID retention).
  • Productionize with Docker and CI/CD; wire up tracking (W&B/ClearML) and experiment reproducibility.
  • Collaborate with design and product to convert creative briefs into deployable features and A/B tests.
Must‑have
  • 3–5 years total software/ML experience with 1–2+ years in generative video or diffusion work.
  • Strong Python + PyTorch, Diffusers, and CV fundamentals (spatiotemporal models, sampling).
  • Proven experience with multi‑GPU (DDP/NCCL) and performance profiling on Linux.
  • Solid grasp of FFmpeg, video codecs/bitrates, and post‑processing pipelines.
  • Portfolio: repo(s), demo links, or a short reel showing your image‑to‑video work.
Nice‑to‑have
  • Experience with ComfyUI nodes/graphs, LoRA/ControlNet training, face‑ID preservation, or lip‑sync.
  • Triton kernels, custom schedulers/samplers, quantization (INT8/FP8) for fast inference.
  • MLOps on AWS/GCP/Azure, Kubernetes, vector stores, prompt orchestration.
Tools you’ll touch
  • PyTorch, Diffusers, CUDA, TensorRT/ONNX, xFormers/Flash‑Attention, FFmpeg, Docker, W&B/ClearML, ComfyUI, GitHub Actions.
What we offer
  • Competitive contract compensation (INR, market‑aligned) with extension potential.
  • High‑impact ownership on production creative pipelines.
  • Modern GPU stack and a fast path from prototype → production.
How to apply
  • Email hr@mugshotstudios.com with subject “AI Engineer — Image‑to‑Video (Mumbai)” and include:
    • Resume/CV, links to GitHub and any demos/reels.
    • 3–5 bullet points on your most relevant image→video work.
    • Earliest start date and work authorization status for India.
