Home
Jobs

Deep Learning Engineer (Computer Vision & Audio Analysis)

3 - 8 years

6 - 10 Lacs

Posted:2 days ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Deep Learning Engineer (Computer Vision & Audio Analysis)
Location: Remote / Client Location
Experience: 3+ Years
Employment Type: Full-Time (Client Deployment via Hubnex Labs)
Role Overview
Join a pioneering AI team to design and deploy cutting-edge deep learning solutions for computer vision and audio analysis. You ll leverage CNNs, Vision Transformers, attention mechanisms, and multi-modal techniques to solve complex real-world challenges in object detection, video processing, and audio classification.
Key Responsibilities
  • Design, develop, and optimize deep learning models for image/video analysis (object detection, segmentation) and audio classification tasks.
  • Implement and fine-tune CNN architectures , Vision Transformers (ViT, Swin), and attention mechanisms (SE, CBAM, self/cross-attention).
  • Process multi-modal data:
  1. Video
    : Apply spatiotemporal modeling (3D CNNs, temporal attention)
  2. Audio
    : Extract features (spectrograms, MFCCs) and build classification pipelines
  • Utilize pretrained models (transfer learning) and multi-task learning frameworks.
  • Optimize models for accuracy, speed, and robustness using PyTorch/TensorFlow.
  • Collaborate with MLOps teams to deploy solutions into production.
Required Skills
  • Programming : Advanced Python (PyTorch/TensorFlow)
  • Computer Vision :
  1. Vision Transformers (ViT, Swin, DeiT)
  2. Object detection (YOLO, SSD, Faster R-CNN, DETR)
  3. Video analysis (temporal modeling)
  • Audio Processing : Feature extraction (MFCCs, spectrograms) and classification
  • Modeling Expertise :
  1. Attention mechanisms (self/cross-attention, SE, CBAM)
  2. Transfer learning and fine-tuning
  3. Training strategies (LR scheduling, early stopping, data augmentation)
  • Experience handling large-scale datasets and building data pipelines.
Preferred Qualifications
  • Exposure to multi-modal learning (combining vision/audio/text)
  • Familiarity with R for statistical analysis
  • Publications or projects in CVPR/NeurIPS/ICML
This role is for a client of Hubnex Labs. Selected candidates will represent Hubnex while working directly with the client s AI team.

Mock Interview

Practice Video Interview with JobPe AI

Start Deep Learning Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Hubnex

27 Jobs

RecommendedJobs for You