Junior Deep Learning Engineer

1 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Nanonets has a vision to help computers see the world starting with reading and understanding documents. Machine Learning (ML) is no longer a futuristic concept—it's a present-day powerhouse transforming the business landscape. Nanonets is at the forefront of this transformation, offering innovative ML solutions designed to make document related processes faster than ever before.


From automating data extraction processes to enhancing reconciliation, our solutions are designed to revolutionize workflows, optimize operations, and unlock untapped potential for our clients. Our client footprint spans across brands such as Toyota, Boston Scientific, Bill.com and Entergy to name a few enabling businesses across a myriad of industries to unlock the potential of their visual and textual data.


Here's a quick 1-minute intro video.


We recently announced a series B round of $29 million in funding by Accel and are backed by the likes of existing investors including Elevation Capital & YCombinator. This infusion of capital underscores our commitment to driving innovation and expanding our reach in delivering cutting-edge AI solutions to businesses worldwide.


Read about the release here:

https://www.forbes.com/sites/davidprosser/2024/03/12/why-enterprises-are-learning-to-love-nanonets-automation/?sh=6d79ec8f3ca1

https://techcrunch.com/2024/03/12/nanonets-funding-accel-india/amp/


We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.


About the role

The role can be summed up as building and deploying cutting edge generalised deep learning architectures that can solve complex business problems like converting unstructured data into structured format without hand-tuning features/models. You are expected to build state of the art models that are best in the world for solving these problems, continuously experimenting and incorporating new advancements in the field into these architectures.


What we’re looking for

  • Strong Machine Learning concepts.
  • Strong command in low-level operations involved in building architectures like Transformers, Efficientnet, ViT, Faster-rcnn, etc., and experience in implementing those in pytorch/jax/tensorflow.
  • 1-3 years of experience with the latest semi-supervised, unsupervised and few shot architectures in Deep Learning methods in NLP/CV domain.
  • Strong command in probability and statistics.
  • Strong programming skills.
  • Have previously shipped something of significance, either implemented some paper or made significant changes in an existing architecture etc.


Ideal candidate should have the following skillset

  • Python
  • Tensorflow
  • Experience building and deploying systems
  • Experience with Theano/Torch/Caffe/Keras all useful
  • Experience Data warehousing/storage/management would be a plus
  • Experience writing production software would be a plus
  • The ideal candidate should have developed their own DL architectures apart from using open source architectures.
  • Ideal candidate would have extensive experience with computer vision applications.


Interesting Projects Other DL Engineers Have Completed

  • Setting New Standards:

    Through our Automation Benchmark, we are defining how AI systems are measured on grounding, reliability, and performance.
  • Proven Adoption:

    Our Nanonets-OCR-S model on Hugging Face has already

    ~225,000 downloads

    , validating its global impact and utility.
  • Global Recognition:

    Our research and open-source contributions are recognized by leading voices in AI (example).
  • Enterprise-Ready AI:

    Our models don’t just output predictions - they provide

    grounded answers with confidence scores

    to enable trustworthy decision-making.
  • Agentic OCR Systems:

    Unlike traditional OCR, our models are

    agentic

    - capable of reasoning about inputs, adapting to task context, and chaining multiple steps to deliver structured, actionable data.
  • VLM + LLM Innovation:

    From text to vision-language, we are solving alignment, hallucination reduction, and cross-modal understanding at scale -

    leveraging the latest techniques like RLHF, PEFT, and advanced fine-tuning

    to push what’s possible.


Key Responsibilities

  • Understand specific customer requirements, develop and apply SOTA GenAI solutions to their workflows
  • Develop and fine-tune OCR and Vision Language Models (text detection, recognition, entity extraction, layout understanding).
  • Build and maintain data pipelines for documents, including cleaning, augmentation, and annotation.
  • Implement and evaluate document parsing solutions for invoices, receipts, IDs, contracts, forms, etc.
  • Work with LLMs/VLMs to enhance document understanding and enable intelligent reasoning over documents.
  • Collaborate with senior engineers to deploy models in production with scalable APIs and workflows.
  • Track and improve accuracy, robustness, and latency using proper evaluation metrics


Qualifications

Must-Have:

  • 1–3 years of experience in Machine Learning / AI Engineering/ Deep Learning
  • Strong programming skills in Python and familiarity with PyTorch or TensorFlow.
  • Experience with data preprocessing, training, and evaluation for vision or NLP tasks.
  • Experience working with LLMs or multimodal models (Hugging Face transformers, Nanonets-OCR-S, Qwen-VL, LLaMa).
  • Knowledge of REST APIs, Docker, Git, Kubernetes and basic cloud deployment (AWS/GCP/Azure).
  • Good understanding of ML fundamentals (supervised learning, evaluation metrics, error analysis)
  • Basic understanding of agentic AI workflows (document reasoning, confidence scoring, grounding).
  • Have previously shipped something of significance, either implemented some paper or made significant changes in an existing architecture etc.
  • Strong problem-solving and analytical skills

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You