Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in Bengaluru
>
Sarvam
>
Machine Learning Engineer

Machine Learning Engineer

Sarvam

3 - 8 years

9 - 13 Lacs

Bengaluru

Posted:7 months ago| Platform:

Apply

Skills Required

Computer vision Backend Prototype GIT Machine learning Debugging Open source SQL Python

Work Mode

Work from Office

Job Type

Full Time

Job Description

Machine Learning Engineer Computer Vision VLM Full Time Engineering On-Site Bengaluru, Karnataka, India Machine Learning Engineer Computer Vision Vision Language Models (VLMs) About Sarvam AI Sarvam.ai is a pioneering generative AI startup headquartered in Bengaluru, India. We are dedicated to transformative R D in language technologies, building scalable and efficient Large Language Models (LLMs) that serve a wide spectrum of languages especially Indic languages. Our mission is to re imagine human computer interaction and craft novel AI driven solutions that make language technology inclusive for diverse communities worldwide. Role Overview As a Machine Learning Engineer (MLE) in the Vision Language team, you will build and refine vision, OCR, and language models for varied use cases. Your work will span research, scalable training, and rigorous evaluation of cutting edge computer vision and VLM systems. Key Responsibilities Model R D Prototype and fine tune state of the art vision architectures and vision language models. Design and evaluate multimodal fusion strategies for robust image text understanding. Data Training Pipelines Build distributed pipelines (PySpark / Ray) to curate and preprocess large scale multimodal datasets (images, geospatial rasters, PDFs, video frames, captions). Implement efficient training loops in PyTorch/Lightning with mixed precision, gradient accumulation, and multi GPU ( 4) parallelism. Domain Focused Applications Develop models for geospatial analysis, Indic document intelligence (OCR + layout), visual question answering (VQA), and broader computer vision use cases. Evaluation Benchmarking Define and automate task specific metrics for OCR accuracy, retrieval, dense captioning, and VQA; maintain regression dashboards and ablation suites. Required Qualifications Experience : 2 3 years in ML engineering with emphasis on classical computer vision and modern vision language models. Education : Bachelor s or Master s in Computer Science, AI/ML, or related fields. Technical Skills Strong Python PyTorch; comfortable with CUDA profiling and tensor debugging. Hands on experience training CV models (CNNs, ViTs) and/or VLMs on 4 GPU nodes. Proven ability to build, deploy, and monitor pipelines for OCR, object detection, and segmentation. Solid grasp of computer vision fundamentals (detection, segmentation, representation learning) and transformer mechanics. Software Engineering Fundamentals : Proficiency with Git, unit tests, structured logging, Docker, and CI/CD. Ability to select and integrate appropriate databases (SQL, NoSQL, vector stores) for large scale multimodal data. Experience designing scalable backend APIs/micro services (gRPC/REST), including monitoring and observability best practices. Preferred Qualifications Publications or submissions in CVPR/ICCV/ECCV, EMNLP, ACL. Prior work on multilingual or low resource vision language tasks. Experience with data centric AI (active learning, synthetic augmentation). Contributions to open source vision/NLP libraries (Hugging Face, OpenCV, Detectron2). Familiarity with distributed schedulers (KubeFlow, Slurm). Made with

More Jobs at Sarvam

Machine Learning Engineer

Bengaluru

3 - 8 yrs

INR 9 - 13 Lacs

AI Researcher / Research Engineer

Bengaluru

3 - 8 yrs

INR 10 - 14 Lacs

Software Engineer - Backend Systems

Bengaluru

3 - 8 yrs

INR 12 - 15 Lacs

Frontend Engineer

Bengaluru

1.0 - 2.0 yrs

INR 14 - 18 Lacs

Customer Success Lead - Sarvam AI

Bengaluru

5.0 - 10.0 yrs

INR 13 - 17 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Machine Learning Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Sarvam

Information Technology

Bangalore

Login to

Please Verify Your Phone or Email

Confirm Action

Machine Learning Engineer