On-site
Full Time
About the Role
We are building the next generation of spatial intelligence, in which robots and 3D systems understand and interact with the world in real time. As a Multimodal LLM Engineer, you will design, train, and deploy vision-language models that understand detected objects, 3D environments, and dynamic scenes. Your work will enable robots and digital tools to reason about objects, context, safety, and actions entirely on-device.
You will collaborate closely with perception, robotics, and systems engineers to combine 3D vision, object detection, and LLM reasoning into a unified real-time intelligence engine.
This is a highly technical role with direct impact on core product capabilities.
Responsibilities
Minimum Qualifications
Job Type: Full-time
Pay: ₹1,500,000.00 - ₹3,000,000.00 per year
O-HIVE