Posted: 1 week ago
Work from Office
Full Time
You will push the boundaries of the state of the art in audio and media technologies. The ideal candidate has a strong background in deep learning, in both conceptual understanding and practical experience. A core aspect of this role is keeping up to date with the literature and implementing and innovating at the leading edge of generative models, self-supervised learning, and multi-modal learning.

With the explosion of large language models and natural language processing, you will partner closely with Dolby's worldwide AI research staff, which actively pursues the integration of such models into audio and media experiences. You will be able to hit the ground running, innovate, and contribute to such projects. Consequently, experience with language models, question answering, vision-language models, captioning, etc. would be highly beneficial.

Knowledge of or experience in any or all of the following is also helpful:
Diffusion, autoregressive, or other generative models.
Self-supervised learning, contrastive learning, auto-encoders.
Audio, image, or text applications.
Source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc.

Main responsibilities
Partner closely with other domain experts to refine and execute Dolby's technical strategy in artificial intelligence and machine learning.
Use deep learning to create new solutions (including foundation models) and enhance existing applications.
Push the state of the art and develop intellectual property.
Transfer technology to product groups and draft patent applications.
Advise internal leaders on recent deep learning advancements in industry and academia to further influence research direction and business decisions.

Requirements
Ph.D. in Computer Science or a similar field.
A strong background in deep learning, in both conceptual understanding and practical experience.
Strong knowledge of and interest in audio processing. Knowledge of video or text processing is desirable.
Strong publication record, with publications in major machine learning conferences (e.g., NeurIPS, ICLR, ICML). Publications in top domain-specific conferences (e.g., ACL, CVPR, ICASSP) are desirable.
Good knowledge of the current machine learning literature.
Highly skilled in Python and one or more popular deep learning frameworks (TensorFlow or PyTorch).
Ability to envision new technologies and turn them into innovative products.
Good communication and collaboration skills.