2 DeepSpeech Jobs

JobPe aggregates listings for easy access; you apply directly on the original job portal.

2.0 - 4.0 years

9 - 19 Lacs

Bengaluru

Work from Office

Job Title: AI Audio Developer
Location: Bangalore
Job Category: Regular
Joining: Immediate (max 30 days)

Job Description
We are looking for a highly motivated and skilled AI Audio/Speech Developer to join our team in Bangalore. The ideal candidate should be capable of working independently and possess strong expertise in building AI and Deep Learning (DL) models from scratch. The role demands the ability to translate research papers into working code and contribute to cutting-edge audio AI solutions.

Experience
2-4 years of experience in development, with 2+ years of experience in audio and ML/DL.
Training, fine-tuning, and optimization of different flavors of transformer models.
Hands-on experience with LLMs, including fine-tuning and optimization.

Desired Skills
Have:
Hands-on experience with ASR frameworks like Kaldi, DeepSpeech, or Wav2Vec.
Knowledge of acoustic models, language models, and their integration.
Experience working with pre-trained models such as Wav2Vec 2.0, HuBERT, or Whisper.
Experience with speech corpora and dataset preparation for ASR training and evaluation.
Knowledge of model optimization techniques for real-time ASR applications.
Working experience with LLM fine-tuning, optimization, and performance improvement.

Good to Have:
AI audio-related certification.
ASR experience.

Posted 6 days ago

5.0 - 9.0 years

0 Lacs

Noida, Uttar Pradesh

On-site

As a Speech Architect, you will lead the development of cutting-edge speech recognition and processing systems, focusing on complex tasks such as speaker diarization, automatic speech recognition (ASR), sentiment/emotion recognition, and transcription. You will guide a team of engineers and collaborate closely with other departments to deliver high-impact solutions.

Lead and mentor a team of speech engineers, providing technical guidance and ensuring the successful delivery of projects.
Architect and design end-to-end speech processing pipelines, from data acquisition to model deployment, ensuring systems are scalable, efficient, and maintainable.
Develop and implement advanced machine learning models for speech recognition, speaker diarization, and related tasks using techniques such as deep learning, transfer learning, and ensemble methods.
Conduct research to explore new methodologies and tools in the field of speech processing, publish findings, and present at industry conferences.
Continuously monitor and optimize system performance, focusing on accuracy, latency, and resource utilization.
Work closely with product management, data science, and software engineering teams to define project requirements and deliver innovative solutions.
Engage with customers to understand their needs and provide tailored speech solutions, assisting in troubleshooting and optimizing deployed systems.
Establish and enforce best practices for code quality, documentation, and model management within the team.

Bachelor's, Master's, or Ph.D. in Computer Science, Electrical Engineering, or a related field.
5+ years of experience in speech processing, machine learning, and model deployment, with demonstrated expertise in leading projects and teams.
In-depth knowledge of speech processing frameworks like Wav2Vec, Kaldi, HTK, DeepSpeech, and Whisper.
Experience with NLP, STT, speech-to-speech LLMs, and frameworks like NVIDIA NeMo and pyannote.
Proficiency in Python and machine learning libraries such as TensorFlow, PyTorch, or Keras.
Experience with large-scale ASR systems, speaker recognition, and diarization algorithms.
Strong understanding of neural networks, sequence-to-sequence models, transformers, and attention mechanisms.
Familiarity with NLP techniques and their integration with speech systems.
Expertise in deploying models on cloud platforms and optimizing for real-time applications.
Excellent leadership and project management skills.
Strong communication skills and ability to work cross-functionally.
Experience with low-latency streaming ASR systems.
Knowledge of speech synthesis, STT (Speech-to-Text), and TTS (Text-to-Speech) systems.
Experience in multilingual and low-resource speech processing.

Posted 1 week ago
