Jobs
Interviews

2 Asr Systems Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

noida, uttar pradesh

On-site

As a Speech Architect, you will play a crucial role in leading the development of cutting-edge speech recognition and processing systems, with a specific focus on tasks such as speaker diarization, automatic speech recognition (ASR), sentiment/emotion recognition, and transcription. Your responsibilities will involve guiding a team of engineers and fostering close collaboration with other departments to deliver impactful solutions. You will be tasked with leading and mentoring a team of speech engineers, providing them with technical guidance and ensuring the successful execution of projects. Additionally, you will be responsible for architecting and designing end-to-end speech processing pipelines, from data acquisition to model deployment, with a keen emphasis on scalability, efficiency, and maintainability. In your role, you will develop and implement advanced machine learning models for speech recognition and speaker diarization, leveraging state-of-the-art techniques like deep learning, transfer learning, and ensemble methods. Moreover, you will engage in research activities to explore new methodologies and tools within the speech processing domain, with a focus on publishing findings and presenting at industry conferences. Continuously monitoring and optimizing system performance will be a key aspect of your responsibilities, with a particular emphasis on enhancing accuracy, reducing latency, and optimizing resource utilization. Collaboration with product management, data science, and software engineering teams will also be essential in defining project requirements and delivering innovative solutions. Your interactions with customers will involve understanding their unique needs and providing tailored speech solutions, as well as troubleshooting and optimizing deployed systems. Establishing and enforcing best practices for code quality, documentation, and model management within the team will also be a critical part of your role. Qualified candidates for this position should hold a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field, along with at least 5 years of experience in speech processing, machine learning, and model deployment. Proficiency in speech processing frameworks like Wave2vec, Kaldi, HTK, DeepSpeech, and Whisper, as well as experience with NLP, STT, Speech to Speech LLMs, and frameworks like Nvidia NEMO and PyAnnote, is required. Additionally, expertise in Python and machine learning libraries such as TensorFlow, PyTorch, or Keras, along with experience in large-scale ASR systems, speaker recognition, and diarization algorithms, is essential. A strong understanding of neural networks, sequence-to-sequence models, transformers, attention mechanisms, as well as familiarity with NLP techniques and their integration into speech systems, is highly beneficial. Soft skills such as excellent leadership, project management, and communication skills are also necessary for this role. Preferred qualifications include experience with low-latency streaming ASR systems, knowledge of speech synthesis, STT, and TTS systems, as well as experience in multilingual and low-resource speech processing.,

Posted 1 week ago

Apply

5.0 - 9.0 years

0 Lacs

noida, uttar pradesh

On-site

As a Speech Architect, you will lead the development of cutting-edge speech recognition and processing systems, focusing on complex tasks such as speaker diarization, automatic speech recognition (ASR), Sentiment/Emotion recognition, and transcription. You will guide a team of engineers and collaborate closely with other departments to deliver high-impact solutions. Lead and mentor a team of speech engineers, providing technical guidance and ensuring the successful delivery of projects. Architect and design end-to-end speech processing pipelines, from data acquisition to model deployment, ensuring systems are scalable, efficient, and maintainable. Develop and implement advanced machine learning models for speech recognition, speaker diarization, and related tasks using techniques such as deep learning, transfer learning, and ensemble methods. Conduct research to explore new methodologies and tools in the field of speech processing, publish findings, and present at industry conferences. Continuously monitor and optimize system performance, focusing on accuracy, latency, and resource utilization. Work closely with product management, data science, and software engineering teams to define project requirements and deliver innovative solutions. Engage with customers to understand their needs and provide tailored speech solutions, assisting in troubleshooting and optimizing deployed systems. Establish and enforce best practices for code quality, documentation, and model management within the team. Bachelor's, Master's, or Ph.D. in Computer Science, Electrical Engineering, or a related field. 5+ years of experience in speech processing, machine learning, and model deployment, with demonstrated expertise in leading projects and teams. In-depth knowledge of speech processing frameworks like Wave2vec, Kaldi, HTK, DeepSpeech, and Whisper. Experience with NLP, STT, Speech to Speech LLMs, and frameworks like Nvidia NEMO, PyAnnote. Proficiency in Python and machine learning libraries such as TensorFlow, PyTorch, or Keras. Experience with large-scale ASR systems, speaker recognition, and diarization algorithms. Strong understanding of neural networks, sequence-to-sequence models, transformers, and attention mechanisms. Familiarity with NLP techniques and their integration with speech systems. Expertise in deploying models on cloud platforms and optimizing for real-time applications. Excellent leadership and project management skills. Strong communication skills and ability to work cross-functionally. Experience with low-latency streaming ASR systems. Knowledge of speech synthesis, STT (Speech-to-Text), and TTS (Text-to-Speech) systems. Experience in multilingual and low-resource speech processing.,

Posted 1 week ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies