Home
Jobs

4 Speech Processing Jobs

Filter
Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

2.0 - 5.0 years

10 - 20 Lacs

Noida

Work from Office

Naukri logo

What would you do? System Design: Architect and design end-to-end speech processing pipelines, from data acquisition to model deployment. Ensure systems are scalable, efficient, and maintainable. Advanced Modeling: Develop and implement advanced machine learning models for speech recognition, speaker diarization, and related tasks. Utilize state-of-the-art techniques such as deep learning, transfer learning, and ensemble methods. Research and Development: Conduct research to explore new methodologies and tools in the field of speech processing. Publish findings and present at industry conferences. Performance Optimization: Continuously monitor and optimize system performance, focusing on accuracy, latency, and resource utilization. Collaboration: Work closely with product management, data science, and software engineering teams to define project requirements and deliver innovative solutions. Customer Interaction: Engage with customers to understand their needs and provide tailored speech solutions. Assist in troubleshooting and optimizing deployed systems. Documentation and Standards: Establish and enforce best practices for code quality, documentation, and model management within the team. Required Skills 2+ years of experience in speech processing, machine learning, and model deployment. Demonstrated expertise in leading projects and teams. Technical skills: • Excellent knowledge in Python / Java programming. • In-depth knowledge of speech processing frameworks like, Wave2vec, Kaldi, HTK, DeepSpeech and Whisper. • Experience with NLP, STT, Speech to Speech LLMs and frameworks like Nvidia NEMO, PyAnnote. • Proficiency in Python and machine learning libraries such as TensorFlow, PyTorch, or Keras. • Experience with large-scale ASR systems, speaker recognition, and diarization algorithms. • Strong understanding of neural networks, sequence-to-sequence models, transformers and attention mechanisms. • Familiarity with NLP techniques and their integration with speech systems. • Expertise in deploying models on cloud platforms and optimizing for real-time applications. Good to have: • Experience with low-latency streaming ASR systems. • Knowledge of speech synthesis, STT (Speech-to-Text) and TTS (Text-to-Speech) systems. • Experience in multilingual and low-resource speech processing.

Posted 3 weeks ago

Apply

2 - 6 years

4 - 9 Lacs

Jaipur

Work from Office

Naukri logo

Role & responsibilities Linguistic Leadership: Oversee transcription and annotation projects, ensuring high-quality linguistic data for AI/ML applications. Quality Assurance & Guideline Development: Define and implement transcription and annotation best practices to improve AI training datasets. Advanced Linguistic Analysis: Identify dialectal variations, phonetic challenges, and speech disfluencies impacting AI performance. AI Model Training Support: Collaborate with AI engineers, data scientists, and NLP researchers to enhance speech recognition and language models. Cross-Language Expertise: Support multilingual speech data initiatives across Indic languages and English. Research & Innovation: Stay updated on Generative AI, ASR, and NLP trends , bringing linguistic insights to AI model development.

Posted 2 months ago

Apply

2 - 6 years

4 - 9 Lacs

Jaipur

Work from Office

Naukri logo

Role & responsibilities Linguistic Leadership: Oversee transcription and annotation projects, ensuring high-quality linguistic data for AI/ML applications. Quality Assurance & Guideline Development: Define and implement transcription and annotation best practices to improve AI training datasets. Advanced Linguistic Analysis: Identify dialectal variations, phonetic challenges, and speech disfluencies impacting AI performance. AI Model Training Support: Collaborate with AI engineers, data scientists, and NLP researchers to enhance speech recognition and language models. Cross-Language Expertise: Support multilingual speech data initiatives across Indic languages and English. Research & Innovation: Stay updated on Generative AI, ASR, and NLP trends , bringing linguistic insights to AI model development.

Posted 2 months ago

Apply

0 - 3 years

25 - 40 Lacs

Noida

Hybrid

Naukri logo

Required Educational Qualification: PhD/ MTech / BTech with relevant experience from Tier 1 Campus only. Desired Experience: 0 - 4 Years Job Objective: You will be building innovative Machine Learning solutions for vital and complex business problems. Key Responsibilities Design and develop audio processing and deep learning models for speech recognition, sound classification, and voice synthesis. Implement and optimize Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems. Research and apply generative models for audio generation, voice cloning, and synthetic speech creation. Develop algorithms for noise reduction, audio enhancement, and acoustic modeling. Work on speaker recognition, voice identification, and emotion detection using advanced AI techniques. Collaborate with cross-functional teams to integrate audio models into real-time production environments. Build multimodal AI systems that combine audio with text, video, or image data. Stay current with advancements in audio AI research, deep learning, and speech processing technologies. Required Skills and Qualifications Ph.D., Masters degree, Bachelors in Data Science, Computer Science, Statistics, Electrical Engineering, Mathematics or related analytical field. Excellent problem-solving and analytical skills, with a focus on delivering data-driven business solutions audio models. Expertise in audio and speech processing models using deep learning frameworks (TensorFlow, PyTorch, OpenAI Whisper). Proficiency in audio signal processing techniques, spectrogram analysis, and waveform-based modeling. Experience with speech-to-text, audio generation, and voice synthesis systems. Strong programming skills in Python Expertise in data pre-processing, feature engineering, and data modeling techniques with knowledge in Pandas, scikit-learn, matplotlib, numpy, xgboost, dask, scipy, spacy and other relevant libraries. Strong programming skills in Python and other relevant query languages for data manipulation and analysis. Experience with big data technologies such as Hadoop, Spark is plus. Experience working in cross-functional teams and managing multiple stakeholders. Excellent problem-solving, critical thinking, and communication skills. Connect with me on Linked: Prince Narang ( https://www.linkedin.com/in/princenarang/ )

Posted 3 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies