Posted:2 weeks ago|
Platform:
On-site
Contractual
● Help machines talk like humans.
● Drive improvements not just through compute or algorithms — but through high-quality, diverse, real-world data.
This is not a “just another internship.” You’ll be directly contributing to the global race to perfect speech AI:
○ Design and run evaluations for ASR and speech-to-speech systems.
○ Fine-tune speech recognition systems (like Whisper/wav2vec2) to push
Word Error Rates toward ~5%.
○ Experiment with multilingual, code-switched, and noisy speech to mimic real-world conditions.
● Final-year undergraduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related fields.
● Hands-on experience in one or more of:
○ Fine-tuning speech or language models (Whisper, wav2vec2, HuBERT, SER, etc.)
○ Building speech-driven projects (assistants, classifiers, chatbots, SER systems)
○ Working with PyTorch, TensorFlow, or Hugging Face transformers.
● Bonus: past projects on GitHub, Kaggle, or research papers.
speech-to-speech models are developed. These are not side projects: the problems you’ll work on may define how billions of people interact with machines in the future.
Josh Talks
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
gurugram, haryana, india
Experience: Not specified
Salary: Not disclosed
gurugram, haryana, india
Experience: Not specified
Salary: Not disclosed