form :https://forms.gle/ncGqEJrJDvEDhXtL7AI Researcher, Speech & Audio, Intern -Internship Opportunity at JoshTalks AI Lab(ai.joshtalks.com)Location: Gurgaon, IndiaType: Full-time Internship (6–12 months)Who: Final-year engineering students or recent graduates passionate about AI/MLin speech
About Us
At JoshTalks AI Lab, we believe that voice will be the primary medium of interactionbetween man and machine. Our mission is simple yet ambitious:
- Help machines talk like humans.
- Build the benchmarks and datasets that become the backbone of global
progress in speech AI.
- Drive improvements not just through compute or algorithms — but through
high-quality, diverse, real-world data.Our datasets today power some of the largest and most widely used speechmodels in the world (you’ve definitely used them, even if we can’t name them😉).What You’ll Work OnThis is not a “just another internship.” You’ll be directly contributing to the global
Race To Perfect Speech AI
- Benchmarking the world’s speech models
○ Design and run evaluations for ASR and speech-to-speech systems.
○ Create benchmarks that will guide top AI labs on where their modelsfail and where they shine.
○ Fine-tune speech recognition systems (like Whisper/wav2vec2) to push
Word Error Rates toward ~5%.○ Experiment with multilingual, code-switched, and noisy speechto mimic real-world conditions.
○ Your work won’t just sit in a paper. It will influence how the
world’s largest AI models get built, tested, and improved.Who We’re Looking For
- Final-year undergraduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related fields.
- Strong interest in speech, audio, NLP, or multimodal AI.
- Hands-on experience in one or more of:
○ Fine-tuning speech or language models (Whisper, wav2vec2,
HuBERT, SER, etc.)○ Building speech-driven projects (assistants, classifiers, chatbots,SER systems)○ Working with PyTorch, TensorFlow, or Hugging Face transformers.
- Bonus: past projects on GitHub, Kaggle, or research papers.
Why Join Us
- Ownership: Even as a final-year student, you’ll get the chance to own
problems of global importance — from reducing ASR word error rates toward5% to building benchmarks that influence how the next generation of
Speech-to-speech Models Are Developed. These Are Not Side Projects
the problems you’ll work on may define how billions of people interactwith machines in the future.
- Front-row seat in speech AI: Your work will shape benchmarks and datasets
used by the world’s top model labs.
- Learning: Work with experts solving speech challenges across 20+
Indian languages and noisy, real-world audio.
- Impactful projects: The benchmarks and models you help build will
set direction for global AI progress.
- Startup energy, global scale: Small team, big impact — perfect for ambitious
builders.
- Co-Authorship: If any of the work you contribute to is published as a paper,
benchmark report, or dataset release, you will be credited as a co-author.
This means your contributions won’t just stay inside the lab — they’ll bevisible to the wider research community and part of the academic andindustry record.Details
- Location: Gurgaon (on-site preferred for collaboration)
- Duration: 6–12 months
- Type: Paid Internship (full-time)
- Start Date: Flexible for final-year students (aligns with academic calendar)
If you’re someone who dreams of making speech AI as natural as human
conversation, this is your chance to work on the real frontier. Super interested?Skills:- Machine Learning (ML)