Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in gurugram
>
Josh Talks
>
AI Researcher, Speech & Audio, Intern

AI Researcher, Speech & Audio, Intern

Name: Jobpe
Address: T-Hub, Plot No 1/C, Sy No 83/1, Raidurgam panmaktha, Knowledge City Rd, Hyderabad, Telangana, 500081, IN
Telephone: +91-83339-09630
Price range: Free

Josh Talks

0 years

0 Lacs

gurugram haryana india

Posted:3 days ago| Platform:

Apply

Skills Required

ai engineering ml drive algorithms power benchmarking design recognition word multilingual code tuning pytorch tensorflow github research model learning report dataset duration

Work Mode

On-site

Job Type

Contractual

Job Description

AI Researcher, Speech & Audio, Intern - Internship Opportunity at JoshTalks AI Lab (ai.joshtalks.com)

Location: Gurgaon, India

Type: Full-time Internship (6–12 months)

Who: Final-year engineering students or recent graduates passionate about AI/ML in speech

About Us

At JoshTalks AI Lab, we believe that voice will be the primary medium of interaction between man and machine. Our mission is simple yet ambitious:

● Help machines talk like humans.

● Build the benchmarks and datasets that become the backbone of global progress in speech AI.

● Drive improvements not just through compute or algorithms — but through high-quality, diverse, real-world data.

Our datasets today power some of the largest and most widely used speech models in the world (you’ve deﬁnitely used them, even if we can’t name them 😉).

What You’ll Work On

This is not a “just another internship.” You’ll be directly contributing to the global race to perfect speech AI:

1. Benchmarking the world’s speech models

● Design and run evaluations for ASR and speech-to-speech systems.

● Create benchmarks that will guide top AI labs on where their models fail and where they shine.

2. Modeling & Fine-Tuning

● Fine-tune speech recognition systems (like Whisper/wav2vec2) to push Word Error Rates toward ~5%.

● Experiment with multilingual, code-switched, and noisy speech to mimic real-world conditions.

3. Impact at Scale

● Your work won’t just sit in a paper. It will inﬂuence how the world’s largest AI models get built, tested, and improved.

Who We’re Looking For

● Final-year undergraduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related ﬁelds.

● Strong interest in speech, audio, NLP, or multimodal AI.

● Hands-on experience in one or more of:

● Fine-tuning speech or language models (Whisper, wav2vec2, HuBERT, SER, etc.)

● Building speech-driven projects (assistants, classiﬁers, chatbots, SER systems)

● Working with PyTorch, TensorFlow, or Hugging Face transformers.

● Bonus: past projects on GitHub, Kaggle, or research papers.

Why Join Us

● Ownership: Even as a ﬁnal-year student, you’ll get the chance to own problems of global importance — from reducing ASR word error rates toward 5% to building benchmarks that inﬂuence how the next generation of

speech-to-speech models are developed. These are not side projects: the problems you’ll work on may deﬁne how billions of people interact with machines in the future.

● Front-row seat in speech AI: Your work will shape benchmarks and datasets used by the world’s top model labs.

● Learning: Work with experts solving speech challenges across 20+ Indian languages and noisy, real-world audio.

● Impactful projects: The benchmarks and models you help build will set direction for global AI progress.

● Startup energy, global scale: Small team, big impact — perfect for ambitious builders.

● Co-Authorship: If any of the work you contribute to is published as a paper, benchmark report, or dataset release, you will be credited as a co-author. This means your contributions won’t just stay inside the lab — they’ll be visible to the wider research community and part of the academic and industry record.

Details

● Location: Gurgaon (on-site preferred for collaboration)

● Duration: 6–12 months

● Type: Paid Internship (full-time)

● Start Date: Flexible for ﬁnal-year students (aligns with academic calendar)

If you’re someone who dreams of making speech AI as natural as human conversation, this is your chance to work on the real frontier. Super interested? You can also directly write to our founder Shobhit at shobhit@joshtalks.com

To Apply write to hr@joshtalks.com

More Jobs at Josh Talks

Freelance Video Content Creator

Gurugram, Haryana, India

Experience: Not specified

Salary: Not disclosed

Creative Producer (Contractual 3 Months)

Gurugram

1.0 - 4.0 yrs

INR 0 - 0 Lacs

Freelance Video Content Creator

Gurugram, Delhi / NCR

1.0 - 4.0 yrs

INR 0 - 0 Lacs

Data Ops Manager - Images

Gurugram, Haryana, India

5.0 - 5.0 yrs

Salary: Not disclosed

Dataset Manager – Video

Gurugram, Haryana, India

6.0 - 6.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

Josh Talks

E-Learning Providers

Gurgaon Haryana

Login to

Please Verify Your Phone or Email

Confirm Action

AI Researcher, Speech & Audio, Intern