Senior ASR/TTS Specialist - AI Agent Integration Expert
Company: EXL Service
Type: Full-time
Experience: 3+ years
Position Summary
We seek an exceptional Senior ASR/TTS Specialist to lead speech AI initiatives and integrate advanced speech technologies with AI agent frameworks. This role focuses on fine-tuning ASR/TTS models, implementing MLOps best practices, and building production-ready speech AI systems that power next-generation conversational AI agents.
Key Responsibilities
Speech AI Model Development & Integration
- Model Fine-tuning: Customize state-of-the-art ASR/TTS models for domain-specific applications with <300ms latency
- Speech-to-Speech Systems: Build end-to-end S2S pipelines using Amazon Nova Sonic v1.0, Azure OpenAI Realtime (GPT-4o), and Gemini 2.5 Flash Native Audio
- Multi-modal Integration: Develop speech models that integrate with vision and text modalities in AI agents
- Agent Framework Integration: Implement speech capabilities with LangChain/LangGraph, CrewAI, AutoGen, LlamaIndex, and the OpenAI Assistants API (illustrative sketch below)
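As a rough illustration of the kind of integration involved (a minimal sketch under stated assumptions, not EXL's actual stack), the example below wraps open-source Whisper transcription as a LangChain tool. The model size, audio file name, and tool name are placeholder assumptions; a fine-tuned, domain-specific checkpoint would replace the base model in practice.

```python
# Illustrative only: exposes Whisper ASR as a tool an agent framework can call.
# Model size, file name, and tool name are assumptions for this sketch.
import whisper
from langchain_core.tools import tool

# A fine-tuned, domain-specific checkpoint would normally be loaded here;
# "base" is just a placeholder.
_asr_model = whisper.load_model("base")

@tool
def transcribe_audio(audio_path: str) -> str:
    """Transcribe a local audio file to text for downstream agent reasoning."""
    result = _asr_model.transcribe(audio_path)
    return result["text"]

# An agent executor (LangChain, CrewAI, etc.) would receive `transcribe_audio`
# in its tool list; here we only invoke it directly as a smoke test.
if __name__ == "__main__":
    print(transcribe_audio.invoke("sample_call.wav"))
```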
MLOps & Production Engineering
- Model Lifecycle: Implement comprehensive MLOps pipelines using MLflow, Weights & Biases, and automated CI/CD
- Multi-cloud Deployment: Deploy speech models across AWS Bedrock, Google Cloud AI, and Azure Cognitive Services
- Real-time Processing: Build WebSocket-based streaming audio systems handling 1000+ concurrent connections (sketch below)
- Production Monitoring: Implement WER tracking, latency monitoring, and multi-provider failover mechanisms
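A minimal sketch of the streaming-audio pattern, assuming the open-source `websockets` package (recent version): each client pushes raw audio frames over a WebSocket and receives partial results back. The port, chunk handling, and transcriber stub are assumptions for the example, not a production design.

```python
# Minimal WebSocket audio-streaming sketch (assumes a recent `websockets` release).
# The port and the transcriber stub are placeholders.
import asyncio
import websockets

async def fake_streaming_asr(chunk: bytes) -> str:
    # Placeholder for a real streaming ASR call (cloud API or in-process model).
    return f"[{len(chunk)} bytes received]"

async def handle_stream(websocket):
    # Each client streams raw audio frames; partial transcripts are sent back.
    async for chunk in websocket:
        partial = await fake_streaming_asr(chunk)
        await websocket.send(partial)

async def main():
    # A production deployment would add auth, backpressure, and connection limits
    # to reach the 1000+ concurrent-connection target mentioned above.
    async with websockets.serve(handle_stream, "0.0.0.0", 8765):
        await asyncio.Future()  # run forever

if __name__ == "__main__":
    asyncio.run(main())
```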
Research & Development
- Cutting-edge Research: Stay current with the latest speech AI breakthroughs and implement novel architectures
- Performance Optimization: Optimize models for real-time inference using TensorRT, ONNX, and edge deployment (export sketch below)
- Data Pipeline Engineering: Build scalable audio ingestion, preprocessing, and augmentation systems
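For context on the optimization path, here is a hedged sketch of exporting a toy PyTorch acoustic module to ONNX for optimized inference. The module, tensor shapes, and file name are illustrative assumptions; a real ASR encoder would be exported the same way before ONNX Runtime or TensorRT deployment.

```python
# Hedged sketch: export a toy PyTorch module to ONNX for optimized inference.
# The module, shapes, and file name are illustrative assumptions.
import torch
import torch.nn as nn

class TinyAcousticModel(nn.Module):
    """Stand-in for a real ASR encoder; maps 80-dim log-mel frames to logits."""
    def __init__(self, vocab_size: int = 32):
        super().__init__()
        self.proj = nn.Linear(80, vocab_size)

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        return self.proj(mel)

model = TinyAcousticModel().eval()
dummy_mel = torch.randn(1, 200, 80)  # (batch, frames, mel bins)

# Dynamic axes keep batch size and sequence length flexible at inference time.
torch.onnx.export(
    model,
    dummy_mel,
    "acoustic_model.onnx",
    input_names=["mel"],
    output_names=["logits"],
    dynamic_axes={"mel": {0: "batch", 1: "frames"}, "logits": {0: "batch", 1: "frames"}},
)
# The exported graph can then be served with ONNX Runtime or converted for TensorRT.
```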
Required Qualifications
Core Technical Skills (Must-Have)
Speech AI Models (3+ years experience):
- ASR Systems: Amazon Nova Sonic v1.0, Google Speech-to-Text, Azure Speech Services, Whisper, Wav2Vec2, Riva
- TTS Systems: Google TTS, Azure Cognitive Services TTS, ElevenLabs (REST/WebSocket), Tortoise, VITS, FastSpeech2
- Speech-to-Speech: Direct S2S without intermediate text, multimodal audio processing
- Cloud Services: AWS Bedrock Runtime, Google Cloud AI (Gemini API), Azure OpenAI Services
Programming & Frameworks:
- Languages: Expert Python, proficient C++/Rust for optimization
- ML Frameworks: Advanced PyTorch, TensorFlow 2.x, JAX/Flax
- Audio Processing: librosa, torchaudio, soundfile, WebRTC, µ-law/PCM conversion (conversion sketch below)
- Agent Frameworks: Hands-on experience with 3+ of: LangChain, CrewAI, AutoGen, LlamaIndex, OpenAI Assistants
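As a small, hedged example of the µ-law/PCM conversion mentioned above, the sketch below uses torchaudio's companding helpers; the sample values are illustrative (telephony audio is typically 8 kHz, 8-bit µ-law).

```python
# Sketch of µ-law <-> linear PCM conversion with torchaudio; values are illustrative.
import torch
import torchaudio.functional as F

# Pretend this is a decoded waveform normalized to [-1.0, 1.0].
waveform = torch.linspace(-1.0, 1.0, steps=8)

# Encode to 256-level µ-law (the companding used on telephony trunks)...
mu_law = F.mu_law_encoding(waveform, quantization_channels=256)

# ...and decode back to approximately the original linear PCM values.
restored = F.mu_law_decoding(mu_law, quantization_channels=256)

print(mu_law)
print(torch.max(torch.abs(waveform - restored)))  # small quantization error
```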
MLOps & Infrastructure (Essential)
MLOps Tools (2+ years):
- Experiment Management: MLflow, Weights & Biases (tracking sketch below)
- Model Serving: TorchServe, TensorFlow Serving, NVIDIA Triton
- Workflow Orchestration: Apache Airflow, Kubeflow, Prefect
- Containerization: Docker, Kubernetes for speech model deployment
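A minimal sketch of the experiment-tracking workflow with MLflow follows; the experiment name, parameters, and metric values are placeholders for the example, not real project numbers.

```python
# Hedged MLflow tracking sketch; names and values are placeholders.
import mlflow

mlflow.set_experiment("asr-finetuning-demo")

with mlflow.start_run(run_name="whisper-domain-adapt"):
    # Log the hyperparameters that define this fine-tuning run.
    mlflow.log_param("base_model", "whisper-base")
    mlflow.log_param("learning_rate", 1e-5)

    # In a real pipeline these values would come from an evaluation loop.
    mlflow.log_metric("wer", 0.048)
    mlflow.log_metric("p95_latency_ms", 270)

    # Checkpoints and configs could be attached with mlflow.log_artifact(...).
```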
Cloud & Production:
- Multi-cloud Experience: AWS (Bedrock, Nova Sonic), Google Cloud (Gemini, Speech APIs), Azure (OpenAI Services)
- Real-time Systems: Sub-300ms latency, WebSocket architecture, telecom integration (Genesys AudioConnector)
- Monitoring: Audio quality metrics, model drift detection, production reliability (99.9% uptime); WER tracking sketch below
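To illustrate the WER tracking side of monitoring, here is a hedged sketch using the open-source `jiwer` package; the reference/hypothesis strings and the alert threshold are assumptions for the example.

```python
# Illustrative WER tracking sketch using `jiwer`; strings and threshold are assumptions.
import jiwer

reference = "please transfer me to the billing department"
hypothesis = "please transfer me to billing department"

error_rate = jiwer.wer(reference, hypothesis)
print(f"WER: {error_rate:.2%}")

# A production monitor would aggregate WER over sampled calls and alert
# (or trigger provider failover) when it drifts past an agreed threshold.
WER_ALERT_THRESHOLD = 0.05
if error_rate > WER_ALERT_THRESHOLD:
    print("WER above threshold - investigate model drift or audio quality.")
```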
Preferred Qualifications
Advanced Specializations
- Multi-lingual Processing: Cross-lingual transfer learning, zero-shot adaptation
- Domain Expertise: Healthcare, legal, technical domain speech AI
- Edge AI: TensorRT, Core ML, ONNX optimization for mobile/edge deployment
- Research Background: Publications in ICASSP, INTERSPEECH, ICML, NeurIPS
Leadership & Education
- Team Leadership : Experience leading speech AI teams and technical initiatives
- Education : MS/PhD in Computer Science, Electrical Engineering, or related field
- Open Source : Contributions to speech AI libraries and frameworks
Technical Environment
Production Technology Stack
Core Technologies:
- Languages: Python, C++, Rust, TypeScript
- Frameworks: PyTorch, TensorFlow, JAX, LangChain, CrewAI, AutoGen
- Cloud Services: AWS Bedrock, Google Cloud AI, Azure OpenAI Services
- Audio Tools: librosa, torchaudio, WebRTC, FFmpeg
- MLOps: MLflow, Kubeflow, Docker, Kubernetes, NVIDIA Triton
- Databases: Vector DBs (Pinecone, Weaviate), PostgreSQL, Redis
Production Models:
- Amazon Nova Sonic v1.0 (speech-to-speech streaming)
- Gemini 2.5 Flash Native Audio Dialog (multimodal processing)
- Azure OpenAI GPT-4o (realtime voice conversations)
- ElevenLabs (voice cloning and synthesis)
Infrastructure
- GPU Clusters: NVIDIA A100/H100 for model training
- Edge Deployment: NVIDIA Jetson, ARM-based targets
- Real-time Requirements: <300ms latency, 1000+ concurrent streams
- Enterprise Integration: Genesys AudioConnector, SIP protocol, telephony systems
Key Projects & Success Metrics
Primary Focus Areas
- Next-gen S2S Systems: Amazon Nova Sonic, Azure OpenAI Realtime, Gemini Native Audio
- Multi-cloud Integration: Unified APIs across AWS, Google Cloud, Azure
- Conversational AI Agents: Low-latency speech-enabled customer service bots
- Telecom Integration: Enterprise telephony and AudioConnector systems
- Domain-specific Models: Medical, legal, technical vocabulary fine-tuning
Success Metrics
- Performance: <5% WER for domain-specific tasks
- Latency: <300ms end-to-end
- Reliability: 99.9% uptime for production services
- Scale: 1000+ concurrent speech streams