Home
Jobs

AI Voice & LLM Integration Engineer

5 - 10 years

15 - 25 Lacs

Posted:1 week ago| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Role & responsibilities ole Overview As an AI Voice & LLM Integration Developer, you will play a pivotal role in building and optimizing Your work will focus on: • Integrating GPT's ASR (speech-to-text) and TTS (text-to-speech) APIs into our platform. • Handling real-time voice streaming from VoIP/SIP systems. • Optimizing low-latency communication between AI models and telephony platforms (e.g., FreeSWITCH, WebRTC). • Supporting PBX call routing & handoWs with AI agents. • Ensuring high-quality, natural-sounding AI-driven voice interactions. Key Responsibilities Design and develop real-time voice processing pipelines for AI-driven conversations. • Integrate OpenAI's Whisper (ASR) and TTS APIs with VoIP and WebRTC systems. • Develop low-latency audio streaming middleware between telephony providers(Telnyx, Bandwidth) and GPT. • Work with SIP and WebRTC to ensure seamless audio transmission. • Implement PBX call routing and AI handoW mechanisms (e.g., transferring AI-handled calls to human agents). • Optimize AI voice latency and response times for a real-time, natural experience. • Collaborate with VoIP engineers to integrate AI capabilities into SIP-based systems. • Design APIs to manage AI-driven IVR workflows and customer interactions. • Ensure compliance with telephony regulations (e.g., STIR/SHAKEN, call recording laws). • Monitor system performance and implement scalability strategies for voice interactions. Required Qualifications AI & LLM Integration o Experience integrating GPT APIs (ChatGPT, Whisper, TTS engines) for voice applications. o Understanding of real-time AI voice processing and conversational AI workflows. Voice & Audio Streaming Experience working with low-latency, real-time audio streaming. o Familiarity with WebRTC, SIP/RTP, and VoIP streaming. Programming & Middleware Development Proficiency in Python, Node.js, or Go for API and middleware development. o Experience developing RESTful APIs and real-time streaming solutions. VoIP & Telephony Knowledge Familiarity with SIP, FreeSWITCH, Asterisk, or other PBX platforms. o Experience working with SIP trunking providers (Telnyx, Bandwidth, Flowroute, etc.). Performance & Optimization Ability to reduce latency in AI-driven conversations. o Experience with audio codec optimization (G.711, Opus, etc.). One or more of the following additional qualifications: Experience with STT/ASR models beyond GPT (Google Speech-to- Text, Deepgram, Kaldi). o Background in signal processing or audio engineering. o Familiarity with cloud-based voice solutions (AWS Connect, Twilio Voice, Dialogflow CX). o Knowledge of containerization (Docker, Kubernetes) for scaling AI voice workloads. o Experience with multi-tenant architectures for voice AI.

Mock Interview

Practice Video Interview with JobPe AI

Start Generative Ai Interview Now
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Golang Skills

Practice Golang coding challenges to boost your skills

Start Practicing Golang Now
Cirruslabs
Cirruslabs

IT Services and IT Consulting

Alpharetta Georgia

201-500 Employees

102 Jobs

    Key People

  • John Doe

    CEO
  • Jane Smith

    CTO

RecommendedJobs for You