10 Voice AI Jobs

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

4.0 - 10.0 years

0 Lacs

karnataka

On-site

Sense is looking for a Director of Design to join their team and help enhance their product design organization. In this role, you will oversee the design process for products including Sense AI Recruiter, Workflows, Voice AI, Chatbot, Talent Intelligence, and CRM. Your responsibilities will include leading design projects, managing and growing the design team, and ensuring that product quality meets customer needs. As a Design Director at Sense, you will lead, mentor, and support a team of product designers to deliver high-quality products that meet customer requirements. You will work closely with Engineering, Product, and Design leads to develop strategies and operations for initiatives, focusing on user-centered design. Additionally, you will be responsible for defining and expanding the design department, recruiting talent, and representing design across the organization. Collaboration with the founders and other leadership members is essential to define the company's growth trajectory. You will also interact with marketing teams to oversee brand activities. Building strong relationships with your team, fostering open communication, and encouraging innovative ideas will be key to success in this role. Going forward, you will have the opportunity to shape AI-powered recruiting, design Voice AI experiences, and redefine the Sense Chatbot platform. You will be instrumental in driving Sense's entry into the large enterprise hiring market and collaborating with the marketing team to bring the new Sense brand to life. The ideal candidate for this role will have 10+ years of experience with a minimum of 4 years in a leadership position. Strong management and mentoring skills, experience in complex enterprise and SaaS products, and a background in HR and Recruiting Tech are desired.
The ability to navigate a complex company structure and contribute significant business impact, along with a strong grounding in UX product design and visual skills, is also essential. Sense offers various perks and benefits, including equity, medical insurance for employees and dependents, a quarterly Professional Development allowance, and Company Wellness Days. Sense is an equal-opportunity employer that values diversity, inclusion, and belonging in the workplace. If you are a seasoned design professional with a passion for leading and growing design teams, Sense invites you to join them in shaping the future of product design and innovation.

Posted 2 days ago


5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

We are looking for a highly experienced Voice AI/ML Engineer to take the lead in designing and deploying real-time voice intelligence systems. This position specifically involves working on ASR, TTS, speaker diarization, wake word detection, and developing production-grade modular audio processing pipelines to support next-generation contact center solutions, intelligent voice agents, and high-quality audio systems. You will be operating at the convergence of deep learning, streaming infrastructure, and speech/NLP technology, with a focus on creating scalable, low-latency systems that cater to diverse audio formats and real-world applications.
Your responsibilities will include:
- Building, fine-tuning, and deploying ASR models such as Whisper, wav2vec 2.0, and Conformer for real-time transcription.
- Developing high-quality TTS systems using VITS, Tacotron, and FastSpeech for natural-sounding voice generation.
- Implementing speaker diarization to segment and identify speakers in multi-party conversations using embeddings and clustering techniques.
- Designing wake word detection models with ultra-low latency and high accuracy even in noisy conditions.
In addition to the above, you will also be involved in:
- Architecting bi-directional real-time audio streaming pipelines utilizing WebSocket, gRPC, Twilio Media Streams, or WebRTC.
- Integrating voice AI models into live voice agent solutions, IVR automation, and AI contact center platforms.
- Building scalable microservices for audio processing, encoding, and streaming across various codecs and containers.
- Leveraging deep learning and NLP techniques for speech and language tasks.
Furthermore, you will be responsible for:
- Developing reusable modules for different voice tasks and system components.
- Designing APIs and interfaces for orchestrating voice tasks across multi-stage pipelines.
- Writing efficient Python code, optimizing models for real-time inference, and deploying them on cloud platforms.
Join us to be part of impactful work, tremendous growth opportunities, and an innovative environment at Tanla, where diversity is championed and inclusivity is valued.

Posted 6 days ago


1.0 - 5.0 years

0 Lacs

karnataka

On-site

NTT DATA is looking for a GCP Python Gen AI LLM RAG Vertex AI to join their team in Hyderabad, Telangana (IN-TG), India. As a part of this inclusive and forward-thinking organization, you will need 4+ years of Software Engineering experience or equivalent, demonstrated through work experience, training, military experience, or education. The ideal candidate should have at least 2 years of working experience with GCP (Google Cloud Platform) or an alternate public/hybrid cloud, with a proven track record of delivering products at scale using cloud services and architectures. Additionally, you should possess 2+ years of experience with Python and 3+ years of experience with GenAI, LLMs, RAG, vector databases, and conversational bots. Exposure to Playbooks, Vertex AI, ADK, and Voice AI is also required. It would be beneficial to have knowledge of LangChain and/or LangGraph, and 4+ years of experience in the Contact Center industry, specifically in design, development, testing, and integration with vendors, CRMs, and business applications. Proficiency in IVR/IVA, NLU/NLP, Real-Time Omni-channel Agent experience, and customer journey optimization using AI/ML is a plus. Furthermore, familiarity with Node JS, JAVA, Spring Boot, Kafka, Distributed Caches (GemFire, Redis), Elastic Search technologies, GraphQL, NoSQL Databases (Cassandra or Mongo), Graph Databases, and Public Cloud Marketplace services is desirable. Experience with Deep Domain Driven Design and cloud-native Microservices designed for massive scale and seamless resiliency on platforms like PCF/VMWare Tanzu, K8s, or Serverless cloud technologies will be an advantage. NTT DATA is a trusted global innovator of business and technology services, serving 75% of the Fortune Global 100. As a Global Top Employer, NTT DATA has diverse experts in over 50 countries and a robust partner ecosystem.
Their services encompass business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is also a leading provider of digital and AI infrastructure worldwide, committed to helping clients innovate, optimize, and transform for long-term success. If you are an exceptional, innovative, and passionate individual looking to grow with a prestigious organization, apply now to be a part of NTT DATA's dynamic team.

Posted 1 week ago


1.0 - 5.0 years

0 Lacs

hyderabad, telangana

On-site

NTT DATA is looking for a GCP Python Gen AI LLM RAG Vertex AI to join the team in Hyderabad, Telangana, India. As a potential candidate, you should have at least 4 years of Software Engineering experience or equivalent, demonstrated through work experience, training, military experience, or education. To be considered for this role, you must have a minimum of 2 years of experience working with GCP (Google Cloud Platform) or alternate public/hybrid cloud, with a proven track record of delivering products using cloud services and architectures at scale. Additionally, you should have at least 2 years of experience with Python, and 3 years of experience with GenAI, LLMs, RAG, vector databases, and conversational bots. Experience with Playbooks and Vertex AI is required, along with exposure to ADK and Voice AI. Knowledge of LangChain and/or LangGraph is considered a plus. Furthermore, candidates with 4+ years of Contact Center industry experience, including design, development, testing, integration with vendors, CRMs, and business applications, are preferred. Familiarity with IVR/IVA, NLU/NLP, Real-Time Omni channel Agent experience, customer journey, and CX/AX optimization using AI/ML is advantageous. Proficiency in Node JS, JAVA, Spring Boot, Kafka, Distributed Caches, Elastic Search technologies, GraphQL, and NoSQL Databases is beneficial. Experience with Graph Databases and Public Cloud Marketplace services is also a plus. Deep Domain Driven Design experience with cloud-native Microservices designed for massive scale and seamless resiliency is desirable, preferably deployed on PCF/VMWare Tanzu, K8s, or Serverless cloud technologies. NTT DATA is a trusted global innovator of business and technology services, serving 75% of the Fortune Global 100. The company is committed to helping clients innovate, optimize, and transform for long-term success. NTT DATA has a diverse team of experts in more than 50 countries and collaborates with a robust partner ecosystem. 
Their services encompass business and technology consulting, data and artificial intelligence, industry solutions, and the development, implementation, and management of applications, infrastructure, and connectivity. As a leading provider of digital and AI infrastructure, NTT DATA is dedicated to advancing organizations and society into the digital future confidently and sustainably.

Posted 1 week ago


1.0 - 5.0 years

0 Lacs

hyderabad, telangana

On-site

NTT DATA is looking for a GCP Python Gen AI LLM RAG Vertex AI to join their team in Hyderabad, Telangana, India. As a potential candidate, you should have at least 4 years of Software Engineering experience or equivalent demonstrated through various means such as work experience, training, military experience, or education. It is essential to have a minimum of 2 years of experience working with GCP (Google Cloud Platform) or alternate public/hybrid cloud, delivering products with cloud services and cloud architectures at scale. In addition, you should have 2+ years of experience with Python and 3+ years of experience with GenAI, LLMs, RAG, vector databases, and conversational bots. Furthermore, 1+ years of experience with Playbooks and Vertex AI is required for this role. Exposure to ADK (hands-on) and Voice AI is a must. While not mandatory, having experience with LangChain and/or LangGraph is considered a plus. Additionally, 4+ years of Contact Center industry experience would be advantageous, including design, development, testing, integration with vendors, CRMs, and business applications. Proven knowledge in contact center subdomains such as IVR/IVA, NLU/NLP, Real-Time Omni-channel Agent experience, customer journey, and CX/AX experience optimization using AI/ML is beneficial. Moreover, familiarity with Node JS, JAVA, Spring Boot, Kafka, Distributed Caches (GemFire, Redis), Elastic Search technologies, GraphQL, and NoSQL Databases (Cassandra or Mongo), Graph Databases, Public Cloud Marketplace services is a good-to-have skill set. Experience with Deep Domain Driven Design with cloud-native Microservices designed and developed for massive scale and seamless resiliency, deployed on PCF/VMWare Tanzu, K8s, or Serverless cloud technologies for at least 2 years is also an added advantage. NTT DATA is a trusted global innovator of business and technology services, with a commitment to helping clients innovate, optimize, and transform for long-term success. 
Being a part of NTT DATA means being part of a diverse team of experts in over 50 countries, with a robust partner ecosystem. Their services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is a leading provider of digital and AI infrastructure globally, and as part of the NTT Group, they invest significantly in R&D to help organizations and society move confidently and sustainably into the digital future. Visit their website for more information.

Posted 2 weeks ago


5.0 - 9.0 years

0 Lacs

haryana

On-site

As a Senior AI Engineer specialized in Voice AI and Autonomous Agents at Spyne, you will be responsible for owning and building Spyne's in-house voice bot stack. This pivotal role involves leveraging your expertise in LLMs, ASR/TTS, and voice UX to develop immersive, human-like conversations between auto dealerships and their customers. Located in Gurugram, you will work from the office five days a week to drive the development of cutting-edge AI solutions in the automotive retail sector.
Your primary responsibilities will include:
- Voice AI Stack Ownership: Developing and managing the complete voice bot pipeline encompassing ASR, NLU, dialog state management, tool calling, and TTS to deliver a seamless conversation experience.
- LLM Orchestration & Tooling: Designing systems utilizing MCP to facilitate structured context exchange between real-time ASR, memory, APIs, and the LLM.
- RAG Integration: Implementing retrieval-augmented generation to support responses based on dealership knowledge bases, inventory data, recall lookups, and FAQs.
- Vector Store & Memory: Creating scalable vector-based search functionalities for dynamic FAQ handling, call recall, and user-specific memory embedding.
- Latency Optimization: Engineering low-latency, streaming ASR + TTS pipelines and optimizing turn-taking models for natural conversations.
- Model Tuning & Hallucination Control: Customizing tone, reducing hallucinations, and aligning responses with business objectives via fine-tuning, LoRA, or instruction tuning.
- Instrumentation & QA Looping: Establishing robust observability, conducting real-time call QA processes, and analyzing interruptions, hallucinations, and fallbacks.
- Cross-functional Collaboration: Collaborating closely with product, infra, and leadership teams to scale the voice bot solution to numerous US dealerships effectively.
To excel in this role, you should possess:
- Architectural Thinking: Ability to comprehend the integration of ASR, LLMs, memory, and tools and design modular, observable, and resilient systems.
- LLM Tooling Mastery: Proficiency in implementing tool calling, retrieval pipelines, function calls, or prompt chaining across various workflows.
- Fluency in Vector Search & RAG: Expertise in chunking, embedding, indexing, and retrieval processes while avoiding prompt bloat and token overflow.
- Latency-First Mindset: Capability to identify and rectify token delays, optimize API round-trip time, and ensure human-like call interactions.
- Grounding > Hallucination: Skill in tracing hallucinations to weak prompts, lack of guardrails, or tool access deficiencies and addressing them effectively.
- Prototyping Skills: Comfort with building from scratch and rapid iteration using open-source or hosted tools as required.
Requirements for this role include:
- 5+ years of experience in AI/ML or voice/NLP systems with real-time exposure.
- Profound knowledge of LLM orchestration, RAG, vector search, and prompt engineering.
- Experience with MCP-style architectures and structured context pipelines between LLMs and APIs/tools.
- Familiarity with integrating ASR (Whisper/Deepgram), TTS (ElevenLabs/Coqui), and OpenAI/GPT-style models.
- Strong understanding of latency optimization, streaming inference, and real-time audio pipelines.
- Proficiency in Python, FastAPI, vector DBs (Pinecone, Weaviate, FAISS), and cloud infrastructures (AWS/GCP).
- Solid debugging, logging, and QA capabilities for hallucination, grounding, and UX analysis.
Join Spyne for real-world AI impact, a superior team balancing speed with technical depth, high autonomy and visibility from day one, accelerated career growth, a MacBook along with essential tools, and a flat structure focused on meaningful work without unnecessary bureaucracy.

Posted 2 weeks ago


5.0 - 9.0 years

0 Lacs

haryana

On-site

As a Senior AI Engineer - Voice AI / Autonomous Agents at Spyne, you will have the opportunity to own and build Spyne's in-house voice bot stack. In this high-impact individual contributor role, you will be at the intersection of LLMs, ASR/TTS, and voice UX, focusing on creating deeply human, latency-optimized conversations between auto dealerships and their customers.
Your main responsibilities will include:
- Voice AI Stack Ownership: Building and owning the end-to-end voice bot pipeline, including ASR, NLU, dialog state management, tool calling, and TTS to deliver a natural, human-like conversation experience.
- LLM Orchestration & Tooling: Architecting systems using MCP (Model Context Protocol) to mediate structured context between real-time ASR, memory, APIs, and the LLM.
- RAG Integration: Implementing retrieval-augmented generation to ground responses using dealership knowledge bases, inventory data, recall lookups, and FAQs.
- Vector Store & Memory: Designing scalable vector-based search for dynamic FAQ handling, call recall, and user-specific memory embedding.
- Latency Optimization: Engineering low-latency, streaming ASR + TTS pipelines and fine-tuning turn-taking models for natural conversation.
- Model Tuning & Hallucination Control: Using fine-tuning, LoRA, or instruction tuning to customize tone, reduce hallucinations, and align responses to business goals.
- Instrumentation & QA Looping: Building robust observability, running real-time call QA pipelines, and analyzing interruptions, hallucinations, and fallbacks.
- Cross-functional Collaboration: Working closely with product, infra, and leadership to scale this bot to thousands of US dealerships.
To be successful in this role, you should possess:
- Architect-level thinking: Understanding how ASR, LLMs, memory, and tools fit together and having the ability to design modular, observable, and resilient systems.
- LLM Tooling Mastery: Implementing tool calling, retrieval pipelines, function calls, or prompt chaining across multiple workflows.
- Fluency in Vector Search & RAG: Knowing how to chunk, embed, index, and retrieve, while avoiding prompt bloat and token overflow.
- Latency-First Mindset: Debugging token delays, understanding the cost of each API hop, and optimizing round-trip time to maintain human-like interactions.
- Grounding > Hallucination: Tracing hallucinations back to weak prompts, missing guardrails, or lack of tool access and effectively addressing them.
- Prototyper at heart: Being unafraid of building from scratch and iterating quickly, utilizing open-source or hosted tools as necessary.
The ideal candidate will have:
- 5+ years of experience in AI/ML or voice/NLP systems with real-time experience.
- Deep knowledge of LLM orchestration, RAG, vector search, and prompt engineering.
- Experience with MCP-style architectures or structured context pipelines between LLMs and APIs/tools.
- Experience integrating ASR (Whisper/Deepgram), TTS (ElevenLabs/Coqui), and OpenAI/GPT-style models.
- Solid understanding of latency optimization, streaming inference, and real-time audio pipelines.
- Hands-on experience with Python, FastAPI, vector DBs (Pinecone, Weaviate, FAISS), and cloud infra (AWS/GCP).
- Strong debugging, logging, and QA instincts for hallucination, grounding, and UX behavior.
Working at Spyne offers real-world AI impact at scale, a high-performing team that balances speed with technical depth, high autonomy and visibility from day one, rapid career acceleration, access to a MacBook and all necessary tools and compute, a flat structure with real work focus, and no BS. Join us in redefining how cars are marketed and sold with cutting-edge Generative AI.

Posted 2 weeks ago


2.0 - 5.0 years

6 - 9 Lacs

Kolkata

Work from Office

Role & Responsibilities:
- Design & implement AI workflows to automate CRM, trip quotations, lead follow-ups, and customer chat/voice queries
- Build smart agents using GPT (chat + voice) for internal & customer use (OpenAI, Twilio, CallHippo)
- Integrate travel APIs (flights, hotels, activities) with our platform for live quotation generation
- Automate repetitive tasks using Make.com, Zapier, and internal tools
- Work with the Product and Creative teams to bring AI to media creation (photo/video sorting, customer albums, etc.)
- Develop performance dashboards, auto-suggestion tools, and smart seller assistants
- Stay up to date with AI trends in TravelTech and test applicable models for use
Preferred Candidate Profile:
- 2 to 5 years of real-world experience in applied AI, automation, or backend systems
- Strong Python skills (FastAPI, LangChain, Pandas, etc.)
- Hands-on with OpenAI/GPT APIs, Twilio, WhatsApp integrations
- Familiarity with automation tools like Make.com, Zapier
- Passionate about travel, tech, and creating impact
- Bonus: Experience in e-commerce, SaaS, or travel-tech products

Posted 1 month ago


5.0 - 8.0 years

15 - 30 Lacs

Noida

Work from Office

Position Summary: We are looking for a dynamic Senior Voice AI Developer / Voice AI Architect to spearhead our AI initiatives, guiding the integration of artificial intelligence into various aspects of our business. You will design AI systems from the ground up, collaborate with multidisciplinary teams to tailor AI solutions to specific business needs, and ensure these solutions are scalable and sustainable. Your expertise will help us harness the power of AI to drive innovation, improve decision-making, and maintain competitive advantage in our industry.
Key Responsibilities:
- Design, develop, and oversee the implementation of end-to-end AI solutions
- Collaborate with business and IT stakeholders to understand and fulfill the AI needs of the organization
- Create architectural approaches for AI software and hardware integration
- Define AI solution objectives and ensure alignment with business outcomes
- Monitor AI industry trends and maintain state-of-the-art industry knowledge
- Implement best practices for AI testing, deployment, and maintenance
Qualifications:
- B.Tech/M.Tech/MCA (CSE/IT), preferably from premier institutes.
- 5+ years of experience in Node.js / JavaScript / TypeScript
- 5+ years of relevant experience in AI frameworks
Mandatory Skills: Google Dialogflow, OpenAI integration, Google STT/TTS, and Node.js
- Node.js: Competent in developing and maintaining applications.
- GCP Services: Proficient in GCP logs, services, and custom deployment configurations.
- Dialogflow Expertise: Strong understanding of conversational agent design and integration for voice applications.
- Speech-to-Text & Text-to-Speech: Functional knowledge of speech processing.
- Functional knowledge of audio streaming/processing applications
- Functional knowledge of conversational AI
- Generative AI: Foundational understanding of Gen AI concepts and applications.
- Proven understanding of scalable computing systems, microservices architectures, software architecture, data structures, and algorithms
- Working with defined processes and policies (e.g. peer review, test-driven development, coding standards, and deployment process)
- Excellent proven analytical problem-solving skills.
- Self-motivated high performer, able to work with minimal supervision and lead by example.
- Excellent written and verbal communication skills.
Good To Have:
- Experience integrating with Azure OpenAI
- Working knowledge of any CCAI framework/applications
- Audio intelligence frameworks
- Experience with back-end technologies like NoSQL, RDBMS, and cache management
- Experience working with AWS technologies: Lambda, API Gateway, S3
- Agile development methodology
- Experience working with geographically distributed teams.
- Knowledge of any CRM (WFM) will be a bonus; ServiceNow is a big plus.
Benefits:
- Flexible working hours.
- Hybrid working style.
- Personal accident insurance.
- Health insurance for self, spouse, and two kids.
- 5-day working week.

Posted 1 month ago


2 - 7 years

7 - 15 Lacs

Bengaluru

Hybrid

As a Prompt Engineer, you will play a crucial role in designing and refining the conversational prompts that power our AI voice agents. You will work closely with AI researchers, product managers, and developers

Posted 2 months ago
