Role Overview: ViH Messenger is looking for a fluent Gujarati and English speaker to join their team for a short-term voice recording project. As a part of the team, your role will involve recording voice samples in Gujarati and English as per project requirements, maintaining clear diction, correct pronunciation, and consistent tone, and ensuring timely delivery of high-quality recordings. Key Responsibilities: - Record voice samples in Gujarati and English as per project requirements. - Maintain clear diction, correct pronunciation, and consistent tone. - Ensure timely delivery of high-quality recordings. Qualifications Required: - Must be residing in Noida (on-site role). - Strong fluency in Gujarati and English (speaking & reading). - Clear, pleasant, and professional voice with accurate pronunciation. - Prior experience in voice recording, dubbing, or similar work is an advantage but not mandatory.,
We’re looking for an experienced engineer to lead the development of our real-time voice AI platform. This hybrid role combines deep expertise in conversational AI, audio infrastructure, and full stack systems, making you a core contributor to building natural, low-latency voice-driven agents for complex healthcare workflows and beyond. You’ll work directly with the founding team to bring intelligent, production-grade voice agents to life. Role: We are looking for someone who loves to read research papers ,debate on media streaming & How Voice Communication is going to be transformed in the coming 3 years around the globe. It doesnt matter your background - whether you are fresher or experienced ,with degree or without degree,if you love working around Voice AI - join our Core Team. If you are good at playing with webrtc ,websocket & streaming pipeline - get your spot here!! What You’ll Do ● Streamlining for voice-driven AI systems that integrate ASR(speech recognition), TTS (speech synthesis), and LLM inference with webrtc & websocket infra. ● Orchestrate multi-turn conversations using frameworks like Pipecat, with memory and context management. ● Develop scalable backends and APIs to support streaming audio pipelines, stateful agents, and secure healthcare workflows. ● Implement real-time communication features with WebRTC, WebSockets, and low- latency audio streaming pipelines. ● Collaborate closely across research, engineering, and product to move fast, ship experiments, and deploy them to production. ● Monitor, optimize, and maintain deployed agents for high reliability, safety, and Performance. ● Translate experimental AI audio models into production-ready services. What We’re Looking For ● 4+ years of software engineering experience, with a focus on real-time systems, streaming, or conversational AI. ● Proven track record building and deploying voice AI, audio/video, or low-latency communication systems & if you are fresher - a good github what challenges you come across instead of what you have built,we are interested to know the process at First. ● Strong proficiency in Python (FastAPI, async frameworks, LangChain or similar); Working knowledge of modern front-end frameworks like Next.js is a plus. ● Knowledge of WebRTC, WebSockets, Redis, Kafka, Docker, AWS for scalable real- time pipelines. ● Exposure to healthcare tech, RCM, or regulated environments is highly valued. Bonus Points ● Contributions to open-source audio/media projects. ● Experience in DSP, live streaming, or media infrastructure. ● Comfortable with observability tools (Grafana, etc.). Why Join Us You’ll sit at the crossroads of AI research and real-world product engineering, shaping voice intelligence that has immediate, high-impact applications in healthcare and beyond. This is a chance to own systems end-to-end, from building cutting-edge low-latency voice pipelines to designing APIs and full-stack apps that bring new agent experiences to life. Interview Process: - Fill the Google Form along with your Linkedin & Github ( Link to your best 2 Projects you have worked around) - https://forms.gle/xHchssjc3X23CeM16 - If we find the profile right fit , you will receive an email with the next task. - Third Round will be Discussion Round with the Tech Team - The Final Round with HR
We're looking for an experienced engineer to lead the development of our real-time voice AI platform. This hybrid role combines deep expertise in conversational AI, audio infrastructure, and full stack systems, making you a core contributor to building natural, low-latency voice-driven agents for complex healthcare workflows and beyond. You'll work directly with the founding team to bring intelligent, production-grade voice agents to life. Role: We are looking for someone who loves to read research papers ,debate on media streaming & How Voice Communication is going to be transformed in the coming 3 years around the globe. It doesnt matter your background - whether you are fresher or experienced ,with degree or without degree,if you love working around Voice AI - join our Core Team. If you are good at playing with webrtc ,websocket & streaming pipeline - get your spot here!! What You'll Do ? Streamlining for voice-driven AI systems that integrate ASR(speech recognition), TTS (speech synthesis), and LLM inference with webrtc & websocket infra. ? Orchestrate multi-turn conversations using frameworks like Pipecat, with memory and context management. ? Develop scalable backends and APIs to support streaming audio pipelines, stateful agents, and secure healthcare workflows. ? Implement real-time communication features with WebRTC, WebSockets, and low- latency audio streaming pipelines. ? Collaborate closely across research, engineering, and product to move fast, ship experiments, and deploy them to production. ? Monitor, optimize, and maintain deployed agents for high reliability, safety, and Performance. ? Translate experimental AI audio models into production-ready services. What We're Looking For ? 4+ years of software engineering experience, with a focus on real-time systems, streaming, or conversational AI. ? Proven track record building and deploying voice AI, audio/video, or low-latency communication systems & if you are fresher - a good github what challenges you come across instead of what you have built,we are interested to know the process at First. ? Strong proficiency in Python (FastAPI, async frameworks, LangChain or similar); Working knowledge of modern front-end frameworks like Next.js is a plus. ? Knowledge of WebRTC, WebSockets, Redis, Kafka, Docker, AWS for scalable real- time pipelines. ? Exposure to healthcare tech, RCM, or regulated environments is highly valued. Bonus Points ? Contributions to open-source audio/media projects. ? Experience in DSP, live streaming, or media infrastructure. ? Comfortable with observability tools (Grafana, etc.). Why Join Us You'll sit at the crossroads of AI research and real-world product engineering, shaping voice intelligence that has immediate, high-impact applications in healthcare and beyond. This is a chance to own systems end-to-end, from building cutting-edge low-latency voice pipelines to designing APIs and full-stack apps that bring new agent experiences to life. Interview Process: - Fill the Google Form along with your Linkedin & Github ( Link to your best 2 Projects you have worked around) - https://forms.gle/xHchssjc3X23CeM16 - If we find the profile right fit , you will receive an email with the next task. - Third Round will be Discussion Round with the Tech Team - The Final Round with HR