Jobs
Interviews

12 Speechtotext Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 10.0 years

0 Lacs

noida, uttar pradesh

On-site

As a Senior Voice AI Developer / Voice AI Architect with 5 to 10 years of experience, you will play a crucial role in spearheading our AI initiatives and integrating artificial intelligence into various aspects of our business. Your primary responsibility will be to design AI systems from scratch, collaborate with multidisciplinary teams to customize AI solutions according to specific business requirements, and ensure the scalability and sustainability of these solutions. Your expertise in AI will be instrumental in driving innovation, enhancing decision-making processes, and maintaining a competitive edge in our industry. Your key responsibilities will include designing, developing, and overseeing the implementation of end-to-end AI solutions. You will work closely with both business and IT stakeholders to identify and address the AI needs of the organization. Additionally, you will be tasked with creating architectural approaches for AI software and hardware integration, defining AI solution objectives in alignment with business outcomes, monitoring AI industry trends, and implementing best practices for AI testing, deployment, and maintenance. In terms of preferred skills and experience, you must have a minimum of 5 years of experience in Node.js / JavaScript / TypeScript and at least 3 years of relevant experience in AI frameworks. Mandatory skills include expertise in Google Dialogflow, Google STT/TTS, and Node.js, as well as proficiency in GCP services and Dialogflow. You should also possess functional knowledge of speech processing, audio streaming/processing applications, conversational AI, and generative AI concepts and applications. A strong understanding of scalable computing systems, microservices architectures, software architecture, data structures, and algorithms is crucial. Ideally, you should hold a Bachelor's degree in B.Tech/M.Tech/MCA (CSE/IT). It would be advantageous to have experience with OpenAI's Real-Time API, integration with Azure OpenAI, any CCAI frameworks/applications, audio intelligence frameworks, and back-end technologies like NoSQL, RDBMS, and cache management. Familiarity with AWS technologies such as Lambda, API Gateway, and S3, as well as agile development methodology and working with geographically distributed teams, will be beneficial. Knowledge of any CRM, especially ServiceNOW, would be considered a significant advantage.,

Posted 1 week ago

Apply

0.0 - 3.0 years

0 Lacs

surat, gujarat

On-site

As an AI Research Engineer (Intern) working remotely, you will be responsible for building infrastructure for training, evaluating, and deploying Small Language Models (SLMs). Your role will involve designing RL-based training loops such as PPO, DPO, RLAIF for model tuning. Additionally, you will work on speech systems including Speech-to-text, text-to-speech, VAD, and optimizing models for on-device inference on platforms like mobile, browser, and edge. Furthermore, you will be involved in developing real-time multimodal AI pipelines that encompass text, audio, and video. Your tasks will also include translating research papers into production-ready code and driving research efforts on quantization, distillation, and architecture search. To excel in this role, a Bachelor's degree in Computer Science, Information Technology, AI-ML, or related fields is required. The ideal candidate should have 0 to 6 months of experience in the field. If you are passionate about AI research and eager to contribute to cutting-edge projects, this position offers a unique opportunity to apply your skills and knowledge in a dynamic and innovative environment.,

Posted 2 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

At Skillsoft, we are dedicated to propelling organizations and individuals towards growth through transformative learning experiences. We strongly believe that each team member has the potential to achieve greatness. Join us on our mission to revolutionize learning and assist individuals in unlocking their full potential. As a Quality Assurance (QA) Engineer, you will play a crucial role in ensuring the performance, accuracy, and reliability of our AI conversational systems. This includes overseeing natural language processing (NLP), speech-to-text (STT), text-to-speech (TTS), and AI dialogue management. The ideal candidate will possess exceptional analytical skills, a profound understanding of AI testing methodologies, and hands-on experience with AI-driven conversational interfaces. Your responsibilities will involve developing and executing comprehensive test plans and test cases for AI conversational systems. This will entail verifying that all functional and non-functional requirements are met. You will also be responsible for conducting end-to-end testing of speech recognition (STT), natural language understanding (NLU), and response generation (NLG) within the AI platform. Additionally, you will need to perform both manual and automated testing of conversational flows, including edge cases, error handling, and multi-turn dialogues. Your role will require you to validate the system's ability to comprehend and process various user intents, entities, and languages. You will also be responsible for testing integrations between different components of the AI system, such as third-party APIs, speech interfaces, and backend data systems. Furthermore, ensuring the quality and accuracy of voice output (TTS) and conducting thorough testing of speech quality across different platforms and devices will be within your purview. Identifying and documenting bugs, performance issues, and user experience problems will be essential. You will collaborate closely with developers and AI engineers to resolve these issues. Monitoring system performance and behavior under different conditions, including large-scale interactions and stress testing, will also be part of your responsibilities. Conducting regression testing after each update to ensure the continued functionality and efficiency of previous features is paramount. Collaboration with cross-functional teams, including AI/ML engineers, product managers, and UI/UX designers, to define quality benchmarks and maintain continuous improvement in testing processes is crucial for success in this role. Required Skills and Qualifications: - 5-8 years of experience and a Bachelor's degree in Computer Science, Engineering, or a related field. - Proven experience as a QA Engineer, preferably with exposure to AI/ML-based systems or conversational AI platforms. - Familiarity with natural language processing (NLP), machine learning models, and speech technologies such as STT, TTS, and dialogue systems. - Proficiency in test automation tools like Selenium, Appium, Cypress, Playwright, PyTest, JUnit, TestNG, and API testing tools like Postman, RestAssured, or SoapUI. - Strong problem-solving skills with a focus on quality and attention to detail. - Experience in agile software development methodologies and excellent communication skills. Preferred Qualifications: - Experience working with conversational AI platforms. - Familiarity with testing frameworks for AI models. - Knowledge of multilingual and multi-accent conversational testing. - Understanding of user experience (UX) testing for voice interfaces. - Experience in performance and load testing for AI systems. Skillsoft is a leading provider of online learning, training, and talent solutions that help organizations unleash their potential. With a focus on immersive and engaging content, Skillsoft empowers organizations to develop their people and build skilled teams for success. Partnering with thousands of global organizations, including Fortune 500 companies, Skillsoft offers award-winning systems like Skillsoft learning content and the Percipio intelligent learning experience platform. If you are intrigued by this opportunity, we welcome you to apply and become part of our team at Skillsoft. Thank you for considering us as your next career destination.,

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

You will be responsible for designing, building, and maintaining iOS applications that deliver high-quality user experiences. If you are a team player with a strong eye for design and functionality, we would love to hear from you. Design and develop advanced applications for the iOS platform using Swift and, where applicable, Objective-C. Collaborate with cross-functional teams to define, design, and launch new features. Troubleshoot and fix bugs, while continuously improving application performance. Evaluate and implement new technologies to maximize development efficiency. Ensure application performance, quality, and responsiveness. Integrate applications with backend services using RESTful APIs or GraphQL. Participate in code reviews and contribute to maintaining high code quality standards. Proven experience as an iOS developer. Proficiency in Swift and Xcode. Familiarity with UIKit, SwiftUI, and CoreData. Solid understanding of Apple's Human Interface Guidelines. Experience working with third-party libraries and APIs. Knowledge of push notifications and cloud messaging. Experience with version control systems such as Git. Understanding of the full mobile application development lifecycle. Must-Have Skills: SwiftUI, Swift5, Facetime, SpeechToText, RTC, CoreML. Preferred Qualifications: Experience with reactive programming frameworks such as RxSwift or Combine. Familiarity with continuous integration and deployment practices. Knowledge of automated testing tools and frameworks. Exposure to Agile or Scrum development methodologies.,

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

As a skilled professional in the field of Conversational Text AI Platform, your primary responsibility will be to develop and maintain state-of-the-art Conversational Text AI systems using cutting-edge LLM frameworks. You will collaborate closely with product owners and domain experts to create reusable components tailored to specific business processes. Additionally, you will be tasked with building core infrastructure and reusable components that facilitate the seamless deployment of conversational AI systems. Your expertise will be crucial in working on orchestration, prompt engineering, and LLM-powered integrations, ensuring the scalability and integration of solutions with enterprise data platforms. In the realm of Generative AI & Model Optimization, you will be expected to fine-tune LLMs/SLMs using proprietary NBFC data and perform distillation and quantization of models for edge deployment. Your role will involve evaluating and running LLM/SLM models on local/edge server machines, contributing to the optimization and efficiency of the AI models. Moreover, you will play a key role in developing self-learning frameworks that allow systems to adapt without complete retraining, incorporating lightweight local models for real-time learning on the edge. Your expertise will be crucial in enhancing the adaptability and learning capabilities of AI systems. The ideal candidate for this role should possess a Bachelor's or Master's degree in Computer Science, Engineering, or a related field, along with at least 7 years of experience in Python, Node.JS, JavaScript, HTML/CSS, Redis, Postgres, Azure COSMOS, DevOps, and CI/CD, with exposure to AI/ML technologies. Strong programming skills in languages like Python, Node.JS, and JavaScript are essential, along with familiarity with Redis, Postgres, Vector Embeddings, Speech-to-Text & Text-to-Speech Services, Azure COSMOS, DevOps, and CI/CD practices. Experience in building or integrating LLMs for task automation, reasoning, or autonomous workflows, as well as a solid understanding of prompt engineering, tool calling, and agent orchestration, will be highly valued. Joining our team at Bajaj Finance Limited offers you the opportunity to be part of a dynamic and diverse organization that values its people and fosters a culture of innovation and achievement. With over 500 locations across India, we provide a stimulating work environment where your skills and drive can lead to rewarding accomplishments. This is a full-time position that requires in-person work at the designated location.,

Posted 1 month ago

Apply

0.0 - 4.0 years

0 Lacs

delhi

On-site

The day-to-day responsibilities of the selected intern will include building AI agents using tools such as ChatGPT, OpenAI API, n8n, Make.com, and Voiceflow. Designing prompt-based workflows to automate tasks, exploring speech-to-text and intent recognition APIs, and creating proof-of-concept tools using AI technologies. Additionally, the intern will be responsible for developing scalable backend APIs and database schemas, designing interactive React-based dashboards and CRM interfaces, integrating REST APIs and third-party services, and optimizing performance, user experience, and responsiveness. Collaboration with the AI/ML team to construct front-end interfaces for intelligent workflows is also an essential part of the role. SOLAR-MAIT India is a comprehensive solar brand dedicated to making solar energy affordable and accessible through innovative products, services, and business models. The company's vision is to democratize solar power, ensuring its availability to all while contributing to a greener and more eco-friendly planet for future generations. SOLAR-MAIT India is actively involved in creating and empowering local entrepreneurs and generating employment opportunities within the solar energy sector.,

Posted 1 month ago

Apply

12.0 - 16.0 years

0 Lacs

ahmedabad, gujarat

On-site

You will be joining Jetbro, a dynamic digital agency that specializes in bespoke development solutions for websites, mobile applications, and business applications. The company is at the forefront of AI-driven projects, offering innovative and cutting-edge solutions to its clients. As the company continues to expand, they are looking for a skilled AI Engineer to build and maintain high-performance, scalable systems. Your responsibilities will include integrating and orchestrating OpenAI, Gemini, and other LLM APIs to power features such as Essay & Recommendation Brainstorming (Voice & Text), AI-driven Evaluation and Scoring, and Contextual, voice-based guidance via Ivy (Text-to-Speech + prompt control). You will also be tasked with building and fine-tuning prompt-response workflows, input-output schemas, fallback handling, and session management. Additionally, you will need to convert speech inputs to structured text using Speech-to-Text APIs, and implement Text-to-Speech outputs for voice-based interactions. Setting up and managing Vector Databases (e.g., Pinecone, Weaviate, AstraDB) to store and retrieve semantically indexed essay data will also be part of your role. Collaboration with backend, frontend, and QA teams to ensure smooth flow of AI-driven features across the platform is essential. Furthermore, you will be responsible for evaluating and continuously improving AI output quality by testing prompt variations and refining scoring logic. Mandatory requirements for this position include 1.5 years of hands-on experience building with LLM APIs (OpenAI, Gemini, Claude, etc.), a strong grasp of prompt engineering, including role prompting, temperature control, and context design, experience working with Python and integrating APIs in live products, familiarity with Text-to-Speech and Speech-to-Text workflows (OpenAI Whisper, Google Cloud STT/TTS, etc.), exposure to using or querying Vector Databases (e.g., Pinecone or similar), and basic understanding of AI evaluation techniques (prompt tuning, hallucination handling, response scoring). Good to have qualifications include experience with LangChain, LlamaIndex, or agent frameworks, working knowledge of FastAPI or Django to support backend integration, familiarity with Voice UX or chatbot design, comfort with versioning tools (Git), JSON handling, and API testing tools (Postman), and demonstrated ability to iterate quickly and adapt to changing LLM capabilities. In return, you can expect to work on mission-critical, real-world projects that impact large-scale operations at Jetbro. The company has a lean, no-nonsense tech culture built on ownership, accountability, and growth, with direct access to decision-makers and no red tape. You will enjoy flexibility in work location and timings, regular feedback, performance reviews, and learning opportunities.,

Posted 1 month ago

Apply

1.0 - 5.0 years

0 Lacs

hyderabad, telangana

On-site

Qualcomm India Private Limited is seeking a candidate to join their Multimedia Audio Systems Group as a Voice AI Engineer. As part of the team, you will be responsible for prototyping and productizing Voice AI Models for tasks such as Automatic Speech Recognition (ASR), Text-to-Speech (TTS), NLP, Multilingual Translation, Summarization, Language modeling, and other Speech/text generation tasks. You will work closely with a team of engineers to develop, train, and optimize Voice AI models for efficient offload to NPU, GPU, and CPU. Additionally, you will conduct model evaluation studies, competitive analysis, and collaborate with other R&D and Systems teams for system integration, use case validation, efficient offload to HW accelerators, and commercialization support. The ideal candidate should have strong programming skills in C/C++ and Python, along with experience in ML inference optimizations. Proficiency in designing, implementing, and training DL models using high-level languages/frameworks such as PyTorch, TensorFlow, and ONNX is required. Knowledge of ML architectures and operators like Transformers, LSTM, GRUs, and familiarity with recent trends in machine learning and traditional statistical modeling/feature extraction techniques are essential. Experience in Speech-to-text, Text-to-Speech, Speech-to-Speech, NLP applications, model quantization, compression techniques, software development on embedded platforms, software design patterns, multi-threaded programming, computer architecture, operating systems, data structures, algorithms, fixed-point coding, and AI HW accelerators (NPU or GPU) is a plus. Candidates should hold a Bachelor's/Masters/PhD degree in Engineering, Electronics and Communication, Computer Science, or related field, along with 3+ years of experience in Audio Systems engineering, Audio Signal Processing modules, ML Model development, or related work. Minimum qualifications include a Bachelor's degree in Engineering, Information Systems, Computer Science, or related field with 2+ years of Systems Engineering or related work experience, or a Master's degree with 1+ year of experience, or a PhD in a related field. Qualcomm is an equal opportunity employer committed to providing accessible processes for individuals with disabilities. Individuals seeking accommodation during the application/hiring process can contact Qualcomm for support. The company expects its employees to adhere to all applicable policies and procedures, including security requirements regarding protection of confidential information. Please note that Qualcomm does not accept unsolicited resumes or applications from agencies. Staffing and recruiting agencies are not authorized to submit profiles, applications, or resumes on behalf of individuals. For more information about this role, please contact Qualcomm Careers.,

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

As a member of our team, you will be responsible for working on the Conversational Text AI Platform, where your primary tasks will include building and maintaining the system using cutting-edge LLM frameworks. You will collaborate closely with product owners and domain experts to develop reusable components for various business processes. Additionally, you will play a key role in developing core infrastructure and reusable components to facilitate the deployment of conversational AI systems. Your work will involve orchestration, prompt engineering, and integrating LLM-powered solutions with enterprise data platforms. In the realm of Generative AI & Model Optimization, you will be engaged in fine-tuning LLMs/SLMs using proprietary NBFC data, as well as performing distillation and quantization of models for edge deployment. Your responsibilities will also include evaluating and running LLM/SLM models on local/edge server machines. Furthermore, you will have the opportunity to build self-learning systems that can adapt without requiring full retraining, enabling real-time learning on the edge through lightweight local models. The ideal candidate for this role will possess a Bachelor's or Master's degree in computer science, engineering, or a related field, along with a minimum of 7 years of experience in Python, Node.JS, JavaScript, HTML/CSS, Redis, Postgres, Azure COSMOS, DevOps, and CI/CD, with exposure to AI/ML. Strong programming skills in Python, Node.JS, JavaScript, and HTML/CSS are essential, along with familiarity with Redis, Postgres, Vector Embeddings, Speech-to-Text & Text-to-Speech Services, Azure COSMOS, DevOps, CI/CD, Lang-Chain, or Lang-Graph. Experience in building or integrating LLMs for task automation, reasoning, or autonomous workflows, as well as a solid understanding of prompt engineering, tool calling, and agent orchestration, will be highly valued.,

Posted 1 month ago

Apply

4.0 - 8.0 years

0 Lacs

kolkata, west bengal

On-site

You are a highly skilled ChatGPT Developer with expertise in AI development, web technologies, and iOS mobile app development. Your main responsibility will be designing, developing, and integrating ChatGPT-based AI solutions into web and mobile applications. To excel in this role, you must have a deep understanding of natural language processing (NLP), machine learning (ML), API development, and user experience across various platforms. In the realm of AI & Chatbot Development, you will be tasked with designing and developing AI-powered chatbot solutions using ChatGPT, OpenAI APIs, or similar NLP frameworks. Your goal is to implement conversational AI models that enhance user interactions across web and iOS applications. Additionally, you will train and fine-tune language models to provide better responses, personalization, and user engagement. Moreover, you will implement AI-driven recommendation systems, sentiment analysis, and automated workflows. When it comes to Web Development & Integration, your role involves developing and integrating ChatGPT-powered chatbots into web applications using technologies such as React, Angular, or Vue.js. You will also be responsible for building RESTful and GraphQL APIs to facilitate seamless communication between AI models and frontend applications. Your focus will be on ensuring responsive, scalable, and high-performing web applications that seamlessly integrate AI functionalities. You will collaborate with cloud platforms like Azure, AWS, or Google Cloud to deploy AI models and services effectively. In the iOS Mobile Development domain, you will develop and integrate ChatGPT-powered chatbots into iOS applications using Swift, SwiftUI, or Objective-C. Your task is to optimize AI chatbot interactions to provide mobile-friendly experiences. Furthermore, you will implement features such as push notifications, voice input, and AI-driven automation to enhance mobile user engagement. Your ultimate goal is to ensure seamless performance across various iOS devices and OS versions. To thrive in this role, you must possess strong expertise in ChatGPT, OpenAI APIs, NLP, and AI model integration. You should have experience in Python, TensorFlow, PyTorch, or LangChain for AI model development. Proficiency in web development technologies like React, Angular, Vue.js, and Node.js is essential. Moreover, you must have a solid background in iOS development using Swift, SwiftUI, and Objective-C. Hands-on experience with cloud-based AI services such as Azure AI, AWS AI, and Google AI is highly beneficial. Additionally, familiarity with database management, RESTful APIs, GraphQL, WebSockets, and serverless computing will be advantageous. Preferred qualifications include experience with speech-to-text and voice assistant technologies, familiarity with mobile AI SDKs like Core ML, TensorFlow Lite, and Apple Neural Engine, as well as knowledge of AI ethics, security, and privacy considerations. Certifications in AI, web, or iOS development (e.g., OpenAI, AWS AI, Microsoft AI) are considered a plus. In terms of educational qualifications, a Bachelors or Masters degree in Computer Science, Artificial Intelligence, Data Science, or a related field is required. Relevant certifications in AI, web, or mobile development will be beneficial for this role.,

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

You will be working as an AI Engineer with expertise in Speech-to-text and Text Generation to tackle a Conversational AI challenge for a client in EMEA. The project aims to transcribe conversations and utilize generative AI-powered text analytics for enhancing engagement strategies and decision-making processes. Your main responsibilities will include developing Conversational AI & Call Transcription solutions, creating NLP & Generative AI Applications, performing Sentiment Analysis & Decision Support tasks, and handling AI Deployment & Scalability aspects. You will be expected to work on real-time transcription, intent analysis, sentiment analysis, summarization, and decision-support tools. Key technical skills required for this role include a strong background in Speech-to-Text (ASR), NLP, and Conversational AI, along with hands-on experience in tools like Whisper, DeepSpeech, Kaldi, AWS Transcribe, Google Speech-to-Text, Python, PyTorch, TensorFlow, Hugging Face Transformers, LLM fine-tuning, RAG-based architectures, LangChain, and Vector Databases (FAISS, Pinecone, Weaviate, ChromaDB). Experience in deploying AI models using Docker, Kubernetes, FastAPI, Flask will be essential. In addition to technical skills, soft skills such as translating AI insights into business impact, problem-solving abilities, and effective communication skills to collaborate with cross-functional teams will be crucial for success in this role. Preferred qualifications include experience in healthcare, pharma, or life sciences NLP use cases, a background in knowledge graphs, prompt engineering, and multimodal AI, as well as familiarity with Reinforcement Learning (RLHF) for enhancing conversation models.,

Posted 1 month ago

Apply

10.0 - 14.0 years

0 Lacs

karnataka

On-site

You will be part of a team that is pioneering the development of the world's first agentic AI voice-to-voice mental health platform, offering personalized therapy globally. The platform aims to provide validated emotional support and mental health care on a large scale to address the global mental health crisis. Your role as Co-founder & CTO will require you to lead the technical vision and execution, overseeing core AI infrastructure, voice integration, and team management. As the CTO, you will be responsible for refining and implementing revolutionary AI architecture for both B2C and B2B platforms, developing cutting-edge voice technology for therapeutic effectiveness, and ensuring secure, compliant systems. Your expertise in engineering or computer science, coupled with 10+ years of experience in scaling consumer applications and enterprise solutions, will be essential in building a global-scale platform that can impact millions of lives positively. Your exceptional leadership qualities, entrepreneurial mindset, and strategic thinking abilities will play a crucial role in balancing technical excellence with business impact. You must have a passion for leveraging technology to solve significant human problems and be capable of inspiring and motivating technical teams. The role also requires proficiency in full-stack development, cloud infrastructure, core AI/ML technologies, and preferably voice technology, healthcare compliance, and blockchain development. This extraordinary opportunity will allow you to architect a technological solution that could revolutionize mental health care globally, working alongside a top-tier team with a shared vision of making a meaningful impact. You will have the autonomy to make critical technical decisions, influence a platform that transcends borders and cultures, and contribute to groundbreaking work at the intersection of AI, healthcare, and human empathy. In return, you will receive a substantial equity stake in the company, technical freedom to drive innovation, the chance to create a global impact, and the opportunity to collaborate with exceptional co-founders who are dedicated to changing the world. If you are a visionary leader and technologist ready to tackle one of humanity's greatest challenges, this role offers a unique chance to be part of a transformational journey in human healthcare. Ready to be a catalyst for change and contribute to a pivotal moment in global mental health care We are looking for exceptional individuals like you to join us in this mission. Location: India, Dubai, Remote Compensation: Equity, with salary on the close of pre-seed round Stage: Early (pre-seed / MVP in development),

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies