Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in kochi
>
FriskaAi
>
AI/ML Engineer – ASR, Speech Enhancement, Computer Vision & Cloud Deployment

AI/ML Engineer – ASR, Speech Enhancement, Computer Vision & Cloud Deployment

FriskaAi

0 years

0 Lacs

kochi kerala india

Posted:2 days ago| Platform:

Apply

Skills Required

ai ml vision deployment healthcare communication recognition signal azure collaboration latency visual design tuning learning multilingual personalization processing algorithms extraction spectrogram analysis beamforming tracking video model quantization distillation onnx gcp vertex docker containerization support autoscaling scheduling engineering git versioning testing code reliability monitoring evaluation research opencv pytorch tensorflow inference aws python fastapi flask service

Work Mode

On-site

Job Type

Full Time

Job Description

Employment type: Full time

Location: Kochi, Kerala

FriskaAi

Job Summary

Automatic Speech Recognition (ASR), Speech Enhancement, Computer Vision (CV), and Scalable Cloud Deployment

real-time, low-latency, speech-driven and audio-visual AI applications

Key Responsibilities

ASR and Speech recognition

Design, train and optimize ASR models (Whisper, Conformer, Wav2Vec2, SpeechBrain, etc.) with focus on
speaker adaptation and impaired speech recognition
.
Implement domain adaptation techniques (fine-tuning, transfer learning, LoRA) for
accent, dialect, multilingual, and special-use cases
.
Develop and integrate
speaker recognition, diarization & personalization modules
.
Work with
real-time/streaming ASR, VAD, and low-latency decoding
.

Signal processing and speech enhancement

Apply
speech enhancement, noise reduction, denoising, dereverberation, and echo cancellation
algorithms to improve input quality.
Work with feature extraction methods (
MFCC, PLP, spectrogram analysis, filterbanks
) for robust ASR performance.
Handle
far-field, multi-mic audio and beamforming-based processing
.

Computer vision and lip reading

Develop and optimize
computer vision models
using
CNNs, Vision Transformers, YOLO, and OpenCV
.
Build and deploy
face detection, face tracking, face recognition, and lip-reading (visual speech recognition) systems
.
Develop
audio-visual speech recognition (ASR + lip reading)
for noisy and real-time environments.
Optimize CV models for
real-time video inference
.

Model Optimization

Apply
quantization, pruning, knowledge distillation, and LoRA
for
low-latency, resource-efficient deployment
.
Optimize models using
ONNX, TensorRT, TorchScript
.
Balance trade-offs between
accuracy, speed, and scalability
.

Cloud and Deployment

Deploy AI/ASR/CV pipelines on
Azure (AKS, Cognitive Services, Functions)
and
GCP (Vertex AI, Cloud Run, BigQuery)
.
Build scalable APIs and services for
real-time speech and video processing
.
Use
Docker/Kubernetes
for containerization and orchestration.
Integrate with
CI/CD pipelines
for automated model retraining and updates.
Support
autoscaling, GPU scheduling, and high-availability deployments
.

Backend and Engineering Practices

Collaborate with backend engineers to integrate ASR and CV modules with
production systems
.
Build
RESTful APIs and microservices
for ASR, speech enhancement, face, and lip-reading tasks.
Handle
audio/video ingestion, buffering, chunking, and timestamp alignment
.
Follow best practices with
Git, versioning, unit testing, and code reviews
.
Ensure system
reliability, monitoring, and logging
for multimodal pipelines.

Evaluation and Continuous Learning

Evaluate ASR models with
WER, CER, SER, RTF
and CV/lip-reading models with
precision, recall, mAP, FPS
.
Incorporate
user feedback loops
for continuous improvement.
Stay updated with latest research in
ASR, Speech AI, Computer Vision, and Cloud AI services
.

Required skills and Qualifications

Strong experience with
ASR frameworks
(Whisper, Conformer, Wav2Vec2, NeMo, Riva, Coqui STT, Kaldi).
Strong experience with
Computer Vision & lip-reading frameworks
(OpenCV, YOLO, CNNs, Vision Transformers).
Solid background in
speech signal processing
(MFCC, PLP, spectrograms, denoising, beamforming, echo cancellation).
Hands-on with
Deep Learning frameworks
(PyTorch, TensorFlow, SpeechBrain).Experience with
streaming ASR, diarization, VAD, and real-time inference pipelines
.
Cloud deployment experience:
Azure AI, GCP Vertex AI, AWS (bonus)
.
Proficiency in
Python (FastAPI/Flask/Django)
for backend service integration.

More Jobs at FriskaAi

Backend Developer (C#, ASP.NET Core, SQL Server, Microservices)

Kerala, India

3.0 - 3.0 yrs

Salary: Not disclosed

Director of Business Development

India

5.0 - 5.0 yrs

Salary: Not disclosed

Backend Developer (C#, ASP.NET Core, SQL Server, Microservices)

Kochi, Kerala, India

2.0 - 2.0 yrs

Salary: Not disclosed

Android (Kotlin) Developer

Kochi, Kerala, India

2.0 - 2.0 yrs

Salary: Not disclosed

Backend Developer (Python, Django, Azure, SQL)

Kerala, India

4.0 - 4.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

FriskaAi

RecommendedJobs for You

AI/ML Engineer – ASR, Speech Enhancement, Computer Vision & Cloud Deployment

FriskaAi

kochi, kerala, india

AI/ML Engineer – ASR, Speech Enhancement, Computer Vision & Cloud Deployment

FriskaAi

kochi, kerala, india

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

AI/ML Engineer – ASR, Speech Enhancement, Computer Vision & Cloud Deployment

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

Employment type: Full time

Location: Kochi, Kerala

FriskaAi

Job Summary

Automatic Speech Recognition (ASR), Speech Enhancement, Computer Vision (CV), and Scalable Cloud Deployment

real-time, low-latency, speech-driven and audio-visual AI applications

Key Responsibilities

ASR and Speech recognition

speaker adaptation and impaired speech recognition

accent, dialect, multilingual, and special-use cases

speaker recognition, diarization & personalization modules

real-time/streaming ASR, VAD, and low-latency decoding

Signal processing and speech enhancement

speech enhancement, noise reduction, denoising, dereverberation, and echo cancellation

MFCC, PLP, spectrogram analysis, filterbanks

far-field, multi-mic audio and beamforming-based processing

Computer vision and lip reading

computer vision models

CNNs, Vision Transformers, YOLO, and OpenCV

face detection, face tracking, face recognition, and lip-reading (visual speech recognition) systems

audio-visual speech recognition (ASR + lip reading)

real-time video inference

Model Optimization

quantization, pruning, knowledge distillation, and LoRA

low-latency, resource-efficient deployment

ONNX, TensorRT, TorchScript

accuracy, speed, and scalability

Cloud and Deployment

Azure (AKS, Cognitive Services, Functions)

GCP (Vertex AI, Cloud Run, BigQuery)

real-time speech and video processing

Docker/Kubernetes

CI/CD pipelines

autoscaling, GPU scheduling, and high-availability deployments

Backend and Engineering Practices

production systems

RESTful APIs and microservices

audio/video ingestion, buffering, chunking, and timestamp alignment

Git, versioning, unit testing, and code reviews

reliability, monitoring, and logging

Evaluation and Continuous Learning

WER, CER, SER, RTF

precision, recall, mAP, FPS

user feedback loops

ASR, Speech AI, Computer Vision, and Cloud AI services

Required skills and Qualifications

ASR frameworks

Computer Vision & lip-reading frameworks

speech signal processing

Deep Learning frameworks

streaming ASR, diarization, VAD, and real-time inference pipelines

Azure AI, GCP Vertex AI, AWS (bonus)

Python (FastAPI/Flask/Django)

More Jobs at FriskaAi