AI Systems Engineer

0 years

0 Lacs

Posted: 1 day ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

About Us

Shunya Labs is building the Voice AI Infrastructure Layer for Enterprises, powering speech intelligence, conversational agents, and domain-specific voice applications across industries. Born from deep work in mental-health AI and built for global enterprise scale, our stack combines state-of-the-art ASR/TTS models with an open-weights philosophy, driving accuracy, privacy, and scalability.


About the Role

AI Systems Engineer

You will evaluate, host, and optimize a wide range of AI models, spanning ASR, LLMs, and multimodal systems, and build the orchestration layer that powers scalable, low-latency deployments.

ambiguity

You’ll work across the full stack, from GPU inference tuning to React-based control dashboards, building a resilient and scalable AI delivery platform.


Key Responsibilities


AI Model Evaluation & Optimization

·      Evaluate, benchmark, and optimize AI models (speech, text, vision, multimodal) for latency, throughput, and accuracy.

ONNX Runtime

latest AI runtimes

·      Develop efficient caching and model loading strategies for multi-tenant serving.


AI Infrastructure & Orchestration

central orchestration layer

scalable, fault-tolerant deployments

Kubernetes autoscaling

·      Implement observability and monitoring (Prometheus, Grafana, CloudWatch) across the model-serving ecosystem.


DevOps, CI/CD & Automation

CI/CD pipelines

Dockerized environments

infrastructure-as-code


Frontend & Developer Tools

React/Next.js

·      Build intuitive internal tools for model comparison, experiment management, and deployment control.

Cursor


Client Interaction & Solutioning

functional and performance requirements

deployable AI systems

·      Prototype quickly, iterate with feedback, and deliver robust production systems.


Research & Continuous Innovation

latest AI research and model releases

·      Evaluate emerging frameworks for model serving, fine-tuning, and retrieval (LangChain, LlamaIndex, GraphRAG, etc.).

·      Proactively identify and implement performance or cost improvements in the model serving stack.

·      Share learnings and contribute to the internal AI knowledge base.


Ambiguous Problem Solving

undefined problem spaces

·      Break down high-level goals into actionable technical strategies.

·      Balance trade-offs between accuracy, latency, and cost while innovating under uncertainty.


Required Skills

Python

Docker

inference optimization

real-time inference pipelines

CI/CD pipelines

React/Next.js

API design


Nice to Have

LangChain

speech processing models

serverless inference

data pipelines


Soft Skills

·      Excellent problem-solving in ambiguous, evolving environments.

·      Strong ability to research, self-learn, and prototype emerging AI technologies.

·      Confident communicator who can translate technical findings to business impact.

·      Ownership mindset with a collaborative, solution-oriented approach.

