We work on Apple-scale opportunities and challenges. We are engineers at heart, and we like solving technical problems. We believe a good engineer has the curiosity to dig into the inner workings of technology and is always experimenting, reading, and learning. If you are a software engineer with a passion for code who loves digging into the internals of any technology and is fascinated by distributed systems architecture, we want to hear from you.
Description
We are seeking a highly skilled LLM Ops and ML Ops Engineer to lead the deployment, scaling, monitoring, and optimization of large language models (LLMs) across diverse environments. This role is critical to ensuring our machine learning systems are production-ready, high-performing, and resilient. The ideal candidate will have deep expertise in Python or Go programming, a comprehensive understanding of LLM internals, and hands-on experience with a range of inference engines and deployment strategies. This person should be able to balance multiple competing priorities and deliver solutions in a timely manner, understand complex architectures, and be comfortable working with multiple teams.
Key Responsibilities
- Design and build scalable infrastructure for fine-tuning and deploying large language models.
- Develop and optimize inference pipelines using popular frameworks and engines (e.g., TensorRT, vLLM, Triton Inference Server).
- Implement observability solutions for model performance, latency, throughput, GPU/TPU utilization, and memory efficiency.
- Own the end-to-end lifecycle of LLMs in production, from experimentation to continuous integration and continuous deployment (CI/CD).
- Collaborate with research scientists, ML engineers, and backend teams to operationalize groundbreaking LLM architectures.
- Automate and harden model deployment workflows using Python, Kubernetes, containers, and orchestration tools such as Argo Workflows and GitOps.
- Design reproducible model packaging, versioning, and rollback strategies for large-scale serving.
- Stay current with advances in LLM inference acceleration, quantization, distillation, and model compilation techniques (e.g., GGUF, AWQ, FP8).
Minimum Qualifications
- 5+ years of experience in LLM/ML Ops, DevOps, or infrastructure engineering with a focus on machine learning systems.
- Advanced proficiency in Python or Go, with the ability to write clean, performant, and maintainable production code.
- Deep understanding of transformer architectures, LLM tokenization, attention mechanisms, memory management, and batching strategies.
- Proven experience deploying and optimizing LLMs using multiple inference engines.
- Strong background in containerization and orchestration (Kubernetes, Helm).
- Familiarity with monitoring tools (e.g., Prometheus, Grafana), logging frameworks, and performance profiling.
Preferred Qualifications
- Experience integrating LLMs into microservices or edge inference platforms.
- Experience with Ray for distributed inference.
- Hands-on experience with quantization libraries.
- Contributions to open-source ML infrastructure or LLM optimization tools.
- Familiarity with cloud platforms (AWS, GCP) and infrastructure-as-code (Terraform).
- Exposure to secure and compliant model deployment workflows.