Remote
Full Time
We are looking for an ML Engineer with hands-on experience in deploying large language models (LLMs) on GPU infrastructure. This role combines ML engineering with DevOps, focusing on scalable deployments, API integration, and optimization of LLM performance.
Deploy and optimize LLMs on GPU-based infrastructure.
Build and manage APIs for model serving (Python-based).
Implement CI/CD, monitoring, and scaling for ML models.
Collaborate on prompt engineering and model optimization.
Manage containerized workloads (Docker/Kubernetes).
4–5 years of ML/DevOps engineering experience.
Strong in Python, APIs, and LLM architecture.
Experience with GPU deployments and cloud platforms (AWS/GCP/Azure).
Familiarity with prompt engineering and inference optimization.
IntraEdge
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Nowhyderabad, chennai, bengaluru
6.0 - 8.0 Lacs P.A.
bengaluru, delhi / ncr, mumbai (all areas)
15.0 - 30.0 Lacs P.A.
Salary: Not disclosed
Salary: Not disclosed
Salary: Not disclosed
chennai, tamil nadu, india
Salary: Not disclosed
bengaluru
5.0 - 8.0 Lacs P.A.
delhi, delhi, india
Salary: Not disclosed
bengaluru, karnataka, india
Salary: Not disclosed
20.0 - 35.0 Lacs P.A.