Posted: 3 weeks ago
On-site
Full Time
We are looking for a strong software engineer with 3–4 years of hands-on experience to join our high-performance team.
Key requirements
* Experience with high-throughput inference engines such as vLLM
* Solid understanding of transformer-based AI models (attention, KV cache, paged attention, quantization, etc.)
* Proven ability to design and implement efficient algorithms & data structures
* Good Python (async, multiprocessing, performance profiling) and C++ skills
* Experience with production inference optimization (batching, prefix caching, continuous batching) is highly preferred
Job Type: Full-time
Pay: ₹372,698.48 - ₹1,587,520.05 per year
Work Location: In person
Expedera India Pvt Ltd