Posted:2 hours ago|
Platform:
Remote
Full Time
Roles and Responsibilities:
Build and optimize model serving infrastructure with a focus on inference latency and cost
optimization
Architect efficient inference pipelines that balance latency, throughput, and cost across various
acceleration options
Develop monitoring and observability solutions for ML systems
Collaborate with ML Engineers to establish best practices for optimized model deployment
Implement cost-efficient, enterprise-scale solutions
Collaborate in a cross-functional, distributed team for continuous system improvement
Work with MLEs, QA Engineers, and DevOps Engineers
Evaluate and implement new technologies and tools
Contribute to architectural decisions for distributed ML systems
Experience and Qualifications:
5+ years of experience in software engineering with Python
Experience with ML frameworks, particularly PyTorch
Experience optimizing ML models with hardware acceleration (AWS Neuron , ONNX, TensorRT)
Experience with AWS ML services and hardware-accelerated instances (Sagemaker, Inferentia,Trainium)
Proven experience building and operating AWS serverless architectures
Deep understanding of event-driven processing patterns, SQS/SNS and serverless caching solutions
Experience with containerization using Docker and orchestration tools
Strong knowledge of RESTful API design and implementation
Proficiency in writing good quality & secure code and be familiar with static code analysis tools
Excellent analytical, conceptual and communication skills in spoken and written English
Experience applying Computer Science fundamentals in algorithm design, problem solving, and complexity analysis
New Groyp Talentoj
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Nowbengaluru
25.0 - 40.0 Lacs P.A.
3.5 - 7.5 Lacs P.A.
ahmedabad
Experience: Not specified
1.0 - 3.0 Lacs P.A.
lucknow
8.0 - 12.0 Lacs P.A.
bengaluru
40.0 - 45.0 Lacs P.A.
pune, thiruvananthapuram
15.0 - 30.0 Lacs P.A.
chennai
15.0 - 25.0 Lacs P.A.
bengaluru
25.0 - 40.0 Lacs P.A.
15.0 - 27.5 Lacs P.A.
Experience: Not specified
0.5 - 0.6 Lacs P.A.