Posted:1 week ago|
Platform:
Hybrid
Full Time
Role Brief: We are seeking a skilled and experienced MLOps Engineer to join our team and drive the operationalization of machine learning models and pipelines at scale. The ideal candidate will be responsible for automating, deploying, monitoring, and maintaining AI/ML solutions. Turning prototypes into robust, customer- ready solutions while mitigating risks like production pipeline failures, will be primary. This role requires expertise in infrastructure management, CI/CD pipelines, cloud services, model orchestration and collaboration with cross- functional teams to ensure seamless deployment into diverse customer environments. Primary Responsibilities: Strategizing and implementing scalable infrastructure for ML or LLM model pipelines using tools like Kubernetes, Docker, and cloud services such as AWS (e.g., AWS Batch, Fargate, Bedrock) Manage auto-scaling mechanisms to handle varying workloads and ensure high availability of RestAPIs Automate CI/CD pipelines and Lambda functions for model testing, deployment, and updates, reducing manual errors and improving efficiency. Amazon SageMaker Pipelines for end-to-end ML workflow automation. Optimize utilizing step-functions Set up reproducible workflows for data preparation, model training, and deployment. Provision and optimize cloud resources (e.g., GPUs, memory) to meet computational demands of large models like those used in RAG systems Use Infrastructure-as-Code (IaC) tools like Terraform to standardize provisioning C deployments Automate retraining workflows to keep models updated as data evolves Work closely with data scientists, ML engineers, and DevOps teams to integrate models into production environments. Implement monitoring tools to track model performance and detect issues like drift or degradation in real- time. Monitoring dashboards with real-time alerts for pipeline failures or performance issues C Implementing Model Observability frameworks. Required Skills: Education Any Engineering (BE/Btech/ME/Mtech) Min 4 years of experience with AWS services such as Lambda, Bedrock, Batch with Fargate, RDS (PostgreSQL), DynamoDB, SQS, CloudWatch, API Gateway, SageMaker Expertise in containerization (Docker C Kubernetes) for consistent deployments C orchestration tools like Airflow, ArgoCD, Kubeflow etc. Experience with CI/CD tools (e.g., Jenkins, GitLab CI/CD) and IaC tools like Terraform Knowledge of ML frameworks (e.g., PyTorch, TensorFlow) to understand model requirements during deployment Experience with RestAPI Frameworks like FastAPIs, Flask Familiarity with model observability like Evidently, NannyML, Phoenix and monitoring tools (Grafana etc)
Aurigo Software Technologies
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Software / Information Technology
51-200 Employees
22 Jobs
Key People
Greater Hyderabad Area
Experience: Not specified
Salary: Not disclosed
Bangalore Rural, Bengaluru
15.0 - 25.0 Lacs P.A.
Bengaluru / Bangalore, Karnataka, India
5.0 - 7.0 Lacs P.A.
Chennai, Tamil Nadu, India
3.0 - 8.0 Lacs P.A.
Bengaluru
18.0 - 33.0 Lacs P.A.
Bengaluru
5.0 - 9.0 Lacs P.A.
Noida, Uttar Pradesh, India
Salary: Not disclosed
15.0 - 27.5 Lacs P.A.
15.0 - 20.0 Lacs P.A.
15.0 - 20.0 Lacs P.A.