IT Consulting professional

3 - 6 years

5 - 9 Lacs

Posted: 18 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are seeking an experienced and highly specialized Senior CloudOps Engineer to manage, automate, and secure our production cloud infrastructure and Machine Learning (ML)/Large Language Model (LLM) operational pipelines. This role is strictly focused on the operations and infrastructure that support our data science and engineering teams; it is not a data science or core LLM development position.

Key Responsibilities and Required Expertise

The successful candidate will be an expert in all of the following areas, driving high availability, scalability, and security.

Cloud Infrastructure & Automation

Infrastructure as Code (IaC): Deep expertise in managing and provisioning infrastructure using Terraform.
Containerization & Orchestration: Advanced deployment, scaling, and management of services using Docker/Kubernetes.
Networking & Services: Architecting and maintaining high-performance API layers and microservices.
AWS CloudOps: Expert proficiency in AWS operational services, including EventBridge and Step Functions, for building robust automation flows.
Data Storage: Managing and optimizing critical AWS data services, including S3, DynamoDB, Redshift, and Kinesis.

MLOps Tooling & Monitoring

ML/LLM Tooling Support: Providing and maintaining the operational infrastructure for ML/LLM systems, including model registry/versioning tools like MLflow/SageMaker.
Pipeline Automation (CI/CD): Designing and implementing robust CI/CD pipelines for ML/LLM deployments using tools like GitHub Actions/Jenkins.
Model Operations: Building the infrastructure to support drift detection and retraining capabilities.
Monitoring & Alerting: Implementing comprehensive observability stacks using Prometheus/Grafana/CloudWatch.
Incident Management: Leading resolution efforts for production issues, including expertise with PagerDuty and on-call responsibilities.

Security & Compliance (FinOps)

Cloud Security: Establishing and enforcing strong security policies and best practices across the cloud environment (IAM, VPC, Secrets).
AWS Security Services: Expert knowledge and application of specific AWS security tools like IAM, KMS, and Secrets Manager.
Cost Optimization: Leading cost optimization (FinOps) initiatives, balancing performance and efficiency across all cloud resources.
