Posted:1 day ago|
Platform:
Hybrid
Full Time
- Be responsible for both on-premises and cloud-based infrastructure. - Function as an extension of the existing staff and teams.
- Possess deep expertise in:
- Troubleshooting
- Infrastructure (physical and cloud) - Automation - Enterprise-level system administration - Agile workflows - Enterprise change management
Cloud Technologies:
- AWS native load balancers - AWS EC2, ECS, EKS, Containers - Terraform
Monitoring & Observability:
- Splunk Cloud Observability - CloudWatch
DevOps & Automation:
- CI/CD - Jenkins - Automation with Python
- Deploy and maintain shared platform team assets (e.g., ECS clusters, ALBs) - Deploy and maintain unique or non-standard infrastructure assets
- Assist developer teams in standardizing deployments
- Minimize service operational costs
- Perform periodic cost analysis to identify cost-saving opportunities
- Conduct capacity analysis for production and non-production environments - Right-size domain assets for performance and availability
- Leverage automation (e.g., autoscaling)
- Collaborate on defining SLOs for service availability
- Coordinate deployment quality objectives - Develop pattern-based service monitors - Implement uniform service measurement and monitoring
- Develop performance measurement techniques
- Assist in refining and improving service efficiency over time
- Provide on-call support and drive service restoration
- Implement InfoSec-recommended patterns - Monitor anomalies using internal tools
- Establish targeted alerting and predefined NOC response procedures
- Develop and maintain monitoring and alerting systems
- Manage the incident response lifecycle (runbooks, dashboards, automation) - Automate operational tasks for efficiency
- Participate in on-call rotations - Design performance testing and capacity planning strategies
- Collaborate across teams to troubleshoot and resolve issues
- Strong problem-solving skills
Hands-on experience with:
- Cloud Platforms: AWS, Azure, GCP - IaC Tools: Terraform or CloudFormation - Programming Languages: Python, Java, C/C++, Go, JavaScript, or Ruby - Log Aggregation: Splunk, ELK, or SumoLogic - Monitoring Tools: SignalFx, Datadog, Dynatrace, AppDynamics
- Prior roles in SRE, Software Engineering, or Production Engineering - Passion for learning and improving systems
- Interest in SLIs, SLOs, resilience, scaling, system Design and performance
- Experience with large-scale distributed systems
Familiarity with configuration and automation tools: - Terraform, Puppet, Ansible
Experience with CI/CD and DevOps toolchain:
- Git, Jenkins, Docker, Nexus, Artifactory, Selenium
Knowledge of cloud security practices, including: - Intrusion detection
- Penetration testing
- Vulnerability scanning
Genpact
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Now18.0 - 22.5 Lacs P.A.
hyderabad, bengaluru, delhi / ncr
25.0 - 37.5 Lacs P.A.
bengaluru
0.5 - 3.0 Lacs P.A.
pune, chennai, bengaluru
15.0 - 30.0 Lacs P.A.
hyderabad
9.5 - 13.0 Lacs P.A.
bengaluru
0.6 - 1.0 Lacs P.A.
bengaluru, mumbai (all areas)
10.0 - 15.0 Lacs P.A.
hyderabad, pune
10.0 - 15.0 Lacs P.A.
2.25 - 3.75 Lacs P.A.
chennai, bengaluru, delhi / ncr
20.0 - 35.0 Lacs P.A.