Site Reliability Engineer

2 - 4 years

4 - 6 Lacs

Posted:-1 days ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

We re transforming the software industry. We re Flexera. With more than 50,000 customers across the world, we re achieving that goal . But we know we can t do any of that without our team . Ready to help us re-imagine the industry during a time of substantial growth and ambitious plansCome and see why we re consistently recognized by Gartner, Forrester and IDC as a category leader in the marketplace. Learn more at flexera.com

Key Responsibilities
  • Design, implement, and maintain reliable, scalable, and secure cloud infrastructure on AWS .

  • Develop and manage Infrastructure as Code (IaC) using Terraform , ensuring consistent and repeatable infrastructure deployments.

  • Collaborate with development teams to design and implement and maintain CI/CD pipelines , ensuring smooth and automated deployments.

  • Maintain and optimize networking configurations (VPCs, subnets, routing, load balancers, DNS, security groups).

  • Support and participate in on-call rotations , ensuring high availability of critical systems.

  • Design, implement, and maintain robust observability solutions covering metrics, logs, and traces, with a strong understanding of observability platforms such as Datadog, Prometheus, Grafana, Coralogix, or New Relic, to enable proactive monitoring, alerting, and deep system visibility.

  • Troubleshoot complex production issues across systems and applications, driving root cause analysis (RCA) and long-term resolutions.

  • Collaborate with development teams to design and implement CI/CD pipelines , ensuring smooth and automated deployments.

  • Continuously evaluate and integrate new tools and technologies to improve system performance and operational efficiency.

Required Skills & Experience
  • 2-4 years of hands-on experience as an SRE , DevOps Engineer , or Cloud Infrastructure Engineer .

  • Strong experience with AWS (EC2, S3, RDS, ECS/EKS, CloudWatch, IAM, Route 53, ALB/NLB, etc.).

  • Proficiency in Terraform for infrastructure provisioning and management.

  • Solid understanding of networking fundamentals (TCP/IP, DNS, VPN, load balancing, firewalls, routing).

  • Experience with CI/CD tools (GitHub Actions, AWS CodePipeline etc).

  • Familiarity with containerization and orchestration (Docker, Kubernetes, or ECS).

  • Proficiency in Linux system administration , shell scripting, and automation (Bash, Python, or Go).

  • Hands-on experience with monitoring and observability platforms (e.g., Datadog, Prometheus, Grafana, Coralogix, New Relic, ELK, or OpenTelemetry).

  • Strong analytical, troubleshooting, and problem-solving skills.

  • Good understanding of security best practices and compliance in cloud environments.

Good to Have
  • Experience with Infrastructure automation frameworks (Ansible, Packer).

  • Expertise in Kubernetes for container orchestration, deployment, and scaling.

  • Exposure to multi-cloud or hybrid cloud setups.

  • Experience in incident response and postmortem analysis .

  • Familiarity with cost optimization and performance tuning in AWS environments.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Flexera Software logo
Flexera Software

Software Asset Management

Schaumburg

RecommendedJobs for You