8 - 10 years

0 Lacs

Posted:4 days ago| Platform: Indeed logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Bengaluru, Karnataka, India
Job Type
Full Time
About the Role
Design and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.
Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence. Proactively maintain services once they are live by measuring and monitoring availability, latency and overall system health. Respond quickly to issues and mobilise responsible individuals quickly to achieve the fasted possible resolution. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews Scale system and service sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and speed of service resolution. Continually analyse service to end customers with a view to enhancing customer experience, eradicating issues, fixing root causes and driving quality into everything we do. Educating support operations and customer help desks to adapt to new ways of working by increasing skills and knowledge. Perform RCAs, publish reports and take it to the next level by inventing short/long term fixes and further Runbooks. Be part of the Agile Mode of delivering Work Products by performing Backlog planning, Sprint Planning, Design Reviews, Peer Reviews and Retrospectives Mandatory Skills Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review Experience 8 to 10 Years Education Qualificaiton Bachelor DegreeRoles & Responsibilities Experience in one or more of the following: C, C++, Java, Python, Go, Ruby or shell scripting Experience with Windows and Unix/Linux operating systems internals and administration (e.g. filesystems, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware) Experience with containers and containers orchestration (e.g. Kubernetes, Docker) Extensive knowledge of AWS Hands-on experience with IAC tools such as Cloudformation and Terraform Experience with Configuration Management tools such as Ansible, Chef. Experience with cloud hosted application-monitoring tools such as Kibana, ELK stack etc Experience with Observability tools such as Dynatrace or Datadog Excellent communication skills with the ability to present complex technical information in a clear and concise manner to a variety of audiences, both technical and non-technical Comfortable working in a fast-paced, multi-tasking, dynamic environment Experience with deployment automation, working with platforms for configuration management, provisioning and artifact repositories. Preferred to have expertise with Make, Maven, Groovy, Gitlab, Gitlab pipelines, ArgoCD, AWS Codebuild/Codepipeline/CodeDeploy Experience in improving internal processes and good understanding of security engineering Capable of grasping, modifying and maintaining systems and code developed by others. Ability to debug and optimise code and automate routine tasks Systematic problem-solving approach, coupled with a strong sense of ownership, drive and determination. Ability to think outside the box and find innovative solutions to complex problems.
Requirements
Design and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.
Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence.
Proactively maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Respond quickly to issues and mobilise responsible individuals quickly to achieve the fasted possible resolution.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
Scale system and service sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and speed of service resolution.
Continually analyse service to end customers with a view to enhancing customer experience, eradicating issues, fixing root causes and driving quality into everything we do.
Educating support operations and customer help desks to adapt to new ways of working by increasing skills and knowledge.
Perform RCAs, publish reports and take it to the next level by inventing short/long term fixes and further Runbooks.
Be part of the Agile Mode of delivering Work Products by performing Backlog planning, Sprint Planning, Design Reviews, Peer Reviews and Retrospectives

Mandatory Skills
Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review

Experience
8 to 10 Years

Education Qualificaiton
Bachelor DegreeRoles & Responsibilities
Experience in one or more of the following: C, C++, Java, Python, Go, Ruby or shell scripting
Experience with Windows and Unix/Linux operating systems internals and administration (e.g. filesystems, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware)
Experience with containers and containers orchestration (e.g. Kubernetes, Docker) Extensive knowledge of AWS
Hands-on experience with IAC tools such as Cloudformation and Terraform
Experience with Configuration Management tools such as Ansible, Chef.
Experience with cloud hosted application-monitoring tools such as Kibana, ELK stack etc
Experience with Observability tools such as Dynatrace or Datadog
Excellent communication skills with the ability to present complex technical information in a clear and concise manner to a variety of audiences, both technical and non-technical
Comfortable working in a fast-paced, multi-tasking, dynamic environment
Experience with deployment automation, working with platforms for configuration management, provisioning and artifact repositories.
Preferred to have expertise with Make, Maven, Groovy, Gitlab, Gitlab pipelines, ArgoCD, AWS Codebuild/Codepipeline/CodeDeploy
Experience in improving internal processes and good understanding of security engineering
Capable of grasping, modifying and maintaining systems and code developed by others.
Ability to debug and optimise code and automate routine tasks
Systematic problem-solving approach, coupled with a strong sense of ownership, drive and determination.
Ability to think outside the box and find innovative solutions to complex problems.
About the Company
Cigres Technologies Private Limited is a technology consulting and services company that focuses on helping clients resolve their significant digital problems and enabling radical digital transformation using multiple technologies on premise or in the cloud. The company was founded with the goal of leveraging cutting-edge technology to deliver innovative solutions to clients across various industries.

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now

RecommendedJobs for You