Cloud Site Reliability Engineer

3 - 5 years

25 Lacs

Posted:1 week ago| Platform:

Apply

Work Mode

On-site

Job Type

Part Time

Job Description

Cloud Site Reliability Engineers (SRE) play a crucial role in ensuring the seamless performance and availability of our services. As an SRE, you will be responsible for maintaining and enhancing the reliability, security, and efficiency of our infrastructure and applications. You will work closely with our development, cloud operations, production operations and cloud architects’ teams to implement best practices, automate processes, and address complex issues. Your expertise and dedication will contribute to our ongoing success, improving the experience of our customers and driving our company's growth. Responsibilities As a Site Reliability Engineer (SRE), you will be responsible for: Supporting the design, implementation, and maintenance of our infrastructure, ensuring high availability, performance, and security. Collaborating with cross-functional teams to define and implement SLIs and SLOs. Identifying, troubleshooting, and resolving complex system and application issues, implementing monitoring and alerting solutions to proactively detect and address potential problems. Developing automation tools and scripts for managing, monitoring, and deploying infrastructure and applications. Working closely with the development team to ensure efficient and reliable deployment of new features and services. Continuously monitoring and maintaining a baseline of metrics and KPIs for system and application performance, scalability, and reliability Creating and maintaining comprehensive documentation for infrastructure, processes, and incident postmortems. Staying up to date with emerging trends and technologies in the field of site reliability engineering and driving the adoption of relevant best practices within the team. Required Experience 3–5 years of experience working as an SRE, DevOps Engineer, or a similar role, with a proven record of accomplishment. Bachelor’s degree or equivalent in a related field. Strong knowledge of cloud computing platforms, such as AWS or Azure Proficiency in one or more programming or scripting languages, such as Python or PEARL, PowerShell, and ANSIBLE Experience with containerization and orchestration technologies, such as Docker and Kubernetes. Familiarity with infrastructure as code (IaC) tools, such as Terraform, CloudFormation, or ARM templates. Solid understanding of networking concepts, including TCP/IP, DNS, and load balancing. Experience with monitoring and alerting tools, such as Prometheus, Grafana, Datadog, New Relic, logz.io. Experience with CI/CD pipelines and tools, such as Azure DevOps, Jenkins, GitLab, or GitHub Actions. Familiarity with database systems, both SQL and NoSQL. Knowledge of security best practices for infrastructure and application development. Industry certifications, such as AWS Certified Solutions Architect or Microsoft Certified: Azure Solutions Architect Expert Strong problem-solving skills and the ability to work independently or as part of a team. Excellent written and verbal communication skills. Must be able to succeed in a dynamic collaborative team environment and have excellent interpersonal and communication skills. Tungsten Automation Corporation, Inc. is an Equal Opportunity Employer M/F/Disability/Vets While the job description describes what is anticipated as the requirements of the position, the job requirements are subject to change based upon any changing needs and requirements of the business.

Mock Interview

Practice Video Interview with JobPe AI

Start Reliability Interview Now

My Connections Kofax

Download Chrome Extension (See your connection in the Kofax )

chrome image
Download Now
Kofax
Kofax

Software Development

Irvine California

1001-5000 Employees

11 Jobs

    Key People

  • Reynolds C. Bish

    Chief Executive Officer
  • Heather D. McGowan

    Chief Technology Officer

RecommendedJobs for You

Thiruvananthapuram, Kerala, India

Thiruvananthapuram, Kerala, India