Site Reliability Engineer

5 years

0 Lacs

Posted: 19 hours ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

Site Reliability Engineer (SRE)

At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between development and operations, contributing to the design, scalability, and performance optimization of our infrastructure.

Requirements

Key Responsibilities:

  • Ensure the reliability, availability, and performance of production systems
  • Develop and maintain monitoring, alerting, and incident response systems
  • Automate routine tasks and improve system performance using scripting and programming
  • Create and promote best practices for operational efficiency and reliability
  • Collaborate closely with development teams to enhance system designs for reliability and monitoring
  • Perform root cause analysis to resolve production incidents effectively
  • Contribute to capacity planning and performance analysis activities
  • Document processes, architectures, and troubleshooting steps for team knowledge-sharing

Required Skills and Qualifications:

  • 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or in a similar role
  • Strong experience with cloud service providers (AWS, Azure, Google Cloud)
  • Proficiency in programming/scripting languages such as Python, Go, or Ruby
  • Experience with container orchestration platforms (e.g., Kubernetes, Docker Swarm)
  • Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef)
  • Experience with monitoring tools like Prometheus, Grafana, Datadog, or similar
  • Strong problem-solving skills with a focus on automation and reliability
  • Excellent communication skills and the ability to work collaboratively in a team environment

Preferred Skills:

  • Knowledge of microservices architecture and RESTful APIs
  • Familiarity with Agile methodologies and CI/CD practices
  • Experience in disaster recovery planning and execution
  • Certification in cloud technologies or site reliability practices

Education:

  • Bachelor's degree in Computer Science, Information Technology, or equivalent experience

Benefits

Talworx is an emerging recruitment consulting and services firm. We are hiring for our product-based healthcare client, a leading precision medicine company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, the company is transforming patient care by providing critical insights into what drives disease through its advanced blood and tissue tests, real-world data, and AI analytics.
