Home
Jobs

Site Reliability Engineer

2 years

3 - 9 Lacs

Posted:9 hours ago| Platform: GlassDoor logo

Apply

Work Mode

On-site

Job Type

Part Time

Job Description

TriNet is a leading provider of comprehensive human resources solutions for small to midsize businesses (SMBs). We enhance business productivity by enabling our clients to outsource their HR function to one strategic partner and allowing them to focus on operating and growing their core businesses. Our full-service HR solutions include features such as payroll processing, human capital consulting, employment law compliance and employee benefits, including health insurance, retirement plans and workers’ compensation insurance. TriNet has a nationwide presence and an experienced executive team. Our stock is publicly traded on the NYSE under the ticker symbol TNET. If you’re passionate about innovation and making an impact on the large SMB market, come join us as we power our clients’ business success with extraordinary HR. Don't meet every single requirement? Studies have shown that many potential applicants discourage themselves from applying to jobs unless they meet every single requirement. TriNet always strives to hire the most qualified candidate for a particular role, ensuring we deliver outstanding results for our small and medium-size customers. So if you're excited about this role but your past experience doesn't align perfectly with every single qualification in the job description, nobody’s perfect – and we encourage you to apply. You may just be the right candidate for this or other roles. JOB SUMMARY We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, availability, and performance of our systems and applications. Leveraging your technical expertise and knowledge of SRE practices, you will collaborate with cross-functional teams, drive automation initiatives, and implement best practices to enhance system resilience. If you are a dedicated and detail-oriented SRE professional with a passion for maintaining highly reliable systems, we encourage you to apply for this position. Essential Duties/Responsibilites System Monitoring and Incident Response: Monitor system health, proactively detect issues, and respond to incidents in a timely manner. Participate in incident response activities, including triage, troubleshooting, and resolution, ensuring minimal disruption to services. Automation and Tooling: Develop and maintain automation scripts, tools, and utilities to streamline operational tasks, reduce manual effort, and improve system efficiency. Leverage scripting languages and configuration management tools to automate routine tasks. Performance Optimization: Identify performance bottlenecks, analyze system metrics, and optimize system performance. Collaborate with Development and Operations teams to implement performance tuning measures and ensure optimal resource utilization. Infrastructure and Configuration Management: Manage infrastructure resources, including cloud platforms, servers, and network devices. Implement and maintain configuration management practices to ensure consistency and reliability across environments. Capacity Planning: Conduct capacity planning exercises to forecast resource requirements and support scalability. Analyze usage patterns, monitor system performance, and recommend infrastructure adjustments to meet demand. Incident Analysis and Post-Mortems: Perform root cause analysis for incidents and contribute to post-incident reviews. Identify areas for improvement, implement preventive measures, and update documentation and runbooks accordingly. System Documentation: Contribute to the development and maintenance of system documentation, runbooks, and standard operating procedures (SOPs). Ensure documentation is accurate, up-to-date, and accessible to the team. Collaboration and Communication: Collaborate effectively with cross-functional teams, including Development, Operations, and Support, to address system issues, implement changes, and improve system reliability. Communicate updates, findings, and recommendations to stakeholders in a clear and concise manner. Continuous Improvement: Identify opportunities for automation, process enhancements, and tooling improvements. Drive initiatives to optimize system reliability, streamline workflows, and improve operational efficiency. Security and Compliance: Collaborate with Security and Compliance teams to ensure adherence to security best practices, regulations, and standards. Participate in security assessments, vulnerability management, and risk mitigation efforts. Performs other duties as assigned Complies with all policies and standards QUALIFICATIONS Education Bachelor's Degree or equivalent experience Work Experience Typically 2+ years of relevant work experience in Site Reliability Engineering, system administration, or infrastructure management. Knowledge, Skills and Abilities Strong understanding of SRE principles, practices, and methodologies. Proficiency in scripting languages such as Python, Bash, or PowerShell. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Experience with cloud platforms such as AWS, Azure, or GCP. Knowledge of containerization technologies like Docker and orchestration tools like Kubernetes is a plus. Understanding of networking concepts, load balancing, and distributed systems. Experience with monitoring and observability tools like Prometheus, Grafana, or ELK stack. Excellent problem-solving and troubleshooting skills. Strong attention to detail and the ability to work efficiently in a fast-paced environment. Effective communication and collaboration skills, with the ability to work well in a team. Work Environment: Work in a clean, pleasant, and comfortable office work setting. The work environment characteristics described here are representative of those an employee encounters while performing the essential functions of this job. Reasonable accommodations may be made to enable persons with disabilities to perform the essential functions. This position is 100% in office Please Note: TriNet reserves the right to change or modify job duties and assignments at any time. The above job description is not all encompassing. Position functions and qualifications may vary depending on business necessity. TriNet is an Equal Opportunity Employer and does not discriminate against applicants based on race, religion, color, disability, medical condition, legally protected genetic information, national origin, gender, sexual orientation, marital status, gender identity or expression, sex (including pregnancy, childbirth or related medical conditions), age, veteran status or other legally protected characteristics. Any applicant with a mental or physical disability who requires an accommodation during the application process should contact recruiting@trinet.com to request such an accommodation.

Mock Interview

Practice Video Interview with JobPe AI

Start Reliability Interview Now
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
TriNet
TriNet

53 Jobs

RecommendedJobs for You

Hyderabad, Bengaluru, Thiruvananthapuram