Senior Site Reliability Engineer

4 years

0 Lacs

Dehradun, Uttarakhand, India

Posted:4 months ago| Platform: Linkedin logo

Apply

Skills Required

reliability compliance git flow communication aws api powershell networking docker kubernetes linux software development automation monitoring security drive engineering design automate resolve management support php node.js troubleshooting tuning optimization regulations scalability code terraform analyze metrics database forecast documentation analysis efficiency scaling devops scripting programming python strategy containerization orchestration learning certifications engagement portal

Work Mode

Remote

Job Type

Full Time

Job Description

Experience: 4.00 + years Salary: Confidential (based on experience) Expected Notice Period: 30 Days Shift: (GMT+11:00) Australia/Melbourne (AEDT) Opportunity Type: Remote Placement Type: Full Time Indefinite Contract(40 hrs a week/160 hrs a month) (*Note: This is a requirement for one of Uplers' client - Compare Club) What do you need for this opportunity? Must have skills required: CI/CD, Compliance, git flow startegy, Good communication skills, Apigee, Aws api, AWS Powershell/AWS CLI, Networking, New Relic, node.js/PHP, serverless framework application, Site Reliability, Docker, Kubernetes, Linux SysAdmin Compare Club is Looking for: Senior Site Reliability Engineer The Senior Site Reliability Engineer (SRE) plays a critical role in bridging the gap between software development and operations, ensuring systems are scalable, reliable, and efficient. This role focuses on reducing manual work through automation, improving system resilience, and delivering seamless services to our customers. Monitoring, security, and compliance are key aspects of this role, as you will proactively address potential risks while ensuring the systems meet the required operational and regulatory standards. As an offshore SRE, you will collaborate across global teams to optimize operations, implement effective monitoring solutions, and drive innovation in reliability engineering. The role will also include supporting cloud environments, primarily on AWS, and managing Apigee API Gateway on Google Cloud Offshore Responsibilities: Ensure Reliability and Performance: Maintain high availability and performance of production systems and applications to meet SLA commitments.Proactive Monitoring: Develop, implement, and enhance monitoring solutions, dashboards, and alerts to detect and address issues before they impact users.Automate Operational Tasks: Design and maintain tools and scripts to automate repetitive tasks, deployments, and incident response processes.Improve System Resilience: Collaborate with cross-functional teams to identify and resolve potential bottlenecks and single points of failure.Cloud and API Management: Support and optimize cloud infrastructure on AWS, including EC2, S3, RDS, Lambda, and networking. Manage and enhance Apigee API Gateway for seamless API performance and integration.Support PHP, Node.js, Serverless Framework Applications: Monitor, troubleshoot, and optimize PHP, Node.js, ServerlessFramework-based applications for reliability and scalability.Linux Systems Administration: Provide high-level Linux support, including advanced troubleshooting, performance tuning, and system optimization for reliability and scalability.Security Implementation: Collaborate with security teams to ensure applications and systems are configured to meet security and compliance requirements.Compliance Adherence: Monitor, document, and enforce compliance with industry standards, regulations, and company policies, such as ISO 27001, SOC 2, or GDPR.Promote Infrastructure Scalability: Use Infrastructure as Code (IaC) tools like Terraform or CloudFormation to manage, scale, and improve infrastructure.Collaborate for Continuous Improvement: Partner with development, operations, and security teams to embed SRE best practices and enhance operational efficiency.Optimize Performance: Analyze system performance metrics to implement tuning measures for improved application and database efficiency.Plan for Capacity Growth: Monitor infrastructure trends and forecast requirements to ensure systems scale with business growth.Document Processes: Maintain clear, up-to-date documentation for systems, tools, and procedures to facilitate knowledge sharing and team alignment.System Uptime: Achieve and maintain 99.9% uptime for critical applications and infrastructure. Incident Resolution: Ensure incidents are resolved within defined SLAs, with clear root cause analysis and follow-up actions.Automation Coverage: Increase automation of operational tasks by at least 30% annually, reducing manual intervention.Monitoring Efficiency: Implement proactive monitoring tools with minimal false positives, ensuring issues are flagged before customer impact.Application Support: Provide consistent, high-quality support for PHP and Node.js applications, reducing downtime and performance issues.Linux Optimization: Ensure Linux systems operate at peak efficiency, with documented performance tuning and troubleshooting standards.Security Compliance: Ensure all systems and processes adhere to defined security standards and compliance requirements. Key Result Areas: Infrastructure Scalability: Support seamless scaling of infrastructure to meet 100% of projected growth requirements without major disruptions.Collaboration Impact: Actively contribute to cross-team initiatives, enhancing overall reliability and operational efficiency.Documentation Quality: Maintain 100% up-to-date and accurate system documentation to support operational excellence and knowledge sharing. Skills & Qualifications: Education: Bachelor''s degree in Computer Science, IT, or a related field (or equivalent experience). Experience: 3+ years in an SRE, DevOps, or equivalent role.Hands-on experience with AWS services (e.g., EC2, S3, RDS, Lambda, CloudWatch).Experience managing and optimizing APIs using Apigee and AWS API Gateway.Experience supporting PHP, Node.js and ServerlessFramework-based applications in production environments. Technical Skills: High proficiency in Linux systems administration, including troubleshooting, performance tuning, and system optimization.Proficiency in scripting/programming (e.g., Python, Bash).Strong expertise in monitoring tools (e.g., Cloudwatch, New Relic, Prometheus, Grafana).Knowledge of security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR).Experience with CI/CD pipelines and tools and gitflow strategy (e.g., AWS Codebuild, AWS Codepipeline, GitLab).Understanding of containerization and orchestration (e.g., Docker, Kubernetes).Familiarity with networking, security, and database management. Good to have: Continuous learning mindset to stay updated on emerging technologies and trendsStrong communicationEnjoys problem solving skills, task automation and analysisAbility to work across time zonesAnalytical mindset with a focus on measuring and optimising operational processesRelevant certifications in cloud computing, devops or related fields (e.g., AWS Certified DevOps Engineer, AWS Certified SolutionsArchitect, Certified Kubernetes Administrator) Engagement: Indefinite contract with Compare Club Interview Process: 2 rounds How to apply for this opportunity? Step 1: Click On Apply! And Register or Login on our portal.Step 2: Complete the Screening Form & Upload updated ResumeStep 3: Increase your chances to get shortlisted & meet the client for the Interview! About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement. (Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well). So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Mock Interview

Practice Video Interview with JobPe AI

Start Reliability Interview Now
Uplers
Uplers

Digital Services

Ahmedabad

200+ Employees

4724 Jobs

    Key People

  • Karan Singh

    Co-founder & CEO
  • Nitesh Gohil

    Co-founder

RecommendedJobs for You

Hyderabad / Secunderabad, Telangana, Telangana, India