Jobs
Interviews

6 Slo Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Who We Are: Wayfair runs the largest custom e-commerce large parcel network in the United States, approximately 1.6 million square meters of logistics space. The nature of the network is inherently a highly variable ecosystem that requires flexible, reliable, and resilient systems to operate efficiently. The Wayfair Operations Center team is looking for experienced engineers skilled in cloud-native design, legacy maintenance, and SRE best practicesplus ideas for improvement! We collaborate across our Technology organization to ensure platforms and services are production-ready, contributing to both platform and software codebases. What Youll Do: As a DevOps Engineer, you will join our team to help grow our systems into best-in-class for efficiency, stability, observability, velocity, and scale in the e-commerce space, engage with the product and engineering team from Day 1 to design, build and maintain the system/software proactively. Influence the design and architecture of Wayfair system as part of our Cloud Enablement journey while maintaining our critical pieces of legacy tools; collaborate with development teams to design scalable and reliable systems, considering aspects such as fault tolerance, availability and performance. Be the bridge between software engineers and platform engineers to develop and optimize repeatable systems so each side can leverage each other. Theres a wide range of opportunities to both guide the broad conversation and dive into the nuance of our code & architecture. Help service owners build realistic SLOs, set SLAs and error budgets, and ensure production services have reliability built into their design. Even after self-healing and automation done by you provide production support and creatively solve challenging engineering problems across our stack. Participate in the team&aposs on-call rotation. Automate repetitive tasks to increase efficiency and reduce human error. Mentor new hires and other engineers by example. Opportunities to lead tech talks, paired programming sessions, and more to increase technical efficiency across the organization. What Youll Need: 3+ years experience working in DevOps or SRE role, or software development with an understanding of Cloud Infrastructure. Experience with cloud platforms GCP, AWS, Azure, and containerization technologies (e.g. Docker, Kubernetes). Experience with server-side software engineering (Python Programming, Go, Java, BASH, etc) Design experience with distributed systems, microservices architecture, and related technologies. Strong understanding of monitoring and alerting, with a focus on performance monitoring and tracing instrumentation & SLI/SLO/SLAs. Knowledge of CI/CD pipelines and version control systems (e.g., Buildkite, Github Actions, Gitlab CI/CD). Knowledge of configuration management tools (e.g. Puppet, Ansible, Chef, Terraform). Excellent communication skills across engineers, product managers, and business stakeholders alike. Passion for leading a large, cross-cutting technical initiative to delivery, cross-functional consensus building and influencing design decisions. Ample experience gathering and balancing requirements from technical and business stakeholders, and reaching consensus on prioritization. Experience mentoring engineers and leading code reviews. Proven track record decoupling monolith services is a plus. Experience in Linux OS. Show more Show less

Posted 5 days ago

Apply

8.0 - 12.0 years

10 - 14 Lacs

Bengaluru

Work from Office

Job Summary: We are looking for a Junior Site Reliability Engineer (SRE) with strong Java coding and debugging skills to help maintain the reliability, performance, and scalability of our critical systems. As a Junior SRE, you will work closely with senior engineers to monitor systems, automate processes, and enhance infrastructure reliability. This role is ideal for candidates passionate about Java, DevOps, cloud technologies, and automation in a fast-paced environment. Experience: 2-4 years Key Responsibilities: System Reliability & Performance: Monitor and maintain the availability of key services and applications. Participate in defining and improving SLIs, SLOs, and SLAs for system reliability. Identify and resolve performance bottlenecks and system inefficiencies. Incident Management & Monitoring: Assist in incident response, troubleshooting production issues, and conducting root cause analysis (RCA). ¢ Work on improving monitoring, logging, and alerting systems using tools like Prometheus, Grafana, and Elastic APM. ¢ Participate in on-call rotations and incident handling. Java Coding & Debugging: ¢ Write and debug Java-based applications to enhance system reliability. ¢ Analyze logs, troubleshoot performance issues, and optimize Java services. ¢ Gain hands-on experience with JVM monitoring, thread dumps, and heap analysis. ¢ Work closely with developers to improve the reliability of Java applications. Automation & Infrastructure: ¢ Work with infrastructure as code (IaC) using Helm, or Ansible. ¢ Optimize system configurations for scalability and reliability. ¢ Automate operational tasks to improve system efficiency. Collaboration & Learning: ¢ Work closely with software engineers and senior SREs to enhance system reliability. ¢ Continuously develop knowledge in cloud computing (AWS, Azure, GCP), Kubernetes, and DevOps practices. Skills & Qualifications: Required Skills: ¢ Strong Java programming and debugging skills (must-have). ¢ Experience with Linux systems, networking, and cloud platforms (AWS, Azure, or GCP). ¢ Familiarity with monitoring tools like Prometheus, Grafana, or New Relic. ¢ Experience troubleshooting and analyzing Java application performance. ¢ Strong problem-solving skills and ability to analyze system issues. Preferred Skills (Nice to Have): ¢ Scripting ability in Python, Bash, or Go for automation. ¢ Exposure to Kubernetes and containerization concepts. ¢ Experience with infrastructure-as-code tools like Terraform or Ansible. Why Join Us? ¢ Work with experienced SREs and gain hands-on experience with modern DevOps practices. ¢ Learn and grow in a collaborative and innovative environment. ¢ Gain exposure to cutting-edge cloud, Java, and automation technologies. ¢ Opportunity for career growth into senior SRE roles.

Posted 1 week ago

Apply

5.0 - 9.0 years

15 - 27 Lacs

Pune, Bengaluru, Mumbai (All Areas)

Hybrid

Role: ServiceNow SLO Consultant Location: Pune / Mumbai / Bangalore / Chennai Skills Required: Over 3 years of hands-on experience in ServiceNow Administration, Development, and Upgrade Support, including day-to-day 2nd and 3rd line support and implementation of core ServiceNow modules. Proven expertise in ServiceNow modules such as Incident Management, Problem Management, Change Management, and CMDB. Experience with the ServiceNow SLO (Supplier Lifecycle Operations) module is must. Strong analytical and scripting skills across Client and Server-side scripting, including: Demonstrated ability to support and manage ServiceNow platform upgrades, ensuring minimal disruption and full compatibility with customizations. Certifications: ServiceNow Certified System Administrator (must) Certified in ServiceNow One Supplier Portal/Supplier Lifecycle Operations (OSP/SLO) module (must) ServiceNow Certified Implementation Specialist (ITSM, ITOM, EM) - good to have Regards Ramkumar 9363437874

Posted 2 weeks ago

Apply

12.0 - 18.0 years

35 - 60 Lacs

Hyderabad

Hybrid

Senior Manager, Site Reliability Engineering Hyderabad Shift Timings: 1.00 PM - 10.00 PM Duties and Responsibilities: People Leader Responsibility Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior titles. Responsibilities: Lead and manage a team of Site Reliability Engineers, providing mentorship, guidance, and support to ensure the team's success. Develop and implement strategies for improving system reliability, scalability, and performance. Establish and enforce SRE best practices, including monitoring, alerting, error budget tracking, and post-incident reviews. Collaborate with software engineering teams to design and implement reliable, scalable, and efficient systems. Implement and maintain monitoring and alerting systems to proactively identify and address issues before they impact customers. Implement performance engineering processes to ensure reliability of Products, Services, & Platforms. Drive automation and tooling efforts to streamline operations and improve efficiency. Continuously evaluate and improve our infrastructure, processes, and practices to ensure reliability and scalability. Provide technical leadership and guidance on complex engineering projects and initiatives. Stay up-to-date with industry trends and emerging technologies in site reliability engineering and cloud computing. Other duties as assigned. Required Work Experience: 10+ years of experience in site reliability engineering or a related field. 5+ years of experience in a leadership or management role, managing a team of engineers. 5+ years of hands on working experience with Dynatrace (administrative, deployment, etc). Strong understanding of DevSecOps principles. Strong understanding of cloud computing principles and technologies, preferably AWS, Azure, or GCP. Strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams. Proven track record of driving projects to successful completion in a fast-paced, dynamic environment. Experience with driving cultural change in technical excellence, quality, and efficiency. Experience managing and growing technical leaders and teams. Constructing, interpreting, and applying metrics to your work and decision making, able to use those metrics to identify correlation between drivers and results, and using that information to drive prioritization and action Preferred Work Experience: Proficiency in programming/scripting languages such as Python, Go, or Bash. Experience with infrastructure as code tools such as Terraform or CloudFormation. Deep understanding of Linux systems administration and networking principles. Experience with containerization and orchestration technologies such as Docker and Kubernetes. Experience or familiarity with IIS, HTML, Java, Jboss. Knowledge: Site Reliability Engineering Principles DevSecOps Principles Agile (SAFe) Healthcare industry ITLT ServiceNow Jira/Confluence Skills: Strong communication skills Leadership Programming languages (see above) Project Management Mentorship Continuous learning

Posted 1 month ago

Apply

5.0 - 8.0 years

25 - 37 Lacs

Thiruvananthapuram

Work from Office

What youll do? Design, develop, and operate high scale applications across the full engineering stack. Design, develop, test, deploy, maintain, and improve software. Apply modern software development practices (serverless computing, microservices architecture, CI/CD, infrastructure-as-code, etc.) Work across teams to integrate our systems with existing internal systems, Data Fabric, CSA Toolset. Participate in technology roadmap and architecture discussions to turn business requirements and vision into reality. Participate in a tight-knit, globally distributed engineering team. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on network, or service operations and quality. Research, create, and develop software applications to extend and improve on Equifax Solutions. Manage sole project priorities, deadlines, and deliverables. Collaborate on scalability issues involving access to data and information. Actively participate in Sprint planning, Sprint Retrospectives, and other team activity What experience you need Bachelor's degree or equivalent experience 5+ years of relevant software engineering experience 5+ years experience writing, debugging, and troubleshooting code in mainstream Java, SpringBoot, TypeScript/JavaScript, HTML, CSS 5+ years experience with Cloud technology: GCP, AWS, or Azure 5+ years experience designing and developing cloud-native solutions 5+ years experience designing and developing microservices using Java, SpringBoot, GCP SDKs, GKE/Kubernetes 5+ years experience deploying and releasing software using Jenkins CI/CD pipelines, understand infrastructure-as-code concepts, Helm Charts, and Terraform constructs What could set you apart Self-starter that identifies/responds to priority shifts with minimal supervision. Experience designing and developing big data processing solutions using Dataflow/Apache Beam, Bigtable, BigQuery, PubSub, GCS, Composer/Airflow, and others UI development (e.g. HTML, JavaScript, Angular and Bootstrap) Experience with backend technologies such as JAVA/J2EE, SpringBoot, SOA and Microservices Source code control management systems (e.g. SVN/Git, Github) and build tools like Maven & Gradle. Agile environments (e.g. Scrum, XP) Relational databases (e.g. SQL Server, MySQL) Atlassian tooling (e.g. JIRA, Confluence, and Github) Developing with modern JDK (v1.7+) Automated Testing: JUnit, Selenium, LoadRunner, SoapUI Cloud Certification strongly preferred

Posted 1 month ago

Apply

5.0 - 12.0 years

2 - 7 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

Your role and responsibilities As a Software Developer you'll participate in many aspects of the software development lifecycle, such as design, code implementation, testing, and support. You will create software that enables your clients hybrid-cloud and AI journeys Your primary responsibilities include: Experience in developing Pipelines using Java or Replicator framework. Writing Junit testcases for Java modules. Having experience in writing SQL queries Experience in handling Java production issues, error management, debugging skills. Willingness to learn and work on client specific tools and API's. Responsible to work as an individual contributor and deliver the issues within SLA's & SLO's. Good understanding of client-side scripting. Having good logical & analytical, problem-solving skills with good soft & communication skills Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise Technical expertise in Java development projects Understanding and experience in Java coding using various frameworks and design patterns. Knowledge on data pipelines. Developing data bridge pipelines using replicator framework. Writing Junit testcases for the pipelines Preferred technical and professional experience Experience in data analytics. Working knowledge on Plx framework and tools. Knowledge on workday integrations with external systems and Experience in working on Google Cloud Platform

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies