Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
2.0 - 6.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer at Google, you will be responsible for combining software and systems engineering to develop and maintain large-scale, fault-tolerant systems. Your role will involve ensuring that Google's services, both internally critical and externally visible, meet the reliability and uptime needs of users while continuously improving performance. You will be tasked with overseeing system capacity and performance, optimizing existing systems, building infrastructure, and automating tasks to enhance efficiency. Joining the SRE team at Google will provide you with the opportunity to tackle the unique scalability challenges faced by the company. Your expertise in coding, algorithms, complexity analysis, and large-scale system design will be put to the test as you manage complex projects and contribute to the improvement of Google's infrastructure. The culture at SRE fosters intellectual curiosity, problem-solving skills, and openness. Collaboration, innovation, and risk-taking are encouraged in a supportive and blame-free environment. The diversity of backgrounds, experiences, and perspectives within the organization promotes creativity and learning. Self-direction is promoted for meaningful project work, while mentorship and support are provided for personal and professional growth. As part of the Technical Infrastructure team at Google, your role will involve supporting Corp Engineering services, establishing strong relationships with business partners, and driving technical discussions to propose innovative solutions that enhance the reliability of enterprise applications. You will apply Google SRE reliability strategies across GCP and the Google stack to minimize operational work. Collaboration with other engineering teams is essential to ensure that the infrastructure is reliable, scalable, and secure, and you will participate in the team's on-call rotation to maintain system availability and performance. Join us at Google and be a part of the team that powers the architecture behind the products and services used by millions of users worldwide. Experience the excitement of working on cutting-edge technologies, pushing the boundaries of what's possible, and contributing to a culture of continuous improvement and innovation.,
Posted 3 days ago
8.0 - 12.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineering (SRE) Technical Leader on the Network Assurance Data Platform (NADP) team at ThousandEyes, you will be responsible for ensuring the reliability, scalability, and security of cloud and big data platforms. Your role will involve representing the NADP SRE team, working in a dynamic environment, and providing technical leadership in defining and executing the team's technical roadmap. Collaborating with cross-functional teams, including software development, product management, customers, and security teams, is essential. Your contributions will directly impact the success of machine learning (ML) and AI initiatives by ensuring a robust and efficient platform infrastructure aligned with operational excellence. In this role, you will design, build, and optimize cloud and data infrastructure to ensure high availability, reliability, and scalability of big-data and ML/AI systems. Collaboration with cross-functional teams will be crucial in creating secure, scalable solutions that support ML/AI workloads and enhance operational efficiency through automation. Troubleshooting complex technical problems, conducting root cause analyses, and contributing to continuous improvement efforts are key responsibilities. You will lead the architectural vision, shape the team's technical strategy and roadmap, and act as a mentor and technical leader to foster a culture of engineering and operational excellence. Engaging with customers and stakeholders to understand use cases and feedback, translating them into actionable insights, and effectively influencing stakeholders at all levels are essential aspects of the role. Utilizing strong programming skills to integrate software and systems engineering, building core data platform capabilities and automation to meet enterprise customer needs, is a crucial requirement. Developing strategic roadmaps, processes, plans, and infrastructure to efficiently deploy new software components at an enterprise scale while enforcing engineering best practices is also part of the role. Qualifications for this position include 8-12 years of relevant experience and a bachelor's engineering degree in computer science or its equivalent. Candidates should have the ability to design and implement scalable solutions with a focus on streamlining operations. Strong hands-on experience in Cloud, preferably AWS, is required, along with Infrastructure as a Code skills, ideally with Terraform and EKS or Kubernetes. Proficiency in observability tools like Prometheus, Grafana, Thanos, CloudWatch, OpenTelemetry, and the ELK stack is necessary. Writing high-quality code in Python, Go, or equivalent programming languages is essential, as well as a good understanding of Unix/Linux systems, system libraries, file systems, and client-server protocols. Experience in building Cloud, Big data, and/or ML/AI infrastructure, architecting software and infrastructure at scale, and certifications in cloud and security domains are beneficial qualifications for this role. Cisco emphasizes diversity and encourages candidates to apply even if they do not meet every single qualification. Diverse perspectives and skills are valued, and Cisco believes that diverse teams are better equipped to solve problems, innovate, and create a positive impact.,
Posted 6 days ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
In this role, you will contribute to a critical and highly-visible function within the Esper business. You will have the opportunity to autonomously deliver the technical direction of the service and the feature roadmap. Working with extraordinary talent, you will deliver end-to-end features, improve platform quality, and act as a technical leader. If you are excited about making a significant impact on Esper and the device industry, you will find this role engaging, challenging, and full of opportunities to learn and grow. You will be responsible for end-to-end implementation and maintenance of features, fixes, and enhancements to the platform. Your contributions will directly and immediately enhance the experience of our customers. This role offers the chance to work with cutting-edge technologies and solve scalability issues associated with managing millions of devices. Each project you undertake will expand the scope of your impact on the platform. Your responsibilities will include improving the Esper Platform by planning, recommending, and executing strategic projects. Using metrics and data, you will provide insights on customer usage, bottlenecks, future requirements, security, and scalability of the platform. You will establish standards, guidelines, sample projects, and demos to influence engineering teams to write stable, secure, maintainable, and quality code. Collaboration with distributed teams will be essential to drive changes, write root cause analyses (RCAs), and coordinate resolutions for production incidents. Additionally, you will objectively assess new technologies, tools, frameworks, and design patterns for adoption into the Esper Platform. You will become the Subject Matter Expert (SME) for the Platform SRE team and be responsible for various SRE tasks including performance testing, API test automation, maintaining Kubernetes clusters, automations, and release-related tasks. The ideal candidate for this role should have at least 5 years of experience. Hands-on experience in building and managing cloud systems on one or more providers such as AWS, GCP, or Azure is required. Knowledge of Computer Science fundamentals like Data Structures, Algorithms, Operating Systems, and Networks is essential. Experience in designing, developing, and deploying at least one customer-facing project is expected. Proficiency in scripting or any modern programming languages is necessary, along with experience in developing and deploying on UNIX/Linux-based systems. Hands-on experience in performance optimization using multiple metrics, as well as familiarity with microservices and container technologies like Docker, Kubernetes, and OpenShift, is important. Understanding best security practices for implementing Infrastructure as Code (IAC), automation, and CI/CD workflows is a plus. Familiarity with tools such as Jenkins and Buildkite, as well as knowledge of performance testing and automation testing, will be advantageous.,
Posted 1 week ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
31458 Jobs | Dublin
Wipro
16542 Jobs | Bengaluru
EY
10788 Jobs | London
Accenture in India
10711 Jobs | Dublin 2
Amazon
8660 Jobs | Seattle,WA
Uplers
8559 Jobs | Ahmedabad
IBM
7988 Jobs | Armonk
Oracle
7535 Jobs | Redwood City
Muthoot FinCorp (MFL)
6170 Jobs | New Delhi
Capgemini
6091 Jobs | Paris,France