Jobs
Interviews

16 Sre Principles Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

The SRE team at Freshworks comprises expert Software and System engineers who are responsible for ensuring the Availability, Scalability, and Performance of the SaaS products. They design tools and frameworks for monitoring, load testing, and occasionally develop complete platform features used by other products. The team conducts architecture reviews and assists individual product teams in identifying performance bottlenecks. The approach taken by the team is bottom-up, focusing on viewing the application from a system perspective. Engineers within the SRE team have the autonomy to select the challenges they wish to tackle and take ownership of tasks until completion. Their responsibilities include designing, coding, and delivering software to enhance the availability, latency, and efficiency of Freshworks Products & Platforms. They also manage the availability, latency, and performance of critical services, implementing automation to prevent recurring issues. Furthermore, the team independently devises and implements architectural strategies and infrastructure solutions. They are tasked with defining strategies, vision, and roadmaps for developing CI/CD, Application hosting, Security, and Compliance standards across Freshworks. The team also conducts blameless postmortems for large-scale incidents, drives automation and orchestration strategies, and formulates cost optimization plans for the Freshworks Cloud environment. Qualifications: - 4-7 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles - Proficient in programming/scripting languages like Python or Go - Hands-on experience with cloud platforms such as AWS, GCP, or Azure - Deep understanding of SRE principles, including SLIs/SLOs, reliability metrics, and incident response - Familiarity with monitoring and observability tools like Prometheus, Grafana, Datadog, ELK, OpenTelemetry - Solid experience with infrastructure automation tools such as Terraform, Ansible, or Pulumi - Strong knowledge of Linux systems, networking, and containerization (Docker, Kubernetes) - Experience with CI/CD pipelines and version control systems like GitHub Actions, Jenkins, GitLab CI/CD - Strong analytical and problem-solving skills with a proactive and ownership-driven mindset Freshworks offers a dynamic work environment where you can leverage your expertise in Site Reliability Engineering to make a real impact. Join us in building a fresh vision of how the world works.,

Posted 21 hours ago

Apply

3.0 - 8.0 years

0 Lacs

pune, maharashtra

On-site

As a System Reliability Engineer at Roche, you will play a crucial role in leading the design, execution, and continuous evolution of the Monitoring, Observability, Automation, and Job Management strategy. Your expertise in ERP Operations Control Center (OCC) and enterprise observability architecture will be utilized to ensure end-to-end visibility across SAP ERP, middleware, and business-critical applications. Your responsibilities will include designing and governing a comprehensive monitoring architecture, executing the Automation & Observability roadmap, and standardizing monitoring patterns using tools like SAP Focused Run and SAP Cloud ALM. You will define and manage SLIs, SLOs, error budgets, and establish a reliability engineering culture across SAP operations. Additionally, you will integrate AI-driven monitoring and anomaly detection for faster incident detection and resolution. In this strategic role, you will collaborate with technical teams and business stakeholders to enhance observability capabilities, conduct root cause analysis, and introduce operational best practices for proactive incident prevention. You will also define and operationalize business KPIs with dashboards tied to user experience and transaction health. To be successful in this role, you should have 8+ years of experience in SAP system architecture, monitoring automation design, or SRE roles, along with 3+ years of experience with SAP OCC technologies. Proficiency in SAP S/4HANA, BTP, middleware, and enterprise-wide observability tools is essential. Strong stakeholder management, communication, and collaboration skills are required, along with a passion for reliability, automation, and measurable improvement. At Roche, we are dedicated to advancing science and ensuring everyone has access to healthcare. With over 100,000 employees worldwide, we work together to deliver life-changing healthcare solutions that make a global impact. Join us in building a healthier future, where every voice matters. Roche is an Equal Opportunity Employer.,

Posted 3 days ago

Apply

8.0 - 12.0 years

0 Lacs

pune, maharashtra

On-site

As a Lead Full Stack Engineer at Barclays, you will play a pivotal role in driving the evolution of the API First digital strategy, facilitating innovation and operational excellence. Your main responsibility will be to utilize cutting-edge technology to develop and manage robust, scalable, and secure APIs that ensure the seamless delivery of digital solutions. To excel in this position, you must possess advanced proficiency in full-stack development, with hands-on experience in Core Java, JPA/Hibernate, Spring framework, and enterprise caching solutions. Additionally, you should have a strong grasp of Spring ecosystem technologies such as Spring Boot, Spring-JMS, and Spring-Data. Your demonstrated ability to build secure, fault-tolerant, and scalable enterprise applications across the entire technology stack will be crucial. Moreover, you should be well-versed in RESTful API development and consumption, with practical implementation of OpenAPI/Swagger specifications. Hands-on experience in implementing API security protocols and authentication mechanisms like OAuth2, JWT, and mTLS is essential. Your expertise in both relational (RDBMS) and NoSQL database technologies, along with practical coding experience in event-driven architecture patterns and microservices, will be highly beneficial. In this role, your strong technical leadership capabilities will be put to use as you mentor junior developers and conduct effective code reviews. You will also enforce code quality using tools like SonarQube and Veracode to ensure high standards are maintained. Proficiency with developer tools such as Jenkins, Maven, Gradle, Nexus, Git, and CI/CD pipelines is expected, along with hands-on experience with enterprise message broker platforms like Kafka. Additionally, you should have a solid understanding of Agile, DevOps, Site Reliability Engineering (SRE) principles, and CI/CD practices. Familiarity with cloud platforms such as OpenShift Enterprise, AWS, and cloud-native application development is advantageous. Knowledge of testing methodologies like TDD, BDD, and automated testing frameworks is also required, along with effective communication skills and the ability to collaborate within a team. Some other valuable skills for this role include experience with modern frontend technologies like HTML5, CSS3, JavaScript, and frameworks such as Angular or React. Advanced SQL optimization skills, performance tuning experience with Oracle or similar RDBMS, and the ability to implement API client SDKs and developer-focused documentation are appreciated. Troubleshooting capabilities for diagnosing and resolving complex performance issues at the code level, experience with containerization technologies like Docker and Kubernetes, and knowledge of ITIL-based release and change management processes are also beneficial. In summary, as a Lead Full Stack Engineer at Barclays, you will be at the forefront of shaping the digital strategy through innovative API development, secure coding practices, and collaboration with cross-functional teams to deliver high-quality software solutions that meet business objectives and customer needs.,

Posted 1 week ago

Apply

17.0 - 21.0 years

0 Lacs

hyderabad, telangana

On-site

As a part of the Technology team at Arcesium, you will play a crucial role in managing a group of highly skilled senior engineers. Your responsibilities will include providing technical management, guidance, coaching, best practices, and principles to the team. You will actively engage with team members, manage their performance, and plan their career development. Resource planning, execution, and ensuring the quality of software delivered by the group will be under your purview. Your leadership will be instrumental in building sophisticated products using cutting-edge technology stack, which will be utilized by leading investment management firms globally. Collaborating closely with various stakeholders such as the Product Management Team and other engineering teams, you will drive the execution of multiple business strategies and technologies. Ensuring operational efficiency and actively participating in organizational initiatives to maintain the highest levels of service offerings to customers will be a key aspect of your role. To excel in this position, you are required to hold a bachelor's degree in Computer Science with over 17 years of experience. Deep understanding in programming languages such as Java (or other JVM languages) and Python is essential. Experience with relational or non-relational Database technologies, exposure to cloud service providers like AWS, Azure, or GCP, and knowledge of delivering products with low/no touch support along with SRE principles are also important. Your ability to lead a team of highly skilled engineers, oversee multiple projects and engagements concurrently, and exhibit exceptional verbal and written communication skills will be critical. Any experience in FinTech will be considered a bonus, further enhancing your suitability for this role.,

Posted 1 week ago

Apply

7.0 - 11.0 years

0 Lacs

karnataka

On-site

You are a highly skilled and experienced Java SRE (Site Reliability Engineer) sought by our team in Bangalore. You should have a strong background in Java development and production support (L2/L3), demonstrating the ability to uphold the reliability, performance, and availability of large-scale enterprise systems. In this role, your responsibilities will include providing L2/L3 support for Java-based production systems, ensuring swift resolution of critical issues. You will be expected to monitor system health and performance, promptly identifying and addressing potential issues. Collaborating with development, QA, and infrastructure teams is crucial to enhance application reliability and scalability. Your expertise should encompass strong hands-on experience with Java/J2EE technologies and deep knowledge of Java production support and debugging high-volume applications. You should possess a profound understanding of SRE principles, monitoring tools (e.g., Prometheus, Grafana), and logging systems (e.g., ELK stack, Splunk). Experience in incident management, change management, and familiarity with ITIL best practices is required. Moreover, knowledge of CI/CD pipelines, automation scripts, and proficiency in cloud platforms (preferably AWS/Azure/GCP) will be beneficial. Excellent problem-solving skills and the ability to perform effectively under pressure are essential for success in this position. This is a full-time employment opportunity with an immediate to April joiners" notice period. If you are a Java SRE professional looking to leverage your skills in a challenging environment, we encourage you to apply for this position.,

Posted 2 weeks ago

Apply

8.0 - 10.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Job Description Ford Credit Platform Engineering is looking for a Project Managers specializing in strategic delivery. You will work on highly complex, cross-team, multi-project programs and help to ensure teams are delivering high quality products and services while being responsible for the over-arching roadmap. You will help teams work on predictability and quality while supporting engineering and operational excellence. The role will have the ability to directly impact the future of program management practices at Ford Credit and effect a high degree of change across the Ford Credit Engineering teams. The candidate will make an impact by aligning business and global technology goals while managing relationships across geographically distributed teams and influencing decisions across multiple work streams and executive leadership. The primary customers for the role are engineering leaders, product managers, developers, and analytics teams within Ford Credit. The candidate will have a deep understanding of Lean-Agile program management practices (including SAFe), DevOps, and SRE principles and practices. The candidates technical depth should include understanding the principles behind why engineering teams make architectural decisions, including cloud native platforms, streaming data platforms, and the challenges faced when providing secure solutions in regulatory spaces as well as ensuring the privacy of our customers. Responsibilities In this position, you will Project Planning and Management Develop and manage project plans, schedules, and budgets Define project scope, goals, and deliverables Coordinate and collaborate with cross-functional teams to ensure successful project delivery. Monitor and report on project progress, identifying and mitigating risks as needed. Program Incremental (PI) Event Lead and facilitate Agile ceremonies such as daily scrum, sprint planning, sprint reviews, and retrospectives. Foster a culture of continuous improvement within the team. Facilitate the PI Planning by helping ensure readiness with strategic alignment, leadership and team preparedness and managing event logistics. Support PI execution by facilitating ART events and escalating impediments. Stakeholder Management Serve as the primary point of contact for project stakeholders, ensuring clear communication and alignment. Manage stakeholder expectations and ensure their needs are met. Provide on-going visibility to all stakeholders on program status including key decisions, dependencies, risks, issues, metrics, etc. Team Leadership and Development Mentor ad coach team members on Agile methodologies and best practices. Foster a collaborative and high-performing team environment. Identify and address any team dynamics issues that may impact project success. Uncover, anticipate, raise and aggressively remove obstacles which prevent program teams from delivering against expected program outcomes. Support the teams to collaboratively drive continuous improvement and create a learning organization to enable speed to market and foster innovation. Delivery Metrics and JIRA Dashboards Ensure that project deliverables meet the required quality standards. Implement and oversee testing and quality assurance processes. Qualifications The minimum requirements we seek Bachelor degree/BE in Computer Science, Information Technology or related field. Overall 10+ years and minimum of 8+ years experience in engineering, engineering program management, technical program management, product management, or related area Extensive experience using, managing, and supporting teams with Agile program management tools such as Jira, Confluence. Should have delivered atleast one Program/Project in SAFe and has good knowledge in PI planning. Our Preferred Qualifications Certification in Lean-Agile practices, such as Certified Scrum Master, RTE. Experience managing complex programs with solutions relying on cloud-native technologies. Overall 10+ years and minimum of 8+ years experience in engineering, engineering program management, technical program management, product management, or related area Extensive experience managing programs supporting Platform service-oriented or SaaS based solutions. Strong verbal and written communications skills with the ability to influence the enterprise. What Youll Receive In Return We believe that freedom of movement drives human progress. As part of this exciting program, youll enjoy a high level of involvement with an exceptional team of industry innovators and visionaries. Youll contribute in a meaningful way to our important, breakthrough work! And youll develop the skills that will give you a significant edge in your future career pursuits. We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. Show more Show less

Posted 2 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

Join us as a Service Operations Manager at Barclays, where you'll play a crucial role in shaping the evolution of our digital landscape. Your responsibilities will involve driving innovation and excellence by leveraging cutting-edge technology to enhance our digital offerings and deliver exceptional customer experiences. As a valued member of our team, you will be tasked with delivering a robust technology stack, utilizing your strong analytical and problem-solving skills to interpret business requirements and provide high-quality solutions. Collaboration with fellow engineers, business analysts, and stakeholders will be a key aspect of your role, as you work on addressing complex technical challenges that require detailed analytical skills and in-depth analysis. To excel in the role of Service Operations Manager, you should possess experience in the following areas: - Strong understanding and application of SRE principles with a focus on robust governance of Incident, Problem & Change Management. - Proficiency in Digital Technology principles. - Expertise in Change & Transformation methodologies. Additionally, highly valued skills for this role include: - Proficiency in Java, Jenkins, APIs, AWS, CI/CD Pipelines, SDLC, etc. - Effective Stakeholder Management capabilities. - Knowledge of Risk & Control Standards. Your performance may be evaluated based on critical skills essential for success in this role, such as risk management, change and transformation proficiency, business acumen, strategic thinking, and digital and technological acumen. This position is based in Pune. **Purpose of the Role:** The primary purpose of this role is to oversee the IT Services department, set strategic directions, provide support to senior management, manage IT service risks, and ensure the effective operation of IT services to support the banks operations. You will be responsible for representing Technology service performance to senior stakeholders and managing IT service risks across the organization. **Accountabilities:** - Develop and implement strategic directions for IT Services, incorporating the latest methodologies and processes. - Manage the IT Services department, including overseeing colleagues" performance, setting departmental goals and objectives, and ensuring departmental efficiency and effectiveness. - Establish and maintain relationships with IT Services stakeholders, identify relevant stakeholders, and uphold the quality of external third-party services. - Develop and enforce policies and procedures for IT Services, ensure adherence to control targets and standards, manage adherence to group SLAs, and control core technology production activities in incident, problem, and change management. - Identify and mitigate potential IT Services risks, develop risk mitigation strategies, and align with the bank's change and compliance functions. - Monitor the financial performance of the IT Services department, including revenue, profitability, cost control, and value realization from commercial agreements. - Lead IT Services projects, drive successful research and product launches, and deliver integrated solutions to clients. - Ensure the smooth operation and maintenance of the bank's critical technology infrastructure, resolve complex technical issues, and minimize operational disruptions. **Assistant Vice President Expectations:** As an Assistant Vice President, you are expected to advise on decision-making, contribute to policy development, and ensure operational effectiveness. Collaborate with other functions/business divisions and lead a team in performing complex tasks that impact the entire business function. Set objectives, coach employees, and appraise performance. Demonstrate leadership behaviors to create an environment for colleagues to excel. Demonstrate the Barclays Values of Respect, Integrity, Service, Excellence, and Stewardship, as well as the Barclays Mindset of Empower, Challenge, and Drive in all aspects of your work.,

Posted 2 weeks ago

Apply

15.0 - 19.0 years

0 Lacs

pune, maharashtra

On-site

As the Investor Services Head of Quality Engineering, you will play a strategic leadership role within the Investor Services management team, focusing on driving quality engineering and enhancing software delivery processes. Your responsibilities will include leading the testing team, implementing test automation strategies, standardizing toolsets, and supporting key transformational programs such as platform modernization and new capability development. You will collaborate closely with Technology and Operations heads to align with business goals and objectives. Your role will involve developing the strategic direction of the quality engineering function, communicating testing strategies to stakeholders, and implementing strong quality engineering governance for new applications. You will lead efforts to standardize processes, procedures, and governance, while providing thought leadership in quality engineering and new technologies. In addition, you will focus on driving continuous, measurable improvements in quality engineering processes, championing automation, implementing CI/CD integrated testing methodologies, and embedding SRE principles into quality engineering. You will also be responsible for leveraging new technologies like AI/ML, championing DevOps processes, and ensuring compliance with Citis Technology standards. Your qualifications for this role include significant experience in Technology supporting Financial Services, people management experience, impactful delivery track record, and proficiency in application development and cloud environments. Strong influencing skills, clear communication abilities, problem-solving skills, and attention to detail are also essential for this position. As the Investor Services Head of Quality Engineering, you will lead the way in driving quality engineering excellence, automation, and innovation to support the business goals of Investor Services Technology.,

Posted 2 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

Embark on a transformative journey as an Infrastructure Engineer at Barclays, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness cutting-edge technology to revolutionize our digital offerings, ensuring unparalleled customer experiences. Operational Support Systems (OSS) Platform Engineering is a new team within the newly formed Network OSS & Tools functional unit in the Network Product domain at Barclays. The Barclays OSS Platform Engineering team is responsible for the design, build, and run of the underlying OSS infrastructure and toolchain across cloud and on-prem that the core systems and tools required to run the Barclays Global Network reside on. To be successful in this role as an Infrastructure Engineer in the Barclays OSS Platform Engineering team, you should possess the following skillsets: Hands-on expertise with IaC, Cloud Platforms, CI/CD Pipelines, Containerization & Orchestration, and SRE principles; in a product organization with Network products. Strong knowledge and demonstrable hands-on experience with Networking - Routing & Switching, Network Security, and Data Centre Technologies (BGP, VXLAN/EVPN, Spine-Leaf implementation, NGFW, IDS/IPS, NAC, Segmentation, etc). Expertise building highly available and secure networks for low latency and high throughput applications. Expert knowledge of Cloud Networking (AWS/Azure/GCP) and Hybrid cloud connectivity - with expertise setting up TCO efficient resilient cloud connectivity for applications across multiple geo-locations and availability zones. Some other highly valued skills include: Expertise working in Network and infrastructure operational product engineering and knowledge of network element management plane, control plane, data plane, and fast switching. A very good understanding of Network Protocols - TCP/IP, UDP, HTTP/HTTPS, DNS, DHCP, BGP, OSPF, VXLAN, IPSec, etc. CCNA or equivalent certification is an added advantage. Expertise in Network Security and Network Automation is an added advantage. Specifically experience with concepts of zero trust, TLS/SSL, VPNs, and concepts like gNMI/gRPC, RESTCONF, etc. Programming expertise in one of the high-level languages like Python, Java, Golang alongside proficiency in Agile Methodologies Scrum/Kanban, backlog and workflow management, and SRE specific reporting (MTTR, deployment frequency, SLO, etc). You may be assessed on the key critical skills relevant for success in the role, such as risk and controls, change and transformation, business acumen strategic thinking, and digital and technology, as well as job-specific technical skills. This role is based in our Pune office. Purpose of the role: To build and maintain infrastructure platforms and products that support applications and data systems, using hardware, software, networks, and cloud computing platforms as required with the aim of ensuring that the infrastructure is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. Accountabilities: - Build Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfill business requirements ensuring measurable reliability, performance, availability, and ease of use. Including the identification of the appropriate technologies and solutions to meet business, optimization, and resourcing requirements. - Incident Management: Monitoring of IT infrastructure and system performance to measure, identify, address, and resolve any potential issues, vulnerabilities, or outages. Use of data to drive down mean time to resolution. - Automation: Development and implementation of automated tasks and processes to improve efficiency and reduce manual intervention, utilizing software scripting/coding disciplines. - Security: Implementation of a secure configuration and measures to protect infrastructure against cyber-attacks, vulnerabilities, and other security threats, including protection of hardware, software, and data from unauthorized access. - Teamwork: Cross-functional collaboration with product managers, architects, and other engineers to define IT Infrastructure requirements, devise solutions, and ensure seamless integration and alignment with business objectives via a data-driven approach. - Learning: Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth. Vice President Expectations: To contribute or set strategy, drive requirements and make recommendations for change. Plan resources, budgets, and policies; manage and maintain policies/ processes; deliver continuous improvements and escalate breaches of policies/procedures. If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviors to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviors are: L Listen and be authentic, E Energize and inspire, A Align across the enterprise, D Develop others. For an individual contributor, they will be a subject matter expert within own discipline and will guide technical direction. They will lead collaborative, multi-year assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialization to complete assignments. They will train, guide, and coach less experienced specialists and provide information affecting long-term profits, organizational risks, and strategic decisions. Advise key stakeholders, including functional leadership teams and senior management on functional and cross-functional areas of impact and alignment. Manage and mitigate risks through assessment, in support of the control and governance agenda. Demonstrate leadership and accountability for managing risk and strengthening controls in relation to the work your team does. Demonstrate a comprehensive understanding of the organization functions to contribute to achieving the goals of the business. Collaborate with other areas of work, for business-aligned support areas to keep up to speed with business activity and the business strategies. Create solutions based on sophisticated analytical thought comparing and selecting complex alternatives. In-depth analysis with interpretative thinking will be required to define problems and develop innovative solutions. Adopt and include the outcomes of extensive research in problem-solving processes. Seek out, build and maintain trusting relationships and partnerships with internal and external stakeholders to accomplish key business objectives, using influencing and negotiating skills to achieve outcomes. All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence, and Stewardship - our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset - to Empower, Challenge, and Drive - the operating manual for how we behave.,

Posted 2 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

maharashtra

On-site

As an Engineering Manager focusing on the OSS Platform & Infrastructure team, you will be responsible for leading and managing a team of engineers to ensure the successful development and maintenance of the organization's platform. Your role will require a deep understanding and practical experience in various technical domains. You should have hands-on expertise in Infrastructure as Code (IaC), Cloud Platforms, Continuous Integration/Continuous Deployment (CI/CD) Pipelines, Containerization & Orchestration, and Site Reliability Engineering (SRE) principles. Your experience should include working in a product-oriented environment with leadership responsibilities in engineering. In addition, you must demonstrate strong proficiency and practical experience with tools such as Ansible, Terraform, CloudFormation, and Pulumi. Knowledge of resource management frameworks like Apache Mesos, Kubernetes, and Yarn is essential. Expertise in Linux operating systems and experience in monitoring, logging, and observability using tools like Prometheus, Grafana, and ELK stack is also required. Furthermore, your programming skills should encompass at least one high-level language such as Python, Java, or Golang. A solid understanding of architectural and systems design, including scalability and resilience patterns, various databases (RDBMS & NoSQL), and familiarity with multi-cloud and hybrid-cloud architectures is crucial for this role. Additionally, highly valued skills for this position include expertise in Network and infrastructure operational product engineering. Knowledge of Network Protocols such as TCP/IP, UDP, HTTP/HTTPS, DNS, BGP, OSPF, VXLAN, IPSec, and having a CCNA or equivalent certification would be advantageous. Experience in Network Security, Network Automation, zero trust concepts, TLS/SSL, VPNs, and protocols like gNMI, gRPC, and RESTCONF is desirable. Proficiency in Agile Methodologies like Scrum and Kanban, backlog and workflow management, as well as SRE-specific reporting metrics (MTTR, deployment frequency, SLO, etc.), will also be beneficial for excelling in this role.,

Posted 3 weeks ago

Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

As a Site Reliability Engineering (SRE) Technical Leader on the Network Assurance Data Platform (NADP) team at Cisco ThousandEyes, you will be responsible for ensuring the reliability, scalability, and security of the cloud and big data platforms. Your role will involve representing the NADP SRE team, contributing to the technical roadmap, and collaborating with cross-functional teams to design, build, and maintain SaaS systems operating at multi-region scale. Your efforts will be crucial in supporting machine learning (ML) and AI initiatives by ensuring the platform infrastructure is robust, efficient, and aligned with operational excellence. You will be tasked with designing, building, and optimizing cloud and data infrastructure to guarantee high availability, reliability, and scalability of big-data and ML/AI systems. This will involve implementing SRE principles such as monitoring, alerting, error budgets, and fault analysis. Additionally, you will collaborate with various teams to create secure and scalable solutions, troubleshoot technical problems, lead the architectural vision, and shape the technical strategy and roadmap. Your role will also encompass mentoring and guiding teams, fostering a culture of engineering and operational excellence, engaging with customers and stakeholders to understand use cases and feedback, and utilizing your strong programming skills to integrate software and systems engineering. Furthermore, you will develop strategic roadmaps, processes, plans, and infrastructure to efficiently deploy new software components at an enterprise scale while enforcing engineering best practices. To be successful in this role, you should have relevant experience (8-12 yrs) and a bachelor's engineering degree in computer science or its equivalent. You should possess the ability to design and implement scalable solutions, hands-on experience in Cloud (preferably AWS), Infrastructure as Code skills, experience with observability tools, proficiency in programming languages such as Python or Go, and a good understanding of Unix/Linux systems and client-server protocols. Experience in building Cloud, Big data, and/or ML/AI infrastructure is essential, along with a sense of ownership and accountability in architecting software and infrastructure at scale. Additional qualifications that would be advantageous include experience with the Hadoop Ecosystem, certifications in cloud and security domains, and experience in building/managing a cloud-based data platform. Cisco encourages individuals from diverse backgrounds to apply, as the company values perspectives and skills that emerge from employees with varied experiences. Cisco believes in unlocking potential and creating diverse teams that are better equipped to solve problems, innovate, and make a positive impact.,

Posted 3 weeks ago

Apply

14.0 - 18.0 years

0 Lacs

haryana

On-site

As a Senior Principal Engineer at Arcesium, you will play a key role in shaping the technology vision of the firm and leading the transformation across various verticals. You will be responsible for creating and reviewing architectural decisions of products, driving technology innovation, and engaging actively with the tech community to enhance employee knowledge and skills. Your contributions will include establishing policies, processes, and frameworks to guide the transformation, as well as mentoring Senior Engineers across the firm. To excel in this role, you should have a bachelor's degree in Engineering with over 14 years of experience in the IT industry. Deep understanding in one programming language, familiarity with architectural principles and patterns, and exposure to cloud service providers such as AWS, Azure, or GCP are essential. Additionally, experience in preparing business proposals, evaluating open-source frameworks, and troubleshooting scalability challenges will be valuable. Your passion for exploring and learning new technologies, along with a strong bias towards engineering and operational excellence, will be critical in driving the firm's technological advancements. You will lead the group in building sophisticated products using a cutting-edge technology stack, ensuring high availability, scalability, and reliability on a cloud platform. Your visible leadership within the technology community, excellent communication and interpersonal skills, and ability to oversee multiple technology initiatives concurrently will be instrumental in your success. By mentoring highly skilled engineers, providing guidance, and coaching, you will contribute significantly to the professional development and growth of the team. Join us at Arcesium, where we value intellectual curiosity, proactive ownership, and collaboration with colleagues. Empower yourself to make a meaningful contribution from day one and accelerate your professional development in a dynamic and innovative environment.,

Posted 3 weeks ago

Apply

14.0 - 18.0 years

0 Lacs

haryana

On-site

As a Senior Principal Engineer at Arcesium, you will play a crucial role in shaping the technology vision of the firm and driving transformation across various verticals. Your responsibilities will include contributing to the technology roadmap, creating and reviewing architectural decisions, and leading technology innovation within the organization. You will actively engage with the tech community through knowledge-sharing sessions and other mediums to enhance employee skills and support future growth. Additionally, you will establish policies, processes, and frameworks to guide the transformation and mentor senior engineers to accelerate professional development. To excel in this role, you should possess a bachelor's degree in Engineering with at least 14 years of experience in the IT industry. Deep understanding of programming languages, architectural principles, and cloud service providers (AWS/Azure/GCP) is essential. You should have hands-on experience in evaluating open-source frameworks, preparing business proposals, and delivering products with low/no touch support. A penchant for exploring and learning new technologies, coupled with a strong bias towards engineering and operational excellence, will be key to your success in this position. As a leader within the technology community at Arcesium, you will be responsible for building sophisticated products using a cutting-edge technology stack that is highly available, scalable, and reliable on a cloud platform. Your visible leadership and mentorship to senior engineers will be instrumental in fostering a culture of continuous learning and innovation. Strong communication, interpersonal, leadership, and motivational skills are prerequisites for this role, along with the ability to oversee multiple technology initiatives concurrently. Join us at Arcesium and be part of a dynamic team that values intellectual curiosity, proactive ownership, and collaboration. Empower yourself to make a meaningful contribution from day one and accelerate your professional development in a high-growth industry where innovation and transformation are at the forefront.,

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

Genpact is a global professional services and solutions firm dedicated to delivering outcomes that shape the future. With over 125,000 employees in 30+ countries, we are fueled by curiosity, agility, and a commitment to creating lasting value for our clients. Our purpose, the relentless pursuit of a world that works better for people, drives us to serve and transform leading enterprises, including the Fortune Global 500, leveraging our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI. We are currently looking for a qualified candidate for the role of Assistant Vice President - APS to join our Practice team as a Presales Production Support. As part of this role, you will provide technical support and expertise to our sales team during the pre-sales phase. Responsibilities: - Thought Leadership - Automation Architecture and Solutions - Collaborate with sales, solutions, and delivery teams - Assist in proposal preparation and solution design - Conduct demonstrations and presentations - Define offerings, partnerships, and positioning - Provide consulting services for Production Support and Reliability Engineering - Stay updated with industry trends and technologies - Drive modernization initiatives in production support and Site Reliability Engineering - Conduct technical assessments and feasibility studies - Manage and oversee delivery for large, complex production support engagements Qualifications: Minimum Qualifications: - Bachelor's degree in computer science or relevant technical field - Strong technical knowledge in enterprise software systems, customer application development, databases, and cloud computing - Familiarity with application support processes and best practices Preferred Qualifications/ Skills: - Expertise in Application support, SRE principles, and cloud hyperscalars - Proficiency in scripting languages and automation tools - Experience with production support tools like ServiceNow, JIRA, AppDynamics, New Relic, ELK stack, Data Dog - Excellent communication and presentation skills - Ability to work independently and collaboratively in a fast-paced environment - Professional certifications in relevant areas (e.g., ITIL, AWS Certified SysOps Administrator) are desirable If you are a dynamic individual with the required qualifications and skills, we invite you to apply for this challenging role as Assistant Vice President - APS at Genpact. Join us in shaping the future and delivering value to our clients. Location: India-Hyderabad Education Level: Bachelor's / Graduation / Equivalent Job Posting: Jul 5, 2024 Unposting Date: Sep 3, 2024 Job Category: Full Time,

Posted 1 month ago

Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

The role of a Manager, BizOps at Mastercard entails leading the Site Reliability Engineering team in solving problems, developing the CI/CD pipeline, and spearheading DevOps automation and best practices. The Business Operations team is at the forefront of the DevOps transformation, advocating for change and standards across various organizational domains. Individuals in this role must possess a penchant for automation, a collaborative spirit, and the ability to work seamlessly across development, operations, and product teams. As a people manager, the incumbent will oversee functions such as Alerting & Monitoring, Capacity Management, CI-CD, Agile, and Production Support using SRE principles and ITIL practices. The role also involves implementing Automation best practices to enhance customer experience and eliminate operational toil. Collaboration with global support teams is essential to establish a responsive and agile operational unit, setting high standards for performance. Key responsibilities include contributing to engineering and production support strategies, troubleshooting applications, minimizing development costs, mentoring staff, and staying abreast of technological advancements. The role involves the entire lifecycle of services, from inception to refinement, and requires proactive engagement in activities such as system design consulting, capacity planning, and incident response. Qualifications for this role include a BS degree in Computer Science or a related field, experience with algorithms, data structures, and scripting, as well as a systematic problem-solving approach. Proficiency in programming languages like C, C++, Java, Python, Go, Perl, or Ruby is preferred, along with expertise in designing and troubleshooting distributed systems. Strong communication skills, adaptability to change, and a collaborative mindset are vital attributes for success in this role. The ideal candidate should have hands-on experience with industry-standard CI/CD tools, architecture techniques, and a broad array of technologies including Cloud platforms, Java, Web Services, Oracle, Linux, messaging protocols, and security coding practices. Additionally, knowledge in governance, strategy, and technology planning is beneficial for establishing effective operational frameworks within the organization. Mastercard emphasizes corporate security responsibility, requiring all employees to adhere to security policies, maintain information confidentiality and integrity, report security breaches, and undergo periodic security training sessions in alignment with company guidelines.,

Posted 1 month ago

Apply

3.0 - 6.0 years

6 - 10 Lacs

Hyderabad

Work from Office

Compute SRE to join our team and ensure our compute infrastructure's reliability, performance, and scalability You will work on building and maintaining highly available systems that power our applications and services. Required Candidate profile 3+ years of experience in systems engineering or operations focus on SRE principles operating systems -Linux, Windows,Storage and Back Up systems, container orchestration platforms -Kubernetes, Docker

Posted 2 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies