Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
2.0 - 6.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, you will play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. You will use technology to solve business problems and leverage software engineering best practices as the team strives towards excellence. Your responsibilities will include executing small to medium projects independently with initial direction and eventually designing and delivering projects by yourself. Collaborating with cross-functional teams will provide you with the opportunity to continually enhance your knowledge about JPMorgan Chase's business and relevant technologies. You will leverage technology to solve business problems by writing high-quality, maintainable, and robust code following best practices in software engineering. Additionally, you will participate in triaging, examining, diagnosing, and resolving incidents, working with others to solve problems at their root. Recognizing the toil within your role, you will proactively work towards eliminating it through systems engineering or updating application code. Understanding observability patterns is crucial, and you will strive to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis. In terms of qualifications, capabilities, and skills, you should have formal training or certification on software engineering concepts and a minimum of 2 years of applied experience. Ability to code in at least one programming language is essential, along with experience maintaining a Cloud-based infrastructure. Familiarity with site reliability concepts, principles, and practices is required, as well as observability practices using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others. Knowledge of containers or common Server OS like Linux and Windows is preferred. Emerging knowledge of software, applications, and technical processes within a given technical discipline and continuous integration and continuous delivery tools are beneficial. You should also have familiarity with common networking technologies and be able to work in a large, collaborative team, demonstrating willingness to vocalize ideas with peers and managers. Preferred qualifications, capabilities, and skills include familiarity with popular IDEs for Software Development and knowledge of using GENAI tools such as Copilot or Windsurf as Code Assistants. General knowledge of the financial services industry is preferred, along with an understanding of NFRs. By joining JPMorgan Chase as a Site Reliability Engineer II, you will have the opportunity to contribute to the reliability and efficiency of the organization's technological infrastructure while continuously enhancing your skills and knowledge in software engineering practices.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As an AI Engineer at Deutsche Bank, you will play a crucial role in designing, developing, and implementing AI-based solutions for CB Tech. By working with large datasets, conducting experiments, and staying updated with the latest advancements in AI and Machine Learning, you will drive innovation and lead efforts to modernize the engineering landscape. Your role will involve identifying AI use cases and providing local support, contributing to the development of innovative products. You will collaborate as part of a cross-functional agile delivery team, bringing an innovative approach to software development and focusing on the latest technologies and practices to deliver business value. Your dedication to open code, open discussion, and creating a supportive, collaborative environment will be essential as you contribute to all stages of software delivery, from initial analysis to production support. Key Responsibilities: - Design, develop, and implement AI and Gen-AI-based Agentic software systems on the cloud. - Collaborate with development teams and subject matter experts to integrate shared services into products. - Operate within Deutsche Bank's AI Governance framework and adhere to safe AI principles. - Utilize architecture decision trees to select strategic AI patterns for solving business problems. - Integrate Gen-AI APIs with cloud-native presentation and persistent layers. - Scale systems while innovating and evolving continuously. - Work with data engineers and scientists to ensure effective data collection and preparation for training AI models. - Monitor AI solution performance and implement improvements. - Lead training sessions and create comprehensive documentation for end users. - Function as an active member of an agile team. Skills You'll Need: - Proficiency in AI frameworks, libraries, and cloud platforms. - Crafting effective prompts and enhancing AI outputs. - Strong understanding of natural language processing and conversational AI. - Experience in model deployment, monitoring, optimization, and problem-solving. - Proficiency in Python or Java, SQL, and RESTful design. - Experience with enterprise and real-world data sets. - Putting ML/AI into production and discussing best practices. - Relationship and consensus building skills. Skills That Will Help You Excel: - Ability to explain AI concepts to non-technical audiences. - Flexibility in learning new tools and developing innovative solutions. - Experience with cloud-native databases/warehouses and data visualization. - Thought leadership in emerging technologies related to AI. Join us at Deutsche Bank and be part of a culture that empowers you to excel daily, take initiative, and work collaboratively. We promote a positive, fair, and inclusive work environment where success is celebrated and shared across our teams. Apply today to drive innovation and shape the future of AI at Deutsche Bank.,
Posted 2 weeks ago
3.0 - 7.0 years
0 Lacs
noida, uttar pradesh
On-site
As a DevOps Engineer, you will play a key role in building and maintaining a robust, scalable, and reliable 0-downtime platform. You will work hands-on with a recently kick-started greenfield initiative with modern infrastructure and automation tools to support our engineering teams. This is a great opportunity to work with a forward-thinking team and have the freedom to approach problems with fresh thinking, embedding AI and automation to help shape our cloud-native journey. If you are passionate about automation, cloud infrastructure, and delivering high-quality production-grade platforms, this role offers the chance to make a real impact. Key Responsibilities Hands-On Development: Design, implement, and optimize AWS infrastructure through hands-on development using Infrastructure as Code tools. Automation & CI/CD: Develop and maintain CI/CD pipelines to automate fast, secure, and seamless deployments. Platform Reliability: Ensure high availability, scalability, and resilience of our platform, leveraging managed services. Monitoring & Observability: Implement and manage proactive observability using DataDog and other tools to monitor system health, performance, and security, ensuring that we can see and fix issues before they impact users. Cloud Security & Best Practices: Apply cloud and security best practices, including patching and secure configuration of networking, encryption (at rest and in transit), secrets, and identity/access management. Continuous Improvement: Contribute ideas and solutions to improve our DevOps processes. AI & Future Tech: We aim to push the boundaries of AI-driven development. If you have ideas on how to embed AI into our DevOps processes, you will have the space to explore them. Your Experience Tech stack: We use Terraform, Terragrunt, Helm, Python, Bash, AWS (EKS, Lambda, EC2, RDS/Aurora), Linux OS & Github Actions. You are comfortable with all of these and have strong hands-on experience with Terraform and IaC principles, CI/CD, and the AWS ecosystem. Proven experience with Networking (VPC, Subnets, Security Groups, API Gateway, Load Balancing, WAF) and Cloud configuration (Secrets Manager, IAM, KMS). Comfortable with Kubernetes, ArgoCD, Isitio & Deployment strategies (blue/green & canary). Familiarity with Cloud Security services such as Security Hub, Guard Duty, Inspector, and vulnerability management/patching. Observability Mindset: You believe in measuring everything. You have worked with DataDog (or similar) to ensure teams have visibility into platform health and security. Experience with embedding AI into DevOps processes is advantageous.,
Posted 2 weeks ago
8.0 - 15.0 years
70 - 100 Lacs
Bengaluru, Karnataka, India
On-site
This role is for one of the Weekday's clients Salary range: Rs 7000000 - Rs 10000000 (ie INR 70-100 LPA) Min Experience: 8 years Location: Bengaluru JobType: full-time Platform Infrastructure (PI) serves as the foundational layer that empowers engineering teams to focus on solving high-impact challenges without being hindered by infrastructure complexities. As organizations scale, the efficiency, reliability, and innovation of PI directly dictate their ability to deliver quickly and securely across global markets. This role positions infrastructure not just as a support function, but as a strategic enabler of innovation and growth. Requirements What You Will Do: Lead the architecture and implementation of a scalable, unified observability platform that consolidates fragmented systems across both federal and commercial deployments. Design and deploy greenfield observability solutionsincluding distributed request tracing, advanced log analysis, and Prometheus scalingto support a rapidly expanding multi-region infrastructure across the US, EU, Japan, and emerging data centers. Drive operational excellence by automating repetitive infrastructure tasks, solving system issues through code, and participating in a follow-the-sun on-call model across global teams. Own critical observability infrastructure components that ensure system reliability, performance, and uptime across all product lines. Collaborate closely with cross-functional stakeholders including Product Managers, Technical Program Managers, and Data Scientists to align infrastructure goals with product needs. Must-Have Qualifications: 8+ years of software engineering experience with deep expertise in distributed systems, Python or GoLang, and AWS cloud services. Proven success in designing and deploying enterprise-grade observability stacks (metrics, logging, tracing) for complex, multi-region environments. Strong technical leadership skills with a track record of mentoring engineers and driving innovation while maintaining operational stability. Demonstrated ability to optimize high-throughput, low-latency systems and automate large-scale operational workflows. Excellent communication skills with the ability to bridge technical and non-technical teams and drive cross-functional collaboration. Nice to Have: Master's degree in Computer Science, Electrical Engineering, or a related technical field. Hands-on experience with observability tools such as Datadog, Chronosphere, Splunk, Prometheus, and Grafana. Exposure to or experience within the cybersecurity industry. Key Skills: Platform Infrastructure | Observability | Distributed Systems | Backend Engineering | Python | GoLang | AWS | Metrics & Tracing | Log Analysis | Technical Leadership
Posted 2 weeks ago
12.0 - 22.0 years
45 - 65 Lacs
Hyderabad
Work from Office
Role & responsibilities As a Senior Manager, you will work with and manage the engineering team in the Hyderabad Development Centre to deliver the goals and objectives of the business. As a leader, you must be capable of working in a matrixed organization and coordinating the delivery of multiple outcomes. You will be hands-on in terms of design, architecture, and development and should be able to lead the team from front in any critical situation. As a people leader first and a delivery manager second, you must build, inspire, and lead the technical teams. In this role, you are expected to work with stakeholders and internal customers across the different GAP tech locations. You will be managing the Engineering Platform Observe team that set modern architecture principles to promote innovation, flexibility, and reuse. Our team support the engineering teams in building automation to help enable developer success across all our brands and markets. You'll play a key role in building, maintaining, and supporting GAPs next-generation Observability platform enabling innovation, solutioning and exceptional developer experience. We have a sharp technical team, and you will be working with many high-performing software development professionals in a friendly, open-minded, and diverse environment. What Youll Do: Lead DevOps best practices and mentor a team of Observability engineers working towards optimizing our monitoring solutions. Develop the roadmap and strategy of seamlessly onboarding the Product teams on our Observability solutions. Architecture and enhance implementation of Observability platforms across the organization. Present possible updates, recommendations, strategic opportunities to local & US leadership. Develop relationships with local business leaders. Strong desire to simplify the developers debug experience by adopting and on boarding the right tools across the enterprise. Develop an understanding of GAP's Observability Pipelines to automate and enhance user experience. Participate in the design of new or changing monitoring needs. Build, operationalize, and maintain Observability solutions for our technology customers Participate in problem solving and troubleshooting for the assigned applications, functional areas or projects Stay current with changes in the technical area of expertise Build, maintain, and support enterprise production systems with a business mindset, keeping an eye towards simplicity, reliability, maintainability, scalability, extensibility and performance Drive resolution of operational and production issues in a timely manner. Support internal customers in adopting our Next Generation Observability pipelines. Work with the team to develop features and improvements. Identifies opportunities to eliminate or automate remediation through RCA for recurring issues to improve overall operational stability of software applications and systems Preferred candidate profile Minimum 5 years experience in Engineering Leadership position, overall 12+ years of work experience. Hands on experience and managing operations of large-scale internet-centric production environments for application or infrastructure services serving tens to millions of end users. Excellent decision-making, problem-solving and time management skills. Demonstrated ability to innovate and operate outside the comfort zone of established methods and procedures Demonstrated ability to gain immediate credibility at all levels both inside and outside the organization and develop lasting, productive and collaborative relationships Excellent communication and influencing skills including the ability to simplify key messages, present compelling stories and promote technical and personal credibility with internal and external executives, and both technical and non-technical audiences Willingly shares relevant technical and/or industry knowledge and expertise in order to mentor team members. Strong hands on experience with latest Observability trends. Asses new Observability technologies and their potential fit within our current ecosystem. Support the team's technical growth through code reviews, architecture discussions, and knowledge sharing. Drive the development of tools to streamline developer workflows, in collaboration with other teams. Efficiently collaborate with other cross-functional teams in driving initiatives. Participate in an on-call rotation as needed by the business. Retail/Ecommerce industry experience preferred Strong considerable hands-on experience with monitoring tools like Grafana, Prometheus, OpenTelemetry (OTEL), NewRelic, Nagios & Splunk or similar tools. Proficiency with Infrastructure as Code patterns & tools (e.g. ARM, Terraform, GitOps) Proficiency with Multi cloud platforms Observability solutions like Azure Monitor, Google Cloud Observability or AWS Cloudwatch, Working on at least one Kubernetes cloud offering (AKS/GKE) or on-prem Kubernetes (native Kubernetes) Experience with Unix platforms, system administration skills in UNIX Appreciation and preference for open-source solutions like OTEL or eBPF. Ability to maintain and manage observability tools to look at logs, metrics & traces to diagnose issues within that system. Experience in scaling infrastructure to support high-throughput data-intensive applications Experience working on projects following Agile methodologies You're proficient in at least one programming language (e.g., Python, Java, Go) and comfortable working across different types of languages as needed. Working knowledge of Collaboration tools like Slack, JIRA & Confluence & Service Management tools like ServiceNow & PagerDuty About Us: Hyderabad Development Center (HDC): Launched in March 2017 with a small pilot team, Gap Inc.’s Hyderabad Development Center has grown into the India’s largest fashion retail technology hub with 800+ employees today. HDC plays a pivotal role in driving innovation across digital technology, engineering, employee enablement, cybersecurity, data science, product management and customer experience. Home to 40% of Gap Inc.’s global tech workforce, this young and diverse team is powering cutting-edge e-commerce and enterprise solutions for our people and iconic brands. Our growth is powered by a strong focus on nurturing talent and shaping the next generation of innovators in fashion retail technology. About Gap Inc.: Gap Inc., a house of iconic brands, is the largest specialty apparel company in America. Its Old Navy, Gap, Banana Republic, and Athleta brands offer clothing, accessories, and lifestyle products for men, women and children. Since 1969, Gap Inc. has created products and experiences that shape culture, while doing right by employees, communities and the planet. Gap Inc. products are available worldwide through company-operated stores, franchise stores, and e-commerce sites. Fiscal year 2024 net sales were $15.1 billion. For more information, please visit www.gapinc.com.
Posted 2 weeks ago
10.0 - 14.0 years
0 Lacs
karnataka
On-site
As a Principal Engineer at Walmart Enterprise Business Services (EBS), you will play a crucial role in shaping the engineering direction and driving architectural decisions. You will be responsible for leading the design and development of full stack applications with high scalability and resilience. Your expertise in cloud-native development on Google Cloud Platform (GCP) will be instrumental in ensuring the delivery of secure and high-performing solutions across the platform. In this role, you will be expected to architect complex cloud-native systems using a variety of GCP services, define best practices, and drive engineering excellence across teams. Your responsibilities will include building and optimizing APIs and frontend frameworks, guiding the adoption of serverless and container-based architectures, and championing CI/CD pipelines and Infrastructure as Code (IaC) practices. You will collaborate cross-functionally with product, design, and data teams to translate business requirements into scalable technical solutions. Additionally, you will act as a trusted technical advisor and mentor to staff and senior engineers, staying ahead of industry trends and evaluating new tools and frameworks to enhance productivity and performance. To be successful in this role, you should have a minimum of 10 years of experience in full stack development, with at least 2 years in a technical leadership or principal engineering role. Deep proficiency in JavaScript/TypeScript, Python, or Go is required, along with expertise in modern frontend frameworks, cloud-native systems on GCP, microservices architecture, and DevOps practices. Strong communication, leadership, and collaboration skills are essential, along with a GCP Professional Certification and experience with serverless platforms and observability tools. Joining Walmart Global Tech means working in an environment where your contributions can impact the lives of millions of people. As part of a team that values innovation and empowerment, you will have the opportunity to grow your skills and expertise while driving meaningful change in the retail industry. At Walmart, we strive to create a culture of belonging where every associate is valued for who they are. Our commitment to diversity and inclusion allows us to engage associates, strengthen our business, and better serve our customers and communities around the world. As an Equal Opportunity Employer, Walmart is dedicated to understanding, respecting, and valuing the unique experiences and identities of all individuals.,
Posted 2 weeks ago
4.0 - 8.0 years
0 Lacs
chennai, tamil nadu
On-site
The role within Services - Client & Services Assurance group falls under the BCC - Orion stream. As an App Support Senior Analyst, you are expected to possess techno-functional skills. Your primary responsibilities will involve investigating alerts to determine root causes through deep dives into application flows. Additionally, you will be responsible for communicating with stakeholders regarding system and business service availability. In the event of major incidents, you will play a key role in coordinating, communicating, and escalating issues that impact the delivery of TTS Services flow to client-facing teams. It will be essential for you to identify and eliminate operational toils from the day-to-day support operational workload. Furthermore, you will lead service efficiency and stability improvement project initiatives to enhance the overall client experience. Acting as a liaison between the Business and Technology teams, you will facilitate the rapid escalation of incidents or market events and provide a business perspective on remediation options. To qualify for this role, you should hold a technology academic degree with 4-6 years of relevant work experience. Proficiency in common database skills such as SQL queries (Oracle, Mongo DB, Sybase) is required. Familiarity with popular application monitoring tools like Grafana, ITRS Geneos, AppDynamics is essential. Basic knowledge of webserver technologies (WebLogic or WebSphere) and middleware technologies like MQ, Kafka is expected. Experience working with ITIL tools like ServiceNow, along with ITIL Foundation Certification, is preferred. Strong technical troubleshooting and problem-solving skills, coupled with effective verbal and written communication abilities, are crucial for success in this role. A basic understanding of Observability, Site Reliability Engineering (SRE), and Open Telemetry Principles will be beneficial. This is a full-time position in the Technology job family group, specifically within the Applications Support job family. For more information on the most relevant skills required for this role, please refer to the qualifications listed above. Additional complementary skills may also be discussed with the recruiter. If you require a reasonable accommodation due to a disability to utilize our search tools or apply for a career opportunity, please review the Accessibility at Citi information. You can also view Citi's EEO Policy Statement and the Know Your Rights poster for further details.,
Posted 2 weeks ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
As a Senior Software DevOps Engineer, you will lead the design, implementation, and evolution of telemetry pipelines and DevOps automation that enable next-generation observability for distributed systems. You will blend a deep understanding of Open Telemetry architecture with strong DevOps practices to build a reliable, high-performance, and self-service observability platform across hybrid cloud environments (AWS & Azure). Your mission is to empower engineering teams with actionable insights through rich metrics, logs, and traces, while championing automation and innovation at every layer. You will be responsible for: Observability Strategy & Implementation: Architect and manage scalable observability solutions using OpenTelemetry (OTel), encompassing Collectors, Instrumentation, Export Pipelines, Processors & Extensions for advanced enrichment and routing. DevOps Automation & Platform Reliability: Own the CI/CD experience using GitLab Pipelines, integrating infrastructure automation with Terraform, Docker, and scripting in Bash and Python. Build resilient and reusable infrastructure-as-code modules across AWS and Azure ecosystems. Cloud-Native Enablement: Develop observability blueprints for cloud-native apps across AWS (ECS, EC2, VPC, IAM, CloudWatch) and Azure (AKS, App Services, Monitor). Optimize cost and performance of telemetry pipelines while ensuring SLA/SLO adherence for observability services. Monitoring, Dashboards, and Alerting: Build and maintain intuitive, role-based dashboards in Grafana, New Relic, enabling real-time visibility into service health, business KPIs, and SLOs. Implement alerting best practices integrated with incident management systems. Innovation & Technical Leadership: Drive cross-team observability initiatives that reduce MTTR and elevate engineering velocity. Champion innovation projects including self-service observability onboarding, log/metric reduction strategies, AI-assisted root cause detection, and more. Mentor engineering teams on instrumentation, telemetry standards, and operational excellence. Requirements: - 6+ years of experience in DevOps, Site Reliability Engineering, or Observability roles - Deep expertise with OpenTelemetry, including Collector configurations, receivers/exporters (OTLP, HTTP, Prometheus, Loki), and semantic conventions - Proficient in GitLab CI/CD, Terraform, Docker, and scripting (Python, Bash, Go). Strong hands-on experience with AWS and Azure services, cloud automation, and cost optimization - Proficiency with observability backends: Grafana, New Relic, Prometheus, Loki, or equivalent APM/log platforms - Passion for building automated, resilient, and scalable telemetry pipelines - Excellent documentation and communication skills to drive adoption and influence engineering culture Nice to Have: - Certifications in AWS, Azure, or Terraform - Experience with OpenTelemetry SDKs in Go, Java, or Node.js - Familiarity with SLO management, error budgets, and observability-as-code approaches - Exposure to event streaming (Kafka, RabbitMQ), Elasticsearch, Vault, Consul,
Posted 2 weeks ago
10.0 - 14.0 years
0 Lacs
pune, maharashtra
On-site
We are seeking a highly experienced and passionate Senior UI/UX Lead to take ownership of both user experience design and frontend architecture for our product suite. The ideal candidate will have a blend of strong design sensibility and deep technical expertise to create scalable, performant, and user-centric digital experiences. As the Senior UI/UX Lead, you will be responsible for leading design strategy, establishing UI architecture standards, guiding cross-functional teams, and playing a critical role in designing, developing, and maintaining high-quality software solutions. Your role will involve leading the end-to-end design and development process, including user research, prototyping, and final implementation. You will architect scalable, modular, and maintainable frontend solutions, drive UX strategy through data-backed insights, and build and maintain a design system to ensure consistency and accessibility. Additionally, you will be responsible for defining and enforcing frontend testing strategies, implementing observability practices, and developing robust, scalable applications. As the Senior UI/UX Lead, you will drive the adoption of modern engineering practices, advocate for automated testing and continuous monitoring, and ensure adherence to secure coding practices. You will collaborate with architects, product owners, and cross-functional teams to design scalable systems, mentor junior engineers, and lead technical discussions to provide guidance on modern software architectures. The ideal candidate should have 10+ years of experience in UI/UX design and frontend development, with at least 3 years in a leadership/UI architect role. Strong proficiency in JavaScript/TypeScript, React, Angular, or similar modern frameworks is required, along with a deep understanding of UI architecture, design patterns, and performance optimization techniques. Experience with user research, wireframing, prototyping tools, design systems, and accessibility guidelines is essential. Strong communication and collaboration skills, coupled with problem-solving abilities and an automation-first mindset, are also important for this role. Good-to-have skills include experience with event-driven architecture, distributed systems, caching solutions, trunk-based development, feature flags, and progressive delivery strategies. Familiarity with backend integration, GraphQL/REST APIs, frontend monitoring, and analytics tools is a plus, as well as exposure to modern cloud-native technologies. If you are someone with a disability and require a reasonable accommodation to use our search tools or apply for a career opportunity, please review Accessibility at Citi. For more information on Citi's EEO Policy Statement and the Know Your Rights poster, please visit the Citi website.,
Posted 2 weeks ago
2.0 - 6.0 years
0 Lacs
karnataka
On-site
As a Technology Support II team member at JPMorgan Chase, you will be instrumental in maintaining the operational stability, availability, and performance of our production application flows. Your primary responsibilities will include analyzing and troubleshooting production application flows to ensure seamless service delivery, participating in problem management to enhance operational stability and availability, monitoring production environments for anomalies, and communicating effectively with stakeholders to address and resolve issues promptly. You will also be expected to identify trends and provide support for incidents, problems, and changes related to full stack technology systems, applications, or infrastructure. This role may involve providing on-call coverage during weekends to ensure continuous operational support. The ideal candidate for this position should have at least 2 years of experience working with Data/Python applications in a Production environment. Proficiency in programming or scripting language, particularly Python, is required. Experience with containers and container orchestration (such as Kubernetes), orchestration tools (like Control-M), cloud platforms (specifically AWS) with infrastructure provisioning using Terraform, as well as exposure to observability and monitoring tools, will be beneficial. Strong communication and collaboration skills are essential for effective engagement in a fast-paced, dynamic environment. Additionally, preferred qualifications include experience supporting applications on platforms like Databricks, Snowflake, or AWS EMR (with Databricks being preferred), a proactive approach to self-education and evaluation of new technologies, and knowledge of virtualization, cloud architecture, services, and automated deployments.,
Posted 2 weeks ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
As a Ping Engineer at TEKsystems, you will be an integral part of the Cloud migration & Application modernizing team dedicated to facilitating Cloud adoption for our banking client. Putting customers at the core of our operations, we leverage technology to enhance efficiency and ensure timely processing, a pivotal aspect of delivering exceptional customer service. Our continuous quest for innovation is driven by the dynamic nature of our systems and the evolving expectations of our customers, propelling us to explore cutting-edge technologies, experiment, and push boundaries. Based in Manyata Bangalore, this role requires a minimum of 6 years of industry experience. You will be expected to possess a proven track record in developing CIAM and PING solutions, along with a robust understanding of CIAM, authentication, and authorization processes. Proficiency in Ping Identity products, specifically Ping Federate and Ping Directory, is essential. In-depth knowledge of authentication and authorization standards such as OAuth2/OIDC, as well as identity and access management protocols like SSO, SAML, LDAP, OAuth, OIDC, and FIDO, will be crucial to your success in this role. Your responsibilities will include developing and maintaining authentication and authorization services, ensuring observability and monitoring of applications, and proficiently conducting test automation. Strong programming skills in languages such as C#, Python, Golang, or Java are a must, along with experience in GitHub Actions workflows. Your problem-solving abilities, keen attention to detail, and effective communication and collaboration skills will be key assets in this role. If you have the required expertise and experience, we encourage you to submit your resume to nvaseemuddin@teksystems.com to be considered for this exciting opportunity. #PingFederate #PingIdentity #PingDirectory #Javascript #OAuth,
Posted 2 weeks ago
6.0 - 10.0 years
15 - 30 Lacs
Hyderabad, Pune, Bengaluru
Hybrid
Senior DevOps for SDAP Registry team managing system health. Skills in Windows/Linux, Docker, K8s, GitLab CI/CD, MSSQL, Grafana, ELK, TCP/IP, Ansible, Solace, Mulesoft. Key tasks: system admin, automation, observability, security & pipeline dev.
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Digital, you will play a crucial role in shaping the future of a globally recognized firm. Your contributions will have a direct and substantial impact in a domain specifically crafted for high-performing individuals in site reliability. Your primary responsibility will be to lead your team by demonstrating strong expertise across multiple technical domains and providing guidance on both technical and business matters. You will be in charge of conducting resiliency design reviews, breaking down complex issues into manageable tasks for other engineers, serving as a technical lead for medium to large-sized products, and offering advice and mentorship to fellow engineers. You will also be expected to: - Demonstrate and promote a culture of site reliability practices within your team and exert technical influence - Lead initiatives aimed at enhancing the reliability and stability of your team's applications and platforms by leveraging data-driven analytics - Collaborate with team members to establish comprehensive service level indicators and reasonable service level objectives with stakeholders - Showcase a high level of technical proficiency within one or more technical domains and proactively address technology-related bottlenecks - Act as the primary point of contact during major incidents, swiftly identifying and resolving issues to prevent financial losses - Share knowledge within the organization through internal forums and communities of practice To qualify for this role, you should possess: - Formal training or certification in software engineering concepts along with at least 5 years of practical experience - Proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices - Fluency in at least one programming language (e.g., Python, Java Spring Boot, .Net) - Deep understanding of software applications and technical processes, with emerging expertise in one or more technical disciplines - Experience in observability tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk - Proficiency in continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform - Familiarity with container and container orchestration technologies such as ECS, Kubernetes, Docker - Experience troubleshooting common networking technologies and issues - Ability to tackle problems related to complex data structures and algorithms - Eagerness to self-educate and explore new technologies - Capability to teach new programming languages to team members Join us in this exciting opportunity to lead and drive excellence in site reliability engineering at JPMorgan Chase, where your skills and knowledge will make a significant impact on our digital landscape.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
You will lead the development of high-performance backend services using Java and Spring Boot, designing and building reliable and scalable REST APIs and microservices. Taking ownership of features and system components throughout the software lifecycle will be your responsibility. You will also design and implement CI/CD workflows using tools like Jenkins or GitHub Actions, contributing to architectural decisions, code reviews, and system optimizations. Your expertise in Java and advanced experience with the Spring Boot framework will be essential, along with proven experience in building and scaling REST APIs and microservices. Hands-on experience with CI/CD automation and DevOps tools is required, as well as working knowledge of distributed systems, cloud platforms, and Kafka. A strong understanding of system design, performance optimization, and best coding practices is crucial for this role. Proficiency in Docker and Kubernetes for containerized deployments, exposure to NoSQL databases such as MongoDB and Cassandra, and experience with configuration server management and dynamic config updates are nice-to-have skills. Familiarity with monitoring and logging tools like Prometheus, ELK Stack, or others, along with awareness of cloud security standards, observability, and incident management will be beneficial. This is a full-time position with benefits including Provident Fund. The work schedule is during the day shift, and the job requires at least 5 years of experience in Java Developer, Docker and Kubernetes, NoSQL databases MongoDB, Cassandra, Kafka, Spring Boot framework, Jenkins, GitHub, REST APIs, system design, cloud architectures and microservices, monitoring and logging tools, and awareness of cloud security. Work location is in person.,
Posted 2 weeks ago
1.0 - 5.0 years
0 Lacs
karnataka
On-site
At Goldman Sachs, our Engineers are dedicated to making the impossible possible. We are committed to changing the world by bridging the gap between people and capital with innovative ideas. Our mission is to tackle the most complex engineering challenges for our clients, crafting massively scalable software and systems, designing low latency infrastructure solutions, proactively safeguarding against cyber threats, and harnessing the power of machine learning in conjunction with financial engineering to transform data into actionable insights. Join our engineering teams to pioneer new businesses, revolutionize finance, and seize opportunities in the fast-paced world of global markets. Engineering at Goldman Sachs, consisting of our Technology Division and global strategists groups, stands at the heart of our business. Our dynamic environment demands creative thinking and prompt, practical solutions. If you are eager to explore the limits of digital possibilities, your journey starts here. Goldman Sachs Engineers embody innovation and problem-solving skills, developing solutions in various domains such as risk management, big data, and mobile technology. We seek imaginative collaborators who can adapt to change and thrive in a high-energy, global setting. The Data Engineering group at Goldman Sachs plays a pivotal role across all aspects of our business. Focused on offering a platform, processes, and governance to ensure the availability of clean, organized, and impactful data, Data Engineering aims to scale, streamline, and empower our core businesses. As a Site Reliability Engineer (SRE) on the Data Engineering team, you will oversee observability, cost, and capacity, with operational responsibility for some of our largest data platforms. We are actively involved in the entire lifecycle of platforms, from design to decommissioning, employing an SRE strategy tailored to this lifecycle. We are looking for individuals who have a development background and are proficient in code. Candidates should prioritize Reliability, Observability, Capacity Management, DevOps, and SDLC (Software Development Lifecycle). As a self-driven leader, you should be comfortable tackling problems with varying degrees of complexity and translating them into data-driven outcomes. You should be actively engaged in strategy development, participate in team activities, conduct Postmortems, and possess a problem-solving mindset. Your responsibilities as a Site Reliability Engineer (SRE) will include driving the adoption of cloud technology for data processing and warehousing, formulating SRE strategies for major platforms like Lakehouse and Data Lake, collaborating with data consumers and producers to align reliability and cost objectives, and devising strategies with data using relevant technologies such as Snowflake, AWS, Grafana, PromQL, Python, Java, Open Telemetry, and Gitlab. Basic qualifications for this role include a Bachelor's or Master's degree in a computational field, 1-4+ years of relevant work experience in a team-oriented environment, at least 1-2 years of hands-on developer experience, familiarity with DevOps and SRE principles, experience with cloud infrastructure (AWS, Azure, or GCP), a proven track record in driving data-oriented strategies, and a deep understanding of data multi-dimensionality, curation, and quality. Preferred qualifications entail familiarity with Data Lake / Lakehouse technologies, experience with cloud databases like Snowflake and Big Query, understanding of data modeling concepts, working knowledge of open-source tools such as AWS Lambda and Prometheus, and proficiency in coding with Java or Python. Strong analytical skills, excellent communication abilities, a commercial mindset, and a proactive approach to problem-solving are essential traits for success in this role.,
Posted 2 weeks ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology, you will be at the center of a rapidly growing field in technology. Your role involves applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. You will be responsible for solving complex business problems with simple solutions through code and cloud infrastructure. Your tasks will include configuring, maintaining, monitoring, and optimizing applications and their associated infrastructure. You will play a vital role in decomposing and iteratively improving existing solutions, contributing significantly to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of applications or platforms. Responsibilities: - Guide and assist others in building appropriate level designs and gaining consensus from peers - Collaborate with software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines - Design, develop, test, and implement availability, reliability, scalability, and solutions in applications - Implement infrastructure, configuration, and network as code for applications and platforms - Collaborate with technical experts, key stakeholders, and team members to resolve complex problems - Understand service level indicators and utilize service level objectives to proactively resolve issues - Support the adoption of site reliability engineering best practices within the team Required Qualifications: - Formal training or certification on software engineering concepts with 3+ years of applied experience - Proficiency in site reliability culture and principles, and familiarity with implementing site reliability within an application or platform - Proficiency in at least one programming language such as Python, Java/Spring Boot, and .Net - Knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.) - Experience in observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. - Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform - Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker - Familiarity with troubleshooting common networking technologies and issues - Ability to contribute to large and collaborative teams with limited supervision - Proactive recognition of roadblocks and interest in learning innovative technologies - Ability to identify new technologies and relevant solutions to meet design constraints Preferred Qualifications: - Familiarity with popular IDEs for Software Development - General knowledge of the financial services industry (preferred),
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As a Software Engineer- Operational Support Systems at Barclays, you will embark on a transformative journey, spearheading the evolution of the digital landscape and driving innovation and excellence. Your primary responsibility will be the design, build, and maintenance of the underlying OSS infrastructure and toolchains necessary to run the Barclays Global Network, deployed across cloud and On-Prem environments. To excel in this role, you should possess expertise in both front-end and back-end development. Proficiency in Java (Java 17+) and the Spring Ecosystem (Spring MVC, Data JPA, Security, etc.), along with strong SQL and NoSQL integration skills, are essential. Additionally, you should have experience with React.js, JavaScript frameworks like material UI and Ant design, and state management tools like Redus, Zustand, or Context API. Your role will also involve working with runtime technologies such as virtualization, containers, and Kubernetes, and implementing test-driven development using frameworks like Cypress, Playwright, or Selenium. Proficiency in CI/CD pipelines and tools like GitHub Actions, Jenkins, or Gitlab CI, as well as knowledge of monitoring and observability tools like Grafana/ELK, are crucial for success in this position. Highly valued skills for this role include expertise in building ELT pipelines, cloud/storage integrations, security practices (OAuth2, CSRF/XSS protection), performance optimization, and familiarity with Public, Private, and Hybrid Cloud technologies and Network domains. In this role, you will collaborate with product managers, designers, and fellow engineers to develop high-quality software solutions that are scalable, maintainable, and optimized for performance. You will actively contribute to the organization's technology communities, stay informed of industry trends, and promote a culture of technical excellence and growth. As a Software Engineer at Barclays, you will be expected to adhere to secure coding practices, implement effective unit testing, and actively contribute to a culture of code quality and knowledge sharing. Your role will be based in the Pune office, and you will play a crucial part in designing, developing, and improving software that enhances business, platform, and technology capabilities for customers and colleagues.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Digital, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and conduct resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers. Demonstrate and champion site reliability culture and practices and exert technical influence throughout your team. Lead initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to enhance service levels. Collaborate with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers. Display a high level of technical expertise within one or more technical domains and proactively identify and solve technology-related bottlenecks in your areas of expertise. Act as the main point of contact during major incidents for your application and demonstrate the skills to identify and solve issues quickly to avoid financial losses. Document and share knowledge within your organization via internal forums and communities of practice. Required qualifications, capabilities, and skills include formal training or certification on software engineering concepts and 5+ years of applied experience. Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform. Fluency in at least one programming language such as Python, Java Spring Boot, .Net, etc. Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines. Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.). Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.). Experience with troubleshooting common networking technologies and issues. Ability to identify and solve problems related to complex data structures and algorithms. Drive to self-educate and evaluate new technology. Ability to teach new programming languages to team members.,
Posted 2 weeks ago
2.0 - 6.0 years
0 Lacs
karnataka
On-site
Propel operational success with your expertise in technology support and a commitment to continuous improvement. As a Technology Support II team member within JPMorgan Chase, you will play a vital role in ensuring the operational stability, availability, and performance of our production application flows. You will be responsible for troubleshooting, maintaining, identifying, escalating, and resolving production service interruptions for all internally and externally developed systems, thereby supporting a seamless user experience and fostering a culture of continuous improvement. You will analyze and troubleshoot production application flows to ensure end-to-end application or infrastructure service delivery supporting the business operations of the firm. Your contributions will be instrumental in improving operational stability and availability through participation in problem management. Additionally, you will monitor production environments for anomalies and address issues utilizing standard observability tools. Your role will involve assisting in the escalation and communication of issues and solutions to the business and technology stakeholders. Furthermore, you will play a key role in identifying trends and assisting in the management of incidents, problems, and changes in support of full stack technology systems, applications, or infrastructure. There may be instances where you will be required to provide on-call coverage during weekends. **Job Responsibilities:** - Analyze and troubleshoot production application flows to ensure end-to-end application or infrastructure service delivery supporting the business operations of the firm. - Improve operational stability and availability through participation in problem management. - Monitor production environments for anomalies and address issues utilizing standard observability tools. - Assist in the escalation and communication of issues and solutions to the business and technology stakeholders. - Identify trends and assist in the management of incidents, problems, and changes in support of full stack technology systems, applications, or infrastructure. - May require the role to provide on-call coverage during weekends. **Required qualifications, capabilities, and skills:** - Possess 2+ years of experience, ideally working with Data/Python applications in a production environment. - Experience in a programming or scripting language (Python). - Experience working with containers and container orchestration (Kubernetes). - Experience working with orchestration tools (Control-M). - Experience with cloud platforms (AWS), ideally provisioning infrastructure using Terraform. - Exposure to observability and monitoring tools and techniques. - Good communication and collaboration skills, with the ability to work effectively in a fast-paced, dynamic environment. **Preferred qualifications, capabilities, and skills:** - Significant advantage to have experience supporting applications on platforms such as Databricks, Snowflake, or AWS EMR (Databricks preferred). - Actively self-educates, evaluate new technology, and recommend suitable ones. - Knowledge of virtualization, cloud architecture, services, and automated deployments.,
Posted 2 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As a Software Engineer- Operational Support Systems at Barclays, you will lead the evolution of the digital landscape by utilizing cutting-edge technology to enhance the digital offerings and ensure unmatched customer experiences. Your primary responsibility will be the design, construction, and maintenance of the OSS infrastructure and toolchains necessary to operate the Barclays Global Network across cloud and On-Prem environments. To excel in this role, you must demonstrate proficiency in front-end and back-end technologies, including Java (Java 17+), Spring Ecosystem, SQL, NoSQL, React.js, JavaScript, and state management expertise. Additionally, you should have expertise in virtualization, containers, Kubernetes, test-driven development, CI/CD pipelines, monitoring, observability, ELT pipelines, cloud/storage integrations, security practices, and performance optimization. Your role will involve collaborating with product managers, designers, and other engineers to define software requirements, devise solutions, and ensure alignment with business objectives. You will also participate in code reviews, promote a culture of code quality, and stay updated on industry technology trends to contribute to the organization's technology communities. As a Vice President, your responsibilities may include setting strategies, driving requirements, managing resources, policies, and budgets, delivering continuous improvements, and advising key stakeholders on functional areas of impact and alignment. You will also demonstrate leadership, accountability for risk management, and collaborate with other business areas to achieve organizational goals. All colleagues at Barclays are expected to uphold the Barclays Values of Respect, Integrity, Service, Excellence, and Stewardship, as well as the Barclays Mindset of Empower, Challenge, and Drive.,
Posted 2 weeks ago
1.0 - 5.0 years
0 Lacs
karnataka
On-site
At Goldman Sachs, our Engineers don't just make things - we make things possible. We change the world by connecting people and capital with ideas, solving the most challenging and pressing engineering problems for our clients. Join our engineering teams that build massively scalable software and systems, architect low latency infrastructure solutions, proactively guard against cyber threats, and leverage machine learning alongside financial engineering to continuously turn data into action. Create new businesses, transform finance, and explore a world of opportunity at the speed of markets. Engineering, which is comprised of our Technology Division and global strategists groups, is at the critical center of our business. Our dynamic environment requires innovative strategic thinking and immediate, real solutions. If you want to push the limit of digital possibilities, start here. Goldman Sachs Engineers are innovators and problem-solvers, building solutions in risk management, big data, mobile, and more. We look for creative collaborators who evolve, adapt to change, and thrive in a fast-paced global environment. Data plays a critical role in every facet of the Goldman Sachs business. The Data Engineering group is at the core of that offering, focusing on providing the platform, processes, and governance for enabling the availability of clean, organized, and impactful data to scale, streamline, and empower our core businesses. As a Site Reliability Engineer (SRE) on the Data Engineering team, you will be responsible for observability, cost, and capacity with operational accountability for some of Goldman Sachs's largest data platforms. We engage in the full lifecycle of platforms from design to demise with an adapted SRE strategy to the lifecycle. We are looking for individuals with a background as a developer who can express themselves in code. You should have a focus on Reliability, Observability, Capacity Management, DevOps, and SDLC (Software Development Lifecycle). As a self-leader comfortable with problem statements, you should structure them into data-driven deliverables. You will drive strategy with skin in the game, participate in the team's activities, drive Postmortems, and have an attitude that the problem stops with you. **How You Will Fulfil Your Potential** - Drive adoption of cloud technology for data processing and warehousing - Drive SRE strategy for some of GS's largest platforms including Lakehouse and Data Lake - Engage with data consumers and producers to match reliability and cost requirements - Drive strategy with data **Relevant Technologies**: Snowflake, AWS, Grafana, PromQL, Python, Java, Open Telemetry, Gitlab **Basic Qualifications** - A Bachelor's or Master's degree in a computational field (Computer Science, Applied Mathematics, Engineering, or in a related quantitative discipline) - 1-4+ years of relevant work experience in a team-focused environment - 1-2 years hands-on developer experience at some point in career - Understanding and experience of DevOps and SRE principles and automation, managing technical and operational risk - Experience with cloud infrastructure (AWS, Azure, or GCP) - Proven experience in driving strategy with data - Deep understanding of multi-dimensionality of data, data curation, and data quality - In-depth knowledge of relational and columnar SQL databases, including database design - Expertise in data warehousing concepts - Excellent communication skills - Independent thinker, willing to engage, challenge, or learn - Ability to stay commercially focused and to always push for quantifiable commercial impact - Strong work ethic, a sense of ownership and urgency - Strong analytical and problem-solving skills - Ability to build trusted partnerships with key contacts and users across business and engineering teams **Preferred Qualifications** - Understanding of Data Lake / Lakehouse technologies incl. Apache Iceberg - Experience with cloud databases (e.g., Snowflake, Big Query) - Understanding concepts of data modeling - Working knowledge of open-source tools such as AWS lambda, Prometheus - Experience coding in Java or Python,
Posted 2 weeks ago
4.0 - 8.0 years
0 Lacs
chennai, tamil nadu
On-site
The role available is within the Services - Client & Services Assurance group, which is comprised of various core streams, with direct oversight falling under the BCC - Orion stream. As an App Support Senior Analyst, you are expected to possess strong techno-functional skills. Your primary responsibilities include investigating alerts to pinpoint root causes through in-depth analysis of application flows, ensuring clear communication with stakeholders regarding system and business service availability, and managing Major Incidents to coordinate responses and escalations to maintain the flow of TTS Services to client-facing teams. Additionally, you will be tasked with identifying and eliminating operational inefficiencies from day-to-day support tasks, leading initiatives to enhance service efficiency and stability, and acting as a bridge between Business and Technology teams to expedite incident resolutions and provide a business-oriented perspective on remediation strategies. To excel in this role, you should hold a Technology academic degree with 4-6 years of relevant work experience. Proficiency in database management tools such as Oracle, Mongo DB, and Sybase, familiarity with application monitoring platforms like Grafana, ITRS Geneos, and AppDynamics, and a basic understanding of webserver technologies (WebLogic or WebSphere) are essential. Moreover, knowledge of Middleware technologies such as MQ, Kafka, and experience with ITIL tools like ServiceNow (ITIL Foundation Certification preferred) will be beneficial. Strong troubleshooting and problem-solving abilities coupled with effective verbal and written communication skills are crucial for success in this position. A foundational grasp of Observability, Site Reliability Engineering (SRE), and Open Telemetry Principles will also be advantageous. This is a full-time role within the Technology job family group, specifically categorized under Applications Support. If you require any accommodations due to a disability to utilize our search tools or apply for this position, please refer to the Accessibility at Citi guidelines. For further details on Citis EEO Policy Statement and your rights, please review the respective documents.,
Posted 2 weeks ago
10.0 - 14.0 years
0 Lacs
karnataka
On-site
The Walmart Enterprise Business Services (EBS) is comprised of exceptional teams delivering top-tier technology solutions and services that have a significant impact on various levels within Walmart. As a Principal Engineer, you will play a crucial role in shaping the engineering direction by leveraging your deep full stack expertise and extensive background in cloud-native development, specifically on Google Cloud Platform (GCP). Your responsibilities will include leading the design and development of high scalability and resilient end-to-end full stack applications, architecting complex cloud-native systems utilizing a wide array of GCP services, setting technical direction, defining best practices, and ensuring engineering excellence across teams. Additionally, you will be responsible for building and optimizing APIs and frontend frameworks, guiding the adoption of serverless and container-based architectures, championing CI/CD pipelines and Infrastructure as Code, driving code quality through design reviews and automated testing, collaborating with cross-functional teams, and staying updated on industry trends and evaluating new tools and frameworks for enhanced productivity and performance. You should possess a minimum of 10 years of experience in full stack development, with at least 2 years in a technical leadership or principal engineering role. Deep proficiency in JavaScript/TypeScript, Python, or Go is required, along with strong experience in modern frontend frameworks, particularly React. Expertise in designing and operating cloud-native systems on GCP, proficiency in microservices architecture, Docker, Kubernetes, and event-driven systems, extensive experience in CI/CD, DevOps practices, managing production-grade cloud systems, familiarity with both SQL and NoSQL databases, exceptional communication, leadership, and collaboration skills, GCP Professional Certification, experience with serverless platforms and observability tools, as well as exposure to large-scale data processing pipelines or ML workflows on GCP are also essential for this role. Joining Walmart Global Tech means being part of a team that makes a significant impact in the retail industry. Embrace the opportunity to work with a diverse group of professionals and experts, where your contributions can influence millions of people worldwide. As a part of Walmart Global Tech, you will have the chance to innovate, grow, and shape the future of retail while being supported by a culture that values every individual and fosters a sense of belonging for all associates, customers, and suppliers. At Walmart, we are committed to creating an inclusive workplace where everyone feels valued and respected. We believe that diversity and inclusion are key to our success and aim to provide equal opportunities for all associates, customers, and communities we serve. Join us in our mission to be a workplace where everyone is included, and everyone wins.,
Posted 2 weeks ago
2.0 - 5.0 years
15 - 20 Lacs
Pune
Hybrid
Monitor production systems & services using observability tools (logs, metrics, traces, dashboards, Respond to incidents Design, implement & maintain observability solutions (eg Prometheus, Grafana, ELK) Technical Operations & Continuous Improvement Required Candidate profile Must have* Exp in Azure services with AWS Hands on with (IaC) tools such as Terraform Scripting skills in Python/Bash/PowerShell Familiarity with Gitlab CI/CD tools Notice Period - 1 month or less
Posted 2 weeks ago
7.0 - 12.0 years
25 - 32 Lacs
Pune
Work from Office
Hi, Wishes from GSN!!! Pleasure connecting with you!!! We been into Corporate Search Services for Identifying & Bringing in Stellar Talented Professionals for our reputed IT / Non-IT clients in India. We have been successfully providing results to various potential needs of our clients for the last 20 years. Who are we looking for? Skilled IT Operations Consultant specializing in Monitoring and Observability to design, implement and optimize monitoring solutions for our customers. Strong background in monitoring, observability and IT service management is MUST . 1. WORK LOCATION : PUNE 2. Job Role: LEAD ENGINEER 3. EXPERIENCE : 7+ yrs 4. CTC Range: Rs. 25 LPA to Rs. 30 LPA 5. Work Type : WFO ****** Looking for SHORT JOINERS ****** Job Description : Required Skills : Strong understanding of infrastructure and platform development principles and experience with programming languages such as Python, Ansible for developing custom scripts . Strong knowledge of monitoring frameworks, logging systems (ELK stack, Fluentd), and tracing tools (Jaeger, Zipkin) along with the OpenSource solutions like Prometheus, Grafana. Extensive EXP with monitoring and observability solutions such as OpsRamp, Dynatrace, New Relic , must have worked with ITSM integration (e.g. integration with ServiceNow, BMC remedy etc.) Working EXP with RESTful APIs and understanding of API integration with the monitoring tools . Knowledge of ITIL processes and Service Management frameworks . Familiarity with security monitoring and compliance requirements. Familiarity with AIOps and Machine Learning techniques for anomaly detection and incident prediction. Excellent analytical and problem-solving skills, ability to debug and troubleshoot complex automation issues Roles & Responsibilities : Design end-to-end monitoring and observability solutions to provide comprehensive visibility into infrastructure, applications and networks. Implement monitoring tools and frameworks (e.g., Prometheus, Grafana, OpsRamp, Dynatrace, New Relic) to track key performance indicators and system health metrics. Integration of monitoring and observability solutions with IT Service Management Tools. Develop and deploy dashboards and reports to proactively identify and address system performance issues. Architect scalable observability solutions to support hybrid and multi-cloud environments. Collaborate with infrastructure, development and DevOps teams to ensure seamless integration of monitoring systems into CI/CD pipelines. Continuously optimize monitoring configurations and thresholds to minimize noise and improve incident detection accuracy. Utilize AIOps and machine learning capabilities for intelligent incident management and predictive analytics. Work closely with business stakeholders to define monitoring requirements and success metrics. Document monitoring architectures, configurations and operational procedures. ****** Looking for SHORT JOINERS ****** If interested, dont hesitate to click APPLY for IMMEDIATE response. Best Wishes, GSN HR | Google review : https://g.co/kgs/UAsF9W
Posted 3 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough