Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
12.0 - 14.0 years
10 - 16 Lacs
Hyderabad, Pune
Hybrid
We are hiring Senior DevOps Engineers for a 6-month contractual role with a leading MNC. The selected candidate will be working on the MNC's project but will be on third-party payroll (our company). If you're experienced in AWS, Kubernetes (EKS), Terraform, and CI/CD, and looking to work on enterprise-scale infrastructure, this is a great opportunity to contribute to high-impact projects. Job Type: Contractual (6 Months) Extendable based on performance Payroll Type:Third-Party Payroll You will be on our companys payroll and deployed at the client site (MNC) Roles and Responsibilities : Manage cloud infrastructure and Kubernetes (EKS) clusters on AWS. Perform EKS upgrades and handle pod-level troubleshooting. Build and maintain infrastructure automation using Terraform. Maintain CI/CD pipelines and automate DevOps workflows. Work with tools like Ansible, Chef, or Puppet for configuration management. Monitor systems using tools like ELK, Splunk, and application monitoring stacks. Collaborate with teams for incident response and system optimization. Desired Candidate Profile: 12+ years of DevOps experience in cloud-native environments. Hands-on experience with AWS, Kubernetes, and EKS management. Proficiency in Terraform and Infrastructure as Code (IaC). Experience with Linux environments and scripting (Shell/Python). Good knowledge of monitoring and logging tools. AWS and Kubernetes certifications preferred. Strong analytical and troubleshooting skills.
Posted 1 month ago
6.0 - 11.0 years
9 - 19 Lacs
Chennai
Remote
Cluster Management and Maintenance Design and Implement Elasticsearch Clusters: Create and configure Elasticsearch clusters to meet specific performance and availability requirements. Monitor Cluster Health: Use monitoring tools to track cluster health, performance metrics, and error logs, ensuring optimal operation. Deep Understanding of Elasticsearch: Expertise in the architecture, configuration, and management of Elasticsearch clusters. Knowledge of JSON and REST APIs: Familiarity with JSON data format and RESTful API principles is crucial for interacting with Elasticsearch. Data Modeling and Indexing Techniques: Ability to design efficient data models and understand how to structure data for optimal indexing and retrieval. Familiarity with Elasticsearch Ecosystem: Knowledge of related tools like Kibana, Logstash, and Beats enhances the engineers ability to deliver complete solutions. Basic Programming Skills: Proficiency in programming languages such as Python, Java, or Go is beneficial for automation and customization tasks. Data Indexing and Query Optimization Develop Indexing Strategies: Determine how data should be indexed to maximize search efficiency and minimize latency. Optimize Queries: Analyze and refine search queries to improve response times and resource utilization. Data Integration and Transformation Integrate Data Sources: Connect Elasticsearch with various data sources, such as databases and logging frameworks, using tools like Logstash or Beats. Transform Data for Indexing: Preprocess and transform data to meet Elasticsearch indexing requirements. Collaboration and Support Work with Development Teams: Collaborate with software engineers to implement search features and improve user experiences. Provide Technical Support: Troubleshoot and resolve issues related to Elasticsearch performance, data integrity, and availability.
Posted 1 month ago
2.0 - 5.0 years
10 - 20 Lacs
Bengaluru
Work from Office
Experience : 2+ years Expected Notice Period : 15 Days Shift : (GMT+05:30) Asia/Kolkata (IST) Must have skills required: Bash, Dynatrace, ELK, Grafana, Prometheus, Terraform, AWS, Kubernetes, ???Linux, Python Job Overview We are looking for a Site Reliability Engineer (SRE) with 2.5 to 5 years of experience to join our team. The ideal candidate will be responsible for ensuring the availability, scalability, and reliability of our distributed systems, improving observability, automating infrastructure, and enhancing system performance. This role provides an opportunity to work on high-scale, mission-critical environments and contribute to building a resilient infrastructure. Key Responsibilities Improve observability by implementing and managing monitoring, logging, and alerting solutions using Prometheus, ELK stack, and Grafana. Work with APMs like Dynatrace, New Relic to monitor performance metrics, define SLIs, SLOs, and error budgets. Participate in incident management, including on-call rotation, and Root Cause Analysis (RCA). Automate infrastructure provisioning using Terraform and Infrastructure as Code (IaC) principles. Ensure system scalability, reliability, and performance in a distributed environment. Strengthen security by applying cybersecurity best practices, vulnerability assessments, and compliance policies. Collaborate with cross-functional teams to establish SRE best practices, improve release pipelines, and minimize deployment risks. Maintain and improve disaster recovery plans to enhance resilience. Manage and optimize workflows using Apache Airflow to ensure efficient scheduling and execution of data pipelines. Support Snowflake data operations, ensuring high availability, performance optimization, and security compliance. Qualifications & Certifications Education: Bachelor's degree in Computer Science, Engineering, or related fields. Experience: 2.5 to 5 years of experience in Site Reliability Engineering, Observability, or Performance Monitoring. Hands-on experience in: Monitoring and observability using Prometheus, ELK, Grafana. Application Performance Monitoring (APM) tools like Dynatrace, New Relic, or Datadog. Incident response and on-call rotation management. Infrastructure automation using Terraform. Distributed systems operations and scaling. Load testing and performance analysis using tools like JMeter, k6, or Locust. Security at scale, including vulnerability scanning and compliance automation. Workflow automation and orchestration using Apache Airflow. Experience with Snowflake, including query optimization, data management, and security controls. Technical Skills: Strong knowledge of cloud platforms (AWS preferred). Experience with troubleshooting distributed systems and high-traffic environments. Hands-on knowledge of Linux, networking, and security fundamentals. Familiarity with container orchestration (Kubernetes, Docker). Ability to write automation scripts using Python, Bash, or Go. Preferred Certifications: AWS Certified DevOps Engineer Professional (or equivalent AWS certification). HashiCorp Certified: Terraform Associate. Certified Kubernetes Administrator (CKA). Google SRE Professional Certificate (preferred but not mandatory). Skills Bash, Dynatrace, ELK, Grafana, Prometheus, Terraform, AWS, Kubernetes, ???Linux, Python
Posted 1 month ago
6.0 - 10.0 years
15 - 25 Lacs
Noida, Gurugram, Delhi
Work from Office
Mandatory Skills (Docker and Kubernetes) Should have good understanding of various components of Kubernetes cluster Should have hands on experience of provisioning of Kubernetes cluster Should have expertise on managing and upgradation Kubernetes Cluster / Redhat Openshift platform Should have good experience of Container storage Should have good experience on CICD workflow (Preferable Azure DevOps, Ansible and Jenkin) Should have hands on experience of linux operating system administration Should have understanding of Cloud Infrastructure preferably Vmware Cloud Should have good understanding of application life cycle management on container platform Should have basis understanding of cloud networks and container networks Should have good understanding of Helm and Helm Charts Should be good in performance optimization of container platform Should have good understanding of container monitoring tools like Prometheus, Grafana and ELK Should be able to handle Severity#1 and Severity#2 incidents Good communication skills Should have capability to provide the support Should have analytical and problem-solving capabilities, ability to work with teams Should have experience on 24*7 operation support framework) Should have knowledge of ITIL Process Preferred Skills/Knowledge Container Platforms - Docker, CRI/O, Kubernetes and OpenShift Automation Platforms - Shell Scripts, Ansible, Jenkin Cloud Platforms - GCP/AZURE/OpenStack Operating System - Linux/CentOS/Ubuntu Container Storage and Backup
Posted 1 month ago
2.0 - 6.0 years
4 - 8 Lacs
Bengaluru
Work from Office
React/Redux, HTML5, CSS3, JavaScript, Python, Django and REST APIs. BS or MS in Computer Science or related field. Strong foundation in Computer Science, with deep knowledge of data structures, algorithms, and software design. Experience with GIT, CI/CD tools, Sentry, Atlassian software and AWS CodeDeploy a plus Contribute with ideas to overall product strategy and roadmap. Improve codebase with continuous refactoring. Self-starter to take ownership of the platform engineering and application development. Work on multiple projects simultaneously and get things done. Take products from prototype to production. Collaborate with team in Sunnyvale, CA to lead 24x7 product development. Bonus: If you have worked on one or more below then highlight those projects when applying: Experience with Time Series DB - M3DB, Prometheus, InfluxDB, OpenTSDB, ELK Stack Experience with visualization tools like Tableau, KeplerGL etc. Experience with MQTT or other IoT communication protocols a plus
Posted 1 month ago
4.0 - 5.0 years
10 - 20 Lacs
Bengaluru
Work from Office
We are seeking a highly skilled and experienced DevOps Engineer to join our dynamic team for a 6-month contract. The ideal candidate will focus on infrastructure enhancement, containerization, and collaborative deployment while ensuring system health and mentoring team members. Responsibilities include leading the development and troubleshooting of infrastructure, driving containerization with Kubernetes, EKS, and GKE, monitoring system health, and deploying infrastructure on private cloud platforms. Expertise in Linux systems, CI/CD pipelines, and scripting/programming is essential, along with proficiency in tools like Terraform, Ansible, and Splunk/ELK. A passion for continuous learning, excellent communication skills, and the ability to work in a collaborative environment are critical.
Posted 1 month ago
7.0 - 12.0 years
40 - 45 Lacs
Noida
Hybrid
Expected Notice Period: 30 Days Shift: (GMT+05:30) Asia/Kolkata (IST) Opportunity Type: Hybrid (Noida) What do you need for this opportunity? Must have skills required: GCP, AWS, Docker, Jenkins, Apache, ELK, Jira, PHP, Java, Kafka, Micro services, MySQL Looking for: Responsibilities : Be able to conceptualize and develop prototype quickly Research, design and build highly reliable, available and scalable platforms. Build reusable components as libraries, utilities and services and promote reuse. Work closely with our engineering managers, product managers, strategists and team members to develop Agri-Tech products. Complete ownership of Service/Services that your team is responsible for Designing, developing, and maintaining new and existing code coding standards, best practices and frameworks. Lead by example, mentor andguide team members on everything from structured problem solving to development of best practices Implement continuous deployment to ship code every day, once a day. Attend daily stand-ups and any other meetings schedules Contribute to or lead group discussions and coach junior team members Own large technical deliverables and execute in an exemplary way. Manage tasks using JIRA and communicate status to tech leads and managers. Create and groom Tech specific backlog. Drive technical roadmap of the team in collaboration with Engineering and Product Support production releases and investigate issues, if needed Evangelize emerging technologies/applications or and find the opportunities to integrate them into operations. Coach others on the new technologies Requirements: Substantial experience in building complex and scalable solutions. Experience leading multi-engineer projects and mentoring junior engineers. 7+ years of programming experience with Java including object-oriented design. Strong object oriented design skills, ability to apply design patterns, and an uncanny ability to design intuitive module and class-level interfaces Comprehensive operational experience including, optimisations, deployments and tuning servers like apache/mysql/tomcat/solr Strong in coding, data structures, algorithms and problem solving. Experience designing for performance, scalability, availability and security. Strong desire to build, sense of ownership, urgency, and drive. Expertise in delivering high-quality and innovative applications. Experience in communicating with users, other technical teams, and senior management to collect requirements, describe software product features, product strategy and influence outcomes in technical decision-making. Excellent written communication and verbal agility are strong assets. Quickly adapt to new development environments and changing business requirements. Demonstrated ability to mentor other software developers in all aspects of their engineering skill sets. Track record of building and delivering mission critical, 24x7 production software systems. Performance optimisation knowledge must to have Should have the ability to do the code review of the team. Strong and deep professional experience designing and implementing web applications, especially developing and consuming microservices Experience in using git to manage code bases, branching, merging, etc. Experience in microservices architecture Experience in performance tuning on MySQL, PostgreSQL and MongoDB Skills/Knowledge: Strong collaboration skills Deep expertise with any or a combination of programming languages: Java & PHP, or any object-oriented high-level open source language with strong programming constructs. Outstanding attention to detail and adherence to deadlines; Ability to work effectively, both independently and as a member of a team; Distributed Systems Architecture, components modeling, data flow, Scaling and managing large pieces of data. Articulating system requirements, problem comprehension and identifying high level building blocks Ability to handle multiple tasks in a fast-paced environment; Ability to "think outside the box" while identifying problems and developing creative solutions Should have worked in microservices architecture Experience with release building and deployment software, such as Jenkins, preferred but not required Experience with Docker and Cloud Infra like GCP, AWS etc. Expertise with log analyzing tools like splunk or ELK stack etc... Should have knowledge of Queueing Implementation like Kafka, RabbitMQ or SQS Should have experience in one of cloud environment like AWS or GCP Should be able to write modular and functionally complete object oriented code, NFR implementation, abstractions, separation of concerns, concurrency & thread safety, extensibility, hooks etc.
Posted 1 month ago
0.0 - 3.0 years
3 - 6 Lacs
Bengaluru
Work from Office
Position Summary: We are looking for a highly motivated and enthusiastic Junior DevOps / SRE Engineer to join our growing technology team. As a fresher or early-career professional, you will work alongside experienced engineers to support, automate, and optimize our cloud infrastructure, CI/CD pipelines, and observability systems. Key Responsibilities: Assist in designing, developing, and maintaining CI/CD pipelines using tools like GitHub Actions, Jenkins, GitLab CI, etc. Work with senior team members to deploy, monitor, and troubleshoot cloud-native applications. Support infrastructure provisioning and configuration using tools like Terraform, Ansible, or Helm. Help maintain uptime, reliability, and performance of systems using observability tools (Prometheus, Grafana, ELK, etc.). Write automation scripts and participate in routine maintenance tasks. Support containerization and orchestration using Docker and Kubernetes. Participate in on-call rotations under supervision (after adequate training). Basic Qualifications: Bachelor’s degree in Computer Science, IT, or related technical field (or final year students who are eligible to join full-time). Internship, project, or academic exposure to any of the following DevOps/SRE tools: CI/CD: GitHub Actions, GitLab CI, Jenkins, etc. Cloud Platforms: AWS, GCP, or Azure (basic knowledge or certification is a plus). Containers & Orchestration: Docker, Kubernetes. Infrastructure as Code: Terraform, Ansible, etc. Monitoring & Logging: Prometheus, Grafana, ELK/EFK Stack, etc. Strong understanding of Linux/Unix systems and networking basics. Familiarity with scripting languages such as Bash, Python, or Go Passion for automation, problem-solving, and learning modern DevOps practices. Preferred Qualifications Open-source contributions or personal GitHub projects related to DevOps. Certification in cloud (AWS/GCP/Azure Foundations or DevOps). Familiarity with Agile and DevOps workflows.
Posted 1 month ago
5.0 - 9.0 years
5 - 9 Lacs
Hyderabad
Work from Office
SUMMARY : At Surgical Information Systems (SIS), the DevOps Engineer will manage infrastructure projects and processes. A keen attention to detail, problem-solving abilities, and solid knowledge base are essential. The DevOps Engineer will need to have a high aptitude to learn new technologies and processes and deliver against the overall strategy across a wide variety of development environments including public, private, infrastructure as a service and platform as a service cloud operations. You will design mission critical services with a focus on security, resiliency, scale and performance. You need to have a solid understanding of automation and orchestration principles and be eager to automate wherever and whenever possible. ESSENTIAL DUTIES/ RESPONSIBILITIES: Work with the DevOps team to design and implement build, test, deployment, and configuration management workflows and pipelines Work with Security team to implement DevSecOps where needed Build and test automation tools for infrastructure provisioning Handle code deployments in all environments Monitor metrics and develop ways to improve insights into pipeline, software and environment performance Test implementation designs and consult with peers for feedback during testing stages Build, maintain, and monitor configuration standards Contribute to day-to-day management and administration of projects Creating, Customizing and Managing CI\CD pipelines Document and design various processes; update existing processes Improve infrastructure development and application development Assist in troubleshooting and root cause failure analysis for product enhancement Follow all best practices and procedures as established by SIS EDUCATION DESIRED: B.E/B.Tech/MCA/Any graduate SPECIFIC KNOWLEDGE & SKILLS REQUIRED: 3+ years experience in development and operations, or related IT, computer, or operations fields Previous experience with software development, infrastructure development, or development and operations Experience with Windows infrastructures, databases (MS SQL), CI/CD tools, scripting: Experience with automation tools (Ansible, Puppet, Chef, Python, Jenkins, Terraform, Azure DevOps Pipelines) Monitoring tools (Spluk, ELK, Nagios) At least 2 years' experience with Powershell script writing is required Containerization Technologies (Docker, Kubernetes, Rancher) Public cloud experience, preferably Azure Good interpersonal skills and communication with all levels of management Able to multitask, prioritize, and manage time efficiently High level technical aptitude and the ability to problem solve in a logical manner. Ability to work effectively in a team environment. SUPERVISORY RESPONSIBILITIES: None. PHYSICAL REQUIREMENTS: Requires ability to use a telephone Requires ability to use a computer Most of work will be spent in a seated, climate-controlled office
Posted 1 month ago
10.0 - 15.0 years
20 - 35 Lacs
Gurugram
Work from Office
Key Responsibilities Design and implement scalable, secure, and highly available infrastructure solutions. Architect and maintain CI/CD pipelines for efficient, reliable software delivery. Drive adoption of DevOps tools, practices, and automation across engineering teams. Lead cloud infrastructure strategy (AWS/GCP/Azure), cost optimization, and security controls. Implement Infrastructure as Code (IaC) using tools like Terraform, Cloud Formation, or Pulumi. Ensure monitoring, alerting, and observability best practices (Prometheus, ELK, Datadog, etc.). Guide container orchestration using Docker, Kubernetes, EKS, or AKS. Collaborate with development, QA, and security teams to ensure high-quality delivery. Mentor and support a team of DevOps engineers and promote a DevSecOps culture. Participate in architecture discussions and contribute to system design decisions. Required Skills & Qualifications 10+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering. Expertise in at least one cloud platform (AWS, Azure, or GCP). Strong hands-on experience with CI/CD tools (Jenkins, GitLab CI/CD, CircleCI, etc.). Proficiency with Docker, Kubernetes, Helm. Infrastructure as Code (Terraform, Cloud Formation, Ansible). Solid scripting skills in Bash, Python, or Go. Deep understanding of networking, security, and cloud architecture patterns. Strong knowledge of monitoring/logging tools like Prometheus, Grafana, ELK, or Splunk. Experience with version control systems (Git, GitHub, Bitbucket). Excellent problem-solving and communication skills.
Posted 1 month ago
3.0 - 6.0 years
10 - 20 Lacs
Gurugram
Work from Office
The Site Reliability Team is responsible for monitoring all aspects of MakeMyTrip including production servers and services. You will be acting as first line of defense against any kind of service unavailability or performance of our production services 24 x 7 x 365. You will be frequently interacting with various groups within organization like Engineering, Sales & Products and hence need to develop a good all-round understanding of components, systems and networks is must. Diligence and attention to detail are also key skills along with an ability to multi-task and prioritize work appropriately. We don't expect you to have all the required knowledge when you join us, as many of these skills can be picked up through experience in the job, however those who want to gain new skills and grow must be prepared to spend time in doing suitable research and learning. You must be eager and quick learner with decent communication skills and must be able to use their initiative to tackle a broad range of problems. Prime Responsibilities: - Regularly examine multiple monitoring systems for unexpected deviations in any of application layers. - React to alerts with well-defined procedures, escalate problems to the appropriate people, follow up till resolution and finally incident reporting. - Setup/Monitor alerts on OPS tools and monitoring applications like Zabbix, Grafana, ELK stack. - Create shell/Python script-based reports & CRON scheduling to support periodic reports. - Adhere to defined process and be ready for some adhoc and surprise incidents - Help your coworkers by creating documentation and detailed knowledge sharing for continuous improvement. - Communications skills and clearness in reporting and communication. - Troubleshooting Live site production issue by co-relating different components. - Day-to-day maintenance of the application systems in operation, including tasks related to identifying and troubleshooting application issues and issues resolution or escalation. Desired Skills: - 3-6 years of relevant experience in 24x7 AWS Cloud based Linux production environment. - Ability to monitor diverse architecture, troubleshoot problems, analyze impact and escalation - Willing to work in precise schedules, night shifts & weekends to support our 24x7 systems on rotational basis. - Basic Linux command skills is must & experience in any scripting language (Shell/Python) is plus. - Basic Knowledge of Web/Internet concepts i.e. DNS, Common Protocols, Ports, Cookies, Firebug. - Hands on experience in L2 debugging like finding errors/exceptions in logs. - Basic Knowledge of SQL queries - Work well in a busy team, being quick to learn and able to deal with a wide range of issues - Prior experience in ELK, Zabbix or Grafana would be added advantage. - Knowledge of AWS Cloud environment is huge plus.
Posted 1 month ago
4.0 - 6.0 years
17 - 22 Lacs
Gurugram
Work from Office
Job Description Summary: We are looking for 4-6 years of experienced highly skilled, solution focused Senior DevOps engineer who is passionate about increasing system development and business agility. This may involve software configuration management and deployments, automation, Infrastructure as Code (IAC) along with experience on diverse platforms like Windows Desktop, Linux, web, mobile etc. and supporting various delivery teams. You will be part of the DevOps team responsible for installing, configuring, automating, and maintaining development and testing environment based on needs of Design & Development teams and the accredited Test Facilities. We put strong emphasis on individual ownership and value people who take pride in working over the full lifecycle of a project. Personal Skills: Positive attitude, Self-starter, self-motivated. Good communication skills to report program status crisply and accurately. Must have strong analytical and creative problem-solving skills. Ability to plan activities meticulously, identifying dependencies and proactively work towards resolution. Capable of taking responsibility for tasks and ensuring a successful outcome. Independently determines methods and procedures on new or special assignments. Demonstrates an extremely high level of accuracy and attention to detail. Ability to work independently as well as team oriented. Good blend of technical skills and business acumen. Comfortable in collaborating with team-mates working from around the globe. Roles and Responsibilities Strong experience in DevOps and CI/CD implementation. At least 3+ years of working experience with Docker, Kubernetes and Helm Charts with understanding of microservice design and architectural patterns. Proficient in Linux. At least 3+ years of working experience with infrastructure configuration management tool Ansible. Must have experience with Jenkins, management and extensions with other CI platforms and tools. Responsible for enabling teams through automation using Jenkins pipelines. At least 3+ years of working experience in deployment of PostgreSQL, Redis, ELK. Hands-on with Google Cloud Platform & AWS. Plan, Configure, Deploy and Operate a cloud solution. At least 3+ years of experience with automation of infrastructure and application deployment on GCP and AWS. Ensure Performance standards by configuring Auto-Scaling Solution to meet varying Load requirements. Configure access and security with experience on networking principles and protocols such as IP subnetting, routing, firewall rules, Virtual Private Cloud, Load Balancer, Cloud DNS, Cloud CDN, etc. Good understanding of cloud design considerations and limitations and its impact on Pricing. Deliver Proof of Concepts for new Solutions on Cloud. At least 3+ years of experience with Terraform & Packer. Prior experience of working with version control systems (GitHub). Interact with Development, Test and customer success teams to understand, develop and support product deployment strategy. Troubleshoot issues, isolating build/deployment issues due to code issues. Experience with deployment of .Net Core Applications.
Posted 1 month ago
3.0 - 8.0 years
3 - 7 Lacs
Noida
Hybrid
Job Title: DevOps Engineer (Kubernetes & Terraform) Location: Noida Experience: 3 to 8 years Type: Full-time About the Role: We are looking for a DevOps Engineer with 38 years of experience who specializes in Kubernetes and Terraform. This role is ideal for someone passionate about automation, infrastructure scalability, and cloud-native technologies. You will be responsible for designing and maintaining infrastructure platforms that support continuous delivery and scalability across our development and production environments. Key Responsibilities: Design, deploy, and manage scalable and secure Kubernetes clusters in production. Develop and manage Infrastructure as Code (IaC) using Terraform to provision cloud infrastructure. Build and maintain CI/CD pipelines to automate build, test, and deployment workflows. Ensure system availability, performance, and security across all environments. Work closely with development and QA teams to enable efficient DevOps practices. Automate system provisioning, configuration, and application deployments. Monitor infrastructure using tools like Prometheus, Grafana, ELK, or similar. Implement security best practices in container orchestration and infrastructure management. Must-Have Qualifications: 3-8 years of experience in DevOps, SRE, or infrastructure engineering roles. Hands-on experience with Kubernetes (deployment patterns, Helm, RBAC, ingress controllers, etc.). Proficiency in Terraform, including module creation and state management. Strong background in at least one public cloud provider (AWS, Azure, or GCP). Experience with CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, or ArgoCD. Solid Linux administration skills. Experience with containerization using Docker. Scripting skills in Bash, Python, or Go. What You'll Get: Competitive compensation and benefits. Exposure to cutting-edge DevOps tools and practices. A collaborative, remote-friendly engineering culture. Opportunities for upskilling and certifications. Involvement in end-to-end infrastructure design and decisions.
Posted 1 month ago
15.0 - 20.0 years
45 - 60 Lacs
Mumbai
Work from Office
This position is for Site reliability Engineer within Client Engagement and Protection APS team. The primary purpose is to be accountable for all core engineering / transformation activities of ISPL Transversal CEP APS Responsibilities Direct Responsibilities Automate away toil using a combination of scripting, tooling, and process improvements Drive transformation strategies involving infrastructure hygiene / end of life Implementing new technologies or processes to improve efficiency and reduce costs eg:- CI/CD implementation Monitoring system performance and capacity levels to ensure high availability of applications with minimal downtime Investigating any service disruptions or other service issues to identify their causes Performing regular audits of computer systems to check for signs of degradation or malfunction Developing and implementing new methods of measuring service quality and customer satisfaction Conducting capacity planning to ensure that new technologies can be accommodated without impacting existing users Conducting post-mortem examinations of failed systems to identify and address root cause Drive various Automation, Monitoring & Tooling common purpose initiatives across CEP APS and other teams within CIB APS Accountable for generation, reporting and improvements of various Production KPIs, SLs and dashboards for APS teams Accountable for improvements in service and presentations for all governances and steering committees Accountable for maintenance and improvement of IT continuity plans (ICP) Contributing Responsibilities Technical & Behavioral Competencies Strong knowledge of DevOps methodology and toolsets Strong knowledge of Cloud based applications/services Strong knowledge of APM Tools i.e. Dynatrace / AppDynamics Strong Distributed Computing and Database technologies skillset Strong knowledge of Jenkin, Ansible, Python, Scripting etc. Good understanding of Log aggregators i.e. Splunk/ELK Good understanding of observability tools i.e. Grafana / Prometheus Ability to work with various APS, Development, Operations stakeholders, locally and globally Dynamic, proactive and teamwork oriented Independent, self-starter and fast learner Good communication and interpersonal skills Practical knowledge of change, incident & problem management tools Innovative and transformational mindset Flexible attitude Ability to perform under pressure Strong analytical skills Preferred to have ITIL Dockers/Kubernetes Prior knowledge on Site Reliability Engineering / Dev-Ops / Application Production Support / Development background Specific Qualifications (if required) Graduate in any discipline or Bachelor in Information Technology 15 of IT experience Skills Referential Behavioural Skills : (Please select up to 4 skills) Ability to collaborate / Teamwork Creativity & Innovation / Problem solving Ability to deliver / Results driven Communication skills - oral & written Transversal Skills: (Please select up to 5 skills) Ability to manage a project Ability to set up relevant performance indicators Ability to anticipate business / strategic evolution Ability to develop and adapt a process Analytical Ability Education Level: Bachelor Degree or equivalent Experience Level At least 15 years
Posted 1 month ago
3.0 - 8.0 years
15 - 30 Lacs
Bengaluru
Remote
Hiring for USA based big Multinational Company (MNC) The Cloud Engineer is responsible for designing, implementing, and managing cloud-based infrastructure and services. This role involves working with cloud platforms such as AWS, Microsoft Azure, or Google Cloud to ensure scalable, secure, and efficient cloud environments that meet the needs of the organization. Design, deploy, and manage cloud infrastructure in AWS, Azure, GCP, or hybrid environments. Automate cloud infrastructure provisioning and configuration using tools like Terraform, Ansible, or CloudFormation. Ensure cloud systems are secure, scalable, and reliable through best practices in architecture and monitoring. Work closely with development, operations, and security teams to support cloud-native applications and services. Monitor system performance and troubleshoot issues to ensure availability and reliability. Manage CI/CD pipelines and assist in DevOps practices to streamline software delivery. Implement and maintain disaster recovery and backup procedures. Optimize cloud costs and manage billing/reporting for cloud resources. Ensure compliance with data security standards and regulatory requirements. Stay current with new cloud technologies and make recommendations for continuous improvement. Bachelors degree in Computer Science, Information Technology, Engineering, or a related field. 3+ years of experience working with cloud platforms such as AWS, Azure, or Google Cloud. Proficiency in infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Experience with CI/CD tools (e.g., Jenkins, GitLab CI, Azure DevOps). Familiarity with containerization and orchestration (e.g., Docker, Kubernetes). Strong scripting skills (e.g., Python, Bash, PowerShell). Solid understanding of networking, security, and identity management in the cloud. Excellent problem-solving and communication skills. Ability to work independently and as part of a collaborative team.
Posted 1 month ago
4.0 - 8.0 years
14 - 19 Lacs
Hyderabad
Work from Office
WHAT YOU DO AT AMD CHANGES EVERYTHING. We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.. AMD together we advance_. SMTS SOFTWARE DEVELOPMENT ENGINEER. The Role. Ideal candidate should have 12+ years of experience in technical roles involving tool development with some focus in areas of regression management like scheduling, executing harness, failure analysis & assignment and reporting. Also should have hands-on experience in working with DevOps env tools including cloud, databases & AI/ML technologies.. The Person. The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.. Key Responsibilities. Drive and improve implementation of test infrastructure to provide robust test environment to developers & testers. Analyze, optimize & improve current architecture of various components of test Infra tools including integration. Implement containerized test infra with full on-prem/cloud portability Compatible with various job management systems like LSF, SLURM, and Kubernetes orchestration framework. Drive & Implement strategies to leverage AI/ML models at various stages of testinfra. Drive & Implement Next Gen QOR (Quality of results for FPGA designs) regression execution & reporting. Overseeing and providing development support to team members.. Collaborating with cross-functional teams, providing technical support, and troubleshooting migration to next generation tools, CI/CD, containerization, Kubernetes, and cloud tooling issues.. Creating comprehensive documentation, mentoring junior team members, and conducting training sessions. Preferred Experience. Professional 12+yrs of technical experience and at least 5 years' experience in design & implementation of product developments. Experience in developing tools around test Infra automation. Expert in structured & OOP in Python. Proficiency in Scripting and automation languages (e.g., Python, Bash, Csh,..). Linux & Windows shells working environment. Understanding of AI/ML principles and some experience in applying LLM & ML models in tool development. Experience with working in DevOps environment like GitHub, Perforce version control systems, containerization technologies like Docker, Kubernetes orchestration and CI/CD pipelines using Jenkins or Github actions. Additionally, experience with monitoring and logging tools for containerized environments (e.g., Prometheus, Grafana, ELK Stack). Excellent problem-solving abilities with a keen eye for detail are highly valued. Academic Credentials. Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. Benefits offered are described: AMD benefits at a glance.. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.. Show more Show less
Posted 1 month ago
1.0 - 3.0 years
5 - 9 Lacs
Pune
Work from Office
Join us as a Monitoring & Observability Engineer at Barclays, where you'll take part in the evolution of our digital landscape, driving innovation and excellence. You'll harness cutting-edge technology to revolutionise our digital offerings, ensuring unparalleled customer experiences. As a part of the team, you will deliver technology stack, using strong analytical and problem solving skills to understand the business requirements and deliver quality solutions. You'll be working on complex technical problems that will involve detailed analytical skills and analysis. This will be done in conjunction with fellow engineers, business analysts and business stakeholders.. To be successful as a Monitoring & Observability Engineer you should have experience with:. Proficiency in managing AppD, Grafana, ELK stack, AppD, ITRS,Netcool monitoring tools. Strong knowledge and experience in Windows and UNIX/LINUX/Windows platforms.. Proficiency in collecting, processing, and analyzing various telemetry signals (metrics, logs, traces, events).. Solid understanding of distributed systems, cloud platforms (AWS, Azure, GCP), containerization (Docker, Kubernetes), and microservices architectures.. Experience with scripting languages (Python, Bash, PowerShell) for automation and data manipulation.. Familiarity with database monitoring (SQL, NoSQL).. Knowledge of networking concepts and protocols.. Exceptional problem-solving abilities with a systematic approach to diagnosing complex technical issues.. Strong analytical skills to interpret data, identify patterns, and draw actionable conclusions.. Curiosity and a proactive mindset to anticipate and prevent issues.. Excellent written and verbal communication skills to explain technical concepts to both technical and non-technical stakeholders.. Ability to collaborate effectively with cross-functional teams.. Strong interpersonal skills for mentoring and knowledge sharing.. Strong Work experience in IT operations, monitoring, Site Reliability Engineering (SRE), or a dedicated observability role.. Hands-on experience with implementing and managing observability solutions in an enterprise environment.. Some Other Highly Valued Skills Include. Experience in Chef automation is preferred. Good understanding of Ansible playbook. Exposure to governance, control and risk management policies. Understanding of ITIL – Event/Incident processes. You may be assessed on key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking and digital and technology, as well as job-specific technical skills.. This role is based in Pune.. Purpose of the role. To design, develop and improve software, utilising various engineering methodologies, that provides business, platform, and technology capabilities for our customers and colleagues.. Accountabilities. Development and delivery of high-quality software solutions by using industry aligned programming languages, frameworks, and tools. Ensuring that code is scalable, maintainable, and optimized for performance.. Cross-functional collaboration with product managers, designers, and other engineers to define software requirements, devise solution strategies, and ensure seamless integration and alignment with business objectives.. Collaboration with peers, participate in code reviews, and promote a culture of code quality and knowledge sharing.. Stay informed of industry technology trends and innovations and actively contribute to the organization’s technology communities to foster a culture of technical excellence and growth.. Adherence to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions.. Implementation of effective unit testing practices to ensure proper code design, readability, and reliability.. Assistant Vice President Expectations. To advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Collaborate closely with other functions/ business divisions.. Lead a team performing complex tasks, using well developed professional knowledge and skills to deliver on work that impacts the whole business function. Set objectives and coach employees in pursuit of those objectives, appraisal of performance relative to objectives and determination of reward outcomes. If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others.. OR for an individual contributor, they will lead collaborative assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialisation to complete assignments. They will identify new directions for assignments and/ or projects, identifying a combination of cross functional methodologies or practices to meet required outcomes.. Consult on complex issues; providing advice to People Leaders to support the resolution of escalated issues.. Identify ways to mitigate risk and developing new policies/procedures in support of the control and governance agenda.. Take ownership for managing risk and strengthening controls in relation to the work done.. Perform work that is closely related to that of other areas, which requires understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.. Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategy.. Engage in complex analysis of data from multiple sources of information, internal and external sources such as procedures and practises (in other areas, teams, companies, etc).to solve problems creatively and effectively.. Communicate complex information. 'Complex' information could include sensitive information or information that is difficult to communicate because of its content or its audience.. Influence or convince stakeholders to achieve outcomes.. All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.. Show more Show less
Posted 1 month ago
3.0 - 7.0 years
11 - 16 Lacs
Noida
Work from Office
About Aeris:. The Internet of Things (IoT) will unlock trillions of dollars in value over the next 10 years as 50 billion devices are brought online. Aeris is at the forefront of this industry, building networks and applications to enable Fortune 500 clients like Chrysler, Honda and Bosch fundamentally improve their businesses. Headquartered in Silicon Valley with offices in Bucharest, Chicago, London, Delhi, Bangalore, Helsinki, and Tokyo as well as other markets. We rank among the top ten cellular providers for the IoT globally, powering critical projects across energy, transportation, retail, healthcare and more.. Built from the ground up for IoT and road-tested at scale, Aeris IoT Services are based on the broadest technology stack in the industry, spanning connectivity up to vertical solutions. As veterans of the industry, we know that implementing an IoT solution can be complex, and we pride ourselves on making it simpler. Our company is in an enviable spot. We’re profitable, and both our bottom line and our global reach are growing rapidly. We’re playing in an exploding market where technology evolves daily and new IoT solutions and platforms are being created at a fast-pace.. A few things to know about us:. We put our customers first. When making decisions, we always seek to do what is right for our customer first, our company second, our teams third, and individual selves last.. We do things differently. As a pioneer in a highly-competitive industry that is poised to reshape every sector of the global economy, we cannot fall back on old models. Rather, we must chart our own path and strive to out-innovate, out-learn, out-maneuver and out-pace the competition on the way.. We walk the walk on diversity. We’re a brilliant and eclectic mix of ethnicities, religions, industry experiences, sexual orientations, generations and more – and that’s by design. We see diverse perspectives as a core competitive advantage.. Integrity is essential. We believe in doing things well – and doing them right. Integrity is a core value here: you’ll see it embodied in our staff, our management approach and growing social impact work (we have a VP devoted to it). You’ll also see it embodied in the way we manage people and our HR issues: we expect employees and managers to deal with issues directly, immediately and with the utmost respect for each other and for the Company.. We are owners. Strong managers enable and empower their teams to figure out how to solve problems. You will be no exception, and will have the ownership, accountability and autonomy needed to be truly creative.. Aeris is looking for experienced and highly motivated CloudOps Engineer to join our team. The ideal candidate will have extensive knowledge and hands-on experience with Cloud Platform (GCP/AWS/Azure) and cloud operations. As a CloudOps Engineer, you will be responsible for managing, optimizing, and ensuring the reliability of our cloud infrastructure. You will work closely with development, operations, and DevOps teams to implement best practices and improve system performance and availability.. Key Responsibilities. Cloud Infrastructure Management. Access Control: Implementing Cloud IAM, service accounts, and roles with the principle of least privilege.. Resource Management: Effectively managing projects, quotas, and policies.. Cloud Costs Management: Setting budget alerts and conducting cost analysis and reporting.. Periodic Audit: Performing regular audits to ensure compliance and optimize performance.. Security and Compliance. Security Strategy: Implementing and monitoring a comprehensive security strategy.. Alert and Threat Management: Overseeing Cloud Security Tools and managing the Threat Detection Dashboard.. Risk Assessment: Conducting threat models and risk assessments for cloud environments.. Security Testing: Supporting security testing, including penetration testing and code analysis, and implementing corrective measures.. Impact Analysis: Evaluating business impacts from security threats and vulnerabilities, communicating risks, and collaborating on security initiatives.. Cloud Operations. Automation and Scripting: Developing and maintaining automation scripts using tools like Terraform, Ansible, and custom scripts for provisioning and managing Cloud resources; implementing Infrastructure as Code (IaC) to automate deployment and configuration processes.. Monitoring and Logging: Setting up and configuring tools such as Cloud Native Monitoring Tools, Prometheus, and Grafana for monitoring, logging, and alerting; creating dashboards and reports to analyze system performance and resource utilization.. Incident Management and Troubleshooting: Responding to and resolving cloud infrastructure incidents; conducting root cause analysis and implementing corrective measures to prevent future incidents.. Collaboration and Support: Working closely with development, operations, and DevOps/SRE teams to provide cloud support and guidance; participating in on-call rotations for 24/7 critical infrastructure support.. Documentation and Training: Creating and maintaining comprehensive documentation for cloud infrastructure, processes, and procedures; providing training to team members on Cloud best practices and service usage.. Additionally, it emphasizes understanding:. Networking Principles and Protocols: Including IP subnetting, routing, firewall rules, and various cloud services such as virtual private cloud, load balancers, cloud DNS, and cloud CDN.. Knowledge Expansion: Continuously updating knowledge on cloud PaaS components.. Proof of Concepts: Delivering demonstrations for new cloud solutions.. Hybrid Cloud: Managing integration across multiple platforms like GCP, on-premises systems, AWS, and Azure.. Required Skills and Qualifications. Education: Bachelor’s degree in Computer Science, Information Technology, or a related field.. Experience:. Minimum of 10+ years of experience in IT infrastructure, with at least 5+ years in a CloudOps role.. Proven experience with Cloud (GCP/AWS/Azure) in a production environment.. Strong background in cloud operations, infrastructure management, and automation.. Technical Skills:. Proficiency in GCP/AWS/Azure services such as Compute Engine, Cloud Storage, VPC, Cloud Functions, Cloud Pub/Sub, and BigQuery.. Experience with automation and configuration management tools such as Terraform, Ansible, and Jenkins.. Strong scripting skills in languages such as Python, Bash, or Go.. Experience with monitoring and logging tools such as Stackdriver, Prometheus, Grafana, and ELK stack.. Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).. Soft Skills:. Strong problem-solving and troubleshooting abilities.. Excellent communication and collaboration skills.. Ability to work independently and in a team-oriented environment.. Preferred Qualifications. Google Cloud Professional certifications (e.g., Google Cloud Professional Cloud Architect, Google Cloud Professional DevOps Engineer).. Experience with DevOps practices and CI/CD pipelines.. Knowledge of security best practices and compliance standards.. Familiarity with other cloud platforms such as AWS or Azure.. What is in it for you?. You get to build the next leading edge connected vehicle platform and internet of things platform. The ability to collaborate with our highly skilled groups who work with cutting edge technologies. High visibility as you support the systems that drive our public facing services. Career growth opportunities. Aeris walks the walk on diversity. We’re a brilliant mix of varying ethnicities, religions, cultures, sexual orientations, gender identities, ages and professional/personal/military experiences – and that’s by design. Diverse perspectives are essential to our culture, innovative process and competitive edge. Aeris is proud to be an equal opportunity employer.. Show more Show less
Posted 1 month ago
5.0 - 8.0 years
6 - 10 Lacs
Chennai
Work from Office
hackajob is collaborating with Comcast to connect them with exceptional tech professionals for this role.. Cloud Engineer 3. Location Chennai, India. Job Summary. Responsible for planning and designing new software and web applications. Analyzes, tests and assists with the integration of new applications. Documents all development activity. Assists with training non-technical personnel. Has in-depth experience, knowledge and skills in own discipline. Usually determines own work priorities. Acts as a resource for colleagues with less experience.. Job Description. Core Responsibilities. Job Description. Position: Cloud DevOps Engineer 3. Experience: 5 years to 7 years. Job Location: Chennai Tami Nadu. HR Contact: Ramesh_M2@comcast.com. Technical Skills:. Must Have: Python, Terraform, Docker and Kubernetes, CICD, AWS, Bash, Linux/Unix, Git, DBMS (e.g. MySQL), NoSQL (e.g. MongoDB). Good to have: Ansible, Helm, Prometheus, ELK stack, R, GCP/Azur. Key Responsibilities. Design, build, and maintain efficient, reusable, and reliable code. Work with analysis, operations, and test teams to achieve the best possible outcome within time and budget. Troubleshoot infrastructure issues. Attend cloud engineering meetings. Participate in code reviews and quality assurance activities. Participate in estimation discussions with the product team. Continuously improve knowledge and coding skills. Qualifications & Requirements. Bachelor’s degree in computer science, Engineering, or a related field.. experience in a scripting language (e.g. Bash, Python). 3+ years of hands-on experience with Docker and Kubernetes. 3+ years of hands-on experience with CI tools (e.g. Jenkins, GitLab CI, GitHub Actions, Concourse CI, ...). 2+ years of hands-on experience with CD tools (e.g. ArgoCD, Helm, kustomize). 2+ years of hands-on experience with LINUX/UNIX systems. 2+ years of hands-on experience with cloud providers (e.g. AWS, GCP, Azure). 2+ years of hands-on experience with one IAC framework (e.g. Terraform, Pulumi, Ansible). Basic knowledge of virtualization technologies (e.g. VMware) is a plus. Basic knowledge of one database (MySQL, SQL Server, Couchbase, MongoDB, Redis, ...) is a plus. Basic knowledge of GIT and one Git Provider (e.g. GitLab, GitHub). Basic knowledge of networking. Experience writing technical documentation.. Good Communication & Time Management Skills.. Able to work independently and as part of a team.. Analytical thinking & Problem-Solving Attitude.. Disclaimer. This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications.. Comcast is proud to be an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.. Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That’s why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality to help support you physically, financially and emotionally through the big milestones and in your everyday life.. Comcast brings together the best in media and technology. We drive innovation to create the world's best entertainment and online experiences. As a Fortune 50 leader, we set the pace in a variety of innovative and fascinating businesses and create career opportunities across a wide range of locations and disciplines. We are at the forefront of change and move at an amazing pace, thanks to our remarkable people, who bring cutting-edge products and services to life for millions of customers every day. If you share in our passion for teamwork, our vision to revolutionize industries and our goal to lead the future in media and technology, we want you to fast-forward your career at Comcast.. Education. Bachelor's Degree. While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.. Relevant Work Experience. 5-7 Years. Show more Show less
Posted 1 month ago
20.0 - 24.0 years
30 - 45 Lacs
Bengaluru
Work from Office
Responsibilities: Lead and manage the cloud engineering team to design, develop, and deploy scalable cloud solutions. Collaborate with various teams to understand business needs and translate them into technical requirements. Develop and implement cloud strategies that align with business goals and drive revenue growth. Provide technical leadership and mentorship to engineering teams, ensuring best practices in cloud architecture and deployment. Engage with clients to present cloud solutions, address technical concerns, and demonstrate the value of our offerings. Stay updated with the latest cloud technologies and trends to ensure our solutions remain competitive. Work closely with product management to define and prioritize features and enhancements. Oversee the integration of cloud solutions with existing systems and infrastructure. Ensure compliance with security standards and best practices in cloud deployments. Proven experience in pre-sales, solution, Estimation, and stakeholder engagement. Excellent communication and presentation skills, with the ability to convey complex technical concepts to non-technical stakeholders. Experience with DevOps practices and tools, including CI/CD pipelines, automation, and infrastructure as code. Strong problem-solving skills and the ability to think strategically about technology and business needs. Ability to work in a fast-paced, dynamic environment and manage multiple priorities. Demonstrated success in a leadership role, managing and coaching engineering teams. Roles and Responsibilities Qualifications: Bachelor's or masters degree in computer science, Engineering, or a related field. Proven experience in cloud engineering and architecture, with a strong understanding of AWS cloud platforms. AWS Certified Solutions Architect Technical Skills Required: Proficiency in AWS cloud platforms and their services. Experience with infrastructure as code tools (e.g., Terraform, CloudFormation). Knowledge of containerization technologies (e.g., Docker, Kubernetes). Familiarity with serverless architectures and services (e.g., AWS Lambda, Azure Functions). Understanding of microservices architecture and design patterns. Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Knowledge of networking concepts and technologies (e.g., VPC, VPN, DNS, Load Balancers). Proficiency in scripting and programming languages (e.g., Python, Java, Go). Experience with database technologies (e.g., SQL, NoSQL, PostgreSQL, MongoDB). Understanding of security best practices and compliance standards (e.g., IAM, encryption, GDPR).
Posted 1 month ago
5.0 - 10.0 years
15 - 30 Lacs
Gurugram, Delhi / NCR
Work from Office
Work Environment: This role involves rotational shifts on a weekly basis . Shift allowances will be provided as per company policy. Employees will also have the flexibility to work from home during night shifts to support convenience and continuity. Job Responsibilities: System Monitoring and Incident Management: Monitor the health and performance of critical systems, applications, and services. Respond to incidents, troubleshoot issues, and ensure timely resolution to minimize downtime and service disruptions. Automation and Scripting: Develop and maintain automation scripts and tools to streamline operational tasks, deployment processes, and infrastructure management. Infrastructure Management: Manage and scale the underlying infrastructure, including servers, cloud services, and network components. Implement best practices for configuration management, monitoring, and disaster recovery. Release Management: Collaborate with development teams to ensure smooth and reliable software releases. Participate in the design and implementation of deployment strategies. Performance Optimization: Identify performance bottlenecks and optimize the system to improve reliability and response times. Capacity Planning: Analyze system capacity and plan for future growth to meet increasing demands. Security and Compliance: Implement security best practices and ensure compliance with relevant industry standards and regulations. Collaboration and Documentation: Work closely with cross-functional teams, including developers, product managers, and operations, to ensure efficient communication and knowledge sharing. Document processes, procedures, and troubleshooting guides. On-Call Support: Participate in an on-call rotation to handle urgent issues and incidents outside regular business hours. Qualifications: Experience with Cloud Technologies: Proficiency in working with one or more cloud platforms like AWS, Google Cloud Platform, or Microsoft Azure. Programming and Scripting Skills: Strong knowledge of at least one programming language (e.g., Python, Java,) and experience with shell scripting. System Administration: Linux/Unix system hands on and good to have administration and networking concepts. Monitoring and Logging: Experience with monitoring tools such as Prometheus, Grafana, Nagios, and log management solutions like ELK stack. Infrastructure as Code (IaC): Knowledge of Infrastructure as Code tools like Terraform or CloudFormation. Automation and Configuration Management: Experience with tools like Ansible, Chef, or Puppet for automating infrastructure management. Version Control: Familiarity with version control systems like Git. Problem-Solving Skills: Ability to analyze and troubleshoot complex technical issues and can work with other teams to help and streamline Process. Communication Skills: Strong verbal and written communication skills to collaborate effectively with team members and stakeholders. KPI/Metrics: Understand Key SRE Metrics such as Availability, SLA/SLO, MTTA and MTTR Any hands on individual with BCA/MCA and B.Tech background.
Posted 1 month ago
5.0 - 7.0 years
25 - 40 Lacs
Pune
Work from Office
Our world is transforming, and PTC is leading the way.Our software brings the physical and digital worlds together, enabling companies to improve operations, create better products, and empower people in all aspects of their business. Our people make all the difference in our success. Today, we are a global team of nearly 7,000 and our main objective is to create opportunities for our team members to explore, learn, and grow – all while seeing their ideas come to life and celebrating the differences that make us who we are and the work we do possible. Job Details As a senior SRE / Observability Engineer, you will be part of the Atlas Platform Engineering team and will: Create and maintain observability standards and best practices Review the current observability platform, identify areas for improvement, and guide the team in enhancing monitoring, logging, tracing, and alerting capabilities. Expand the observability stack across multiple clouds, regions, and clusters, managing all observability data. Design and implement monitoring solutions for complex distributed systems to provide deep insights into systems and services aiming at complete visibility of digital operations Supporting the ongoing evaluation of new capabilities in the observability stack, conducting proof of concepts, pilots, and tests to validate their suitability. Assist teams in creating clear, informative, and actionable dashboards to improve system visibility. Automate monitoring and alerting processes, including enrichment strategies and ML-driven anomaly detection where applicable. Provide technical leadership to the observability team with clear priorities ensuring agreed outcomes are achieved in a timely manner. Work closely with R&D and product development teams (understand their requirements and challenges) to ensure seamless visibility into system and service performance. Work closely with the Traffic Management team to identify and standardise on existing and new observability tools as part of a holistic solution Conduct training sessions and create documentation for internal teams Support the definition of SLI (service level indicators) and SLO (service level objectives) for the Atlas services. Keep track of the error budget of each service Participate in the emergency response process Conduct RCAs (root cause analysis) Help to automate repetitive tasks and reduce toil. Qualifications: People and communication qualifications Be a strong team player Have good collaboration and communication skills Ability to translate technical concepts for non-technical audiences Problem-solving and analytical thinking Technical qualifications - general: Familiarity with cloud platforms (Ideally Azure) Familiarity with Kubernetes and Istio as the architecture on which the observability and Atlas services run, and how they integrate and scale. Experience with infrastructure as code and automation Knowledge of common programming languages and debugging techniques Have a strong technical background and be hands on. Linux and scripting languages (Bash, Python, Golang). Significant Understanding of DevOps principles. Technical qualifications - observability Strong understanding of observability principles (metrics, logs, traces) Experience with APM tools and distributed tracing Proficiency in log aggregation and analysis Knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, Prometheus, Grafana, Datadog, New Relic, Sumologic, ELK Stack, or others Knowledge of Open Telemetry, including OTEL collector and code instrumentation Experience designing and building unified observability platforms that enable the use of data (metrics, logs, and traces) to determine quickly if their application or service is operating as desired. Technical qualifications – SRE Understanding of the Google SRE principles Experience in defining SLIs and SLOs Experience in performing RCAs (root cause analysis) Experience in system performance Experience in incident response Knowledge of status tools, such as Atlassian Status Page or similar Knowledge of incident management and paging tools, such as PagerDuty or similar Knowledge of ITIL (Information Technology Infrastructure Library) processes Qualifications: People and communication qualifications • Be a strong team player • Have good collaboration and communication skills • Ability to translate technical concepts for non-technical audiences • Problem-solving and analytical thinking Technical qualifications - general: • Familiarity with cloud platforms (Ideally Azure) • Familiarity with Kubernetes and Istio as the architecture on which the observability platform runs, and how they integrate and scale. • Experience with infrastructure as code and automation • Knowledge of common programming languages and debugging techniques • Have a strong technical background and be hands on. • Linux and scripting languages (Bash, Python, Golang). • Significant Understanding of DevOps principles. Technical qualifications - observability • Strong understanding of observability principles (metrics, logs, traces) • Experience with APM tools and distributed tracing • Proficiency in log aggregation and analysis • Knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, Prometheus, Grafana, Datadog, New Relic, Sumologic, ELK Stack, or others • Knowledge of Open Telemetry, including OTEL collector and code instrumentation • Experience designing and building unified observability platforms that enable the use of data (metrics, logs, and traces) to determine quickly if their application or service is operating as desired. Life at PTC is about more than working with today’s most cutting-edge technologies to transform the physical world. It’s about showing up as you are and working alongside some of today’s most talented industry leaders to transform the world around you. If you share our passion for problem-solving through innovation, you’ll likely become just as passionate about the PTC experience as we are. Are you ready to explore your next career move with us? We respect the privacy rights of individuals and are committed to handling Personal Information responsibly and in accordance with all applicable privacy and data protection laws. Review our Privacy Policy here ."
Posted 1 month ago
4.0 - 9.0 years
7 - 17 Lacs
Bengaluru
Work from Office
About this role: Wells Fargo is seeking a Senior Software Engineer. In this role, you will: Lead moderately complex initiatives and deliverables within technical domain environments Contribute to large scale planning of strategies Design, code, test, debug, and document for projects and programs associated with technology domain, including upgrades and deployments Review moderately complex technical challenges that require an in-depth evaluation of technologies and procedures Resolve moderately complex issues and lead a team to meet existing client needs or potential new clients needs while leveraging solid understanding of the function, policies, procedures, or compliance requirements Collaborate and consult with peers, colleagues, and mid-level managers to resolve technical challenges and achieve goals Lead projects and act as an escalation point, provide guidance and direction to less experienced staff Required Qualifications: 4+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education Desired Qualifications: Deep understanding of the network stack, operating system stack and middleware to be utilized for troubleshooting production incidents Hands on experience with observability practices for MELT (Metrics, Events, Logs, and Traces. Preferred experience with Cisco suite of observability tools as well as OTEL Ability to define and create SLOs (Service Level Objectives) that visualize the customer experience as defined by a products User Journey Experience with deep inspection of log and Automate workflows using AIOps and observability-related technologies, products, and solutions. Familiarity with time series database internals Monitor and fine-tune Splunk search queries, dashboards, and indexing strategies to improve system performance and user experience. Experience with Observability solutions using OpenTelemetry, Prometheus, Grafana Collaborates with stakeholders to define business requirements and ensure alignment with IT strategies. Proficiency in Splunk, Dynatrace, and ELK Stack (Elastic Search, Logstash, Kibana). Strong expertise in Kubernetes and microservice architecture Hands-on experience with observability tools and frameworks. Familiarity with test automation for monitoring frameworks is a plus. Develop and implement end-to-end AIOps strategies and processes. Experience in creating user journey maps that capture the look and feel of a product or service. Create target-state AIOps technical design documents and implementation playbooks for . Build and maintain business insights dashboards, telemetry data collection, and streaming pipelines. Experienced troubleshooting Unix, Linux and cloud environments Strong communication with the ability to communicate on all levels of the organization Demonstrate knowledge/understanding of emerging technologies, industry trends, and outside perspectives, and communicate relevance to the organizations strategic and tactical goals Lead proof of concepts and prototyping Write and present white papers and present in industry conferences Lead execution of critical/complex project deliverables Experience with mentorship by training, documenting, certifying and building the teams skill set Deep understanding of business observability with proven expertise in designing and implementing observability strategies that align to business outcomes. Ability to dynamically engage, attend high impact production incidents, troubleshoot to resolution, and provide immediate incident analysis both written and spoken
Posted 1 month ago
5.0 - 9.0 years
13 - 18 Lacs
Bengaluru
Work from Office
Reference 25000A5L Responsibilities Role Description Responsible for the technical direction of the project and guides the development team, Job Responsibilities - Leading and mentoring a team of software developers, providing technical guidance, feedback and support, Collaborating with stakeholders to understand project requirements and define technical solutions, Implementing best practices, coding standards and quality assurance processes within the team, Participating in code reviews, ensuring adherence to coding standards and best practices, Troubleshooting, resolving technical issues and addressing challenges in project implementation, Serving as a technical expert and advocate for innovation and continuous improvement within the team and company, Collaborate with cross-functional teams, including product managers, designers and QA engineers, to deliver high-quality software products, Required Profile required Technical Skills: Full Stack Backend (Java + Spring Boot and other frameworks, CI/CD Jenkins Configuration as Code etc , Code Quality (Sonar, Jacoco, Checkmarx, TDD, BDD), Cloud Platform (Azure (AKS), Docker, Kubernetes, Helm etc), Must be a self-starter who is comfortable working in a very dynamic environment, with rapidly changing priorities, Rigorous, Dynamic, detail oriented, fast learning capacity and able to work in a high-pressure environment Understanding of development cycles: SDLC, Agile, Continuous delivery ELK Stack, Kafka (Connects and Streams) Why join us We are committed to creating a diverse environment and are proud to be an equal opportunity employer All qualified applicants receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status, Business insight At SocitGnrale, we are convinced that people are drivers of change, and that the world of tomorrow will be shaped by all their initiatives, from the smallest to the most ambitious Whether youre joining us for a period of months, years or your entire career, together we can have a positive impact on the future Creating, daring, innovating, and taking action are part of our DNA If you too want to be directly involved, grow in a stimulating and caring environment, feel useful on a daily basis and develop or strengthen your expertise, you will feel right at home with us! Still hesitating You should know that our employees can dedicate several days per year to solidarity actions during their working hours, including sponsoring people struggling with their orientation or professional integration, participating in the financial education of young apprentices, and sharing their skills with charities There are many ways to get involved, We are committed to support accelerating our Groups ESG strategy by implementing ESG principles in all our activities and policies They are translated in our business activity (ESG assessment, reporting, project management or IT activities), our work environment and in our responsible practices for environment protection, Diversity and Inclusion We are an equal opportunities employer and we are proud to make diversity a strength for our company Societe Generale is committed to recognizing and promoting all talents, regardless of their beliefs, age, disability, parental status, ethnic origin, nationality, gender identity, sexual orientation, membership of a political, religious, trade union or minority organisation, or any other characteristic that could be subject to discrimination,
Posted 1 month ago
5.0 - 10.0 years
6 - 10 Lacs
Bengaluru
Work from Office
As a Senior DevOps Engineer with a strong background in Azure, you will join the Data & AI Solutions - Engineering team in our Healthcare R&D business. Your expertise will enhance cloud-based platforms in our D&A Landscape using AWS Cloud Services and Azure AD, supporting our R&D efforts in drug discovery and development. You will bridge software development, quality assurance, and IT operations, ensuring our platform is reliable, scalable, and automated. Your expertise will contribute to accelerate deployment cycles and minimize downtime. Key responsibilities include deploying new releases, maintaining Azure AD for identity management, and collaborating with cross-functional teams to advocate for DevOps best practices and guide architectural decisions. Join a multicultural team working in agile methodologies with high autonomy. The role requires office presence at our Bangalore location. Who You Are: University degree in Computer Science, Engineering, or a related field 5+ years of experience applying DevOps in solution development & delivery Proficiency in Azure DevOps incl. project configurations, repositories, pipelines, and environments Proficiency in Azure AD incl. apps registration, and authentication flows (OBO, Client Credentials) Good understanding of AWS services and cloud system design Strong experience in observability practices, including logging, monitoring, and tracing using tools like Prometheus, Grafana, ELK Stack, or AWS-native solutions. Proficiency in Infrastructure as Code (IaC) using Terraform and AWS CloudFormation for automated and repeatable cloud infrastructure deployments. Experience with configuration management tools such as Ansible for automating service provisioning, configuration and management. Knowledge of security best practices in DevOps, including secrets management, role-based access control (RBAC), and compliance frameworks Strong scripting and automation skills using Python for developing of automations and integration workflows Willingness to work in a multinational environment and cross-functional teams distributed between US, Europe (mostly, Germany) and India Sense of accountability and ownership, fast learner Fluency in English & excellent communication skills for technical and non-technical stakeholders
Posted 1 month ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39815 Jobs | Dublin
Wipro
19317 Jobs | Bengaluru
Accenture in India
15105 Jobs | Dublin 2
EY
14860 Jobs | London
Uplers
11139 Jobs | Ahmedabad
Amazon
10431 Jobs | Seattle,WA
IBM
9214 Jobs | Armonk
Oracle
9174 Jobs | Redwood City
Accenture services Pvt Ltd
7676 Jobs |
Capgemini
7672 Jobs | Paris,France