3 - 8 years
8 - 18 Lacs
Pune, Mumbai, Delhi
Work from Office
• Minimum 3 years of experience with MySQL Community and Enterprise databases
• Familiarity with other SQL databases such as MariaDB and Percona
• Strong knowledge of MySQL replication (binlog/GTID)
• Strong knowledge of backup/restore and disaster recovery
• Strong knowledge of MySQL InnoDB Cluster setup, Galera Cluster, and Group Replication
• Strong experience in migration and upgrade projects
• Knowledge of shell scripting to automate DBA tasks
• Knowledge of open-source tools such as Percona Toolkit, ProxySQL, PMM, Grafana, Prometheus, mysqldbcompare, and MaxScale
• Optimizing MySQL Server performance by rewriting queries and tuning database performance
• Work experience in handling change management and problem management
• Responsible for resolving all technical incidents escalated by the L-2 team
• Ability to plan resource requirements from high-level specifications
• Work experience on cloud platforms: Azure / AWS / GCP
Posted 2 months ago
1 - 5 years
8 - 18 Lacs
Navi Mumbai, Mumbai, Delhi
Work from Office
Below is the JD for ClickHouse Database. Helping build production-grade systems based on ClickHouse: advising on how to design schemas, plan clusters, etc. Environments range from single-node setups to clusters with hundreds of nodes, cloud deployments, and managed ClickHouse services. Working on infrastructure projects related to ClickHouse. Improving ClickHouse itself: fixing bugs, improving docs, creating test cases, etc. Studying new usage patterns, ClickHouse functions, and integration with other products. Working with the community: GitHub, Stack Overflow, Telegram. Install and configure multi-node clusters, handle backup and recovery, and maintain the ClickHouse database. Monitor and optimize database performance, ensuring high availability and responsiveness. Troubleshoot database issues, and identify and resolve performance bottlenecks. Design and implement database backup and recovery strategies. Develop and implement database security policies and procedures. Collaborate with development teams to optimize database schema design and queries. Provide technical guidance and support to development and operations teams. Experience with big data stack components such as Hadoop, Spark, Kafka, and NiFi. Experience with data science/data analysis. Knowledge of SRE/DevOps stacks: monitoring and system management tools (Prometheus, Ansible, ELK). Version control using Git. Handling support calls from customers using ClickHouse, including diagnosing problems connecting to ClickHouse, designing applications, deploying/upgrading ClickHouse, and operations.
Posted 2 months ago
3 - 7 years
40 - 45 Lacs
Bengaluru
Work from Office
As a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness, performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our cloud infrastructure with a special emphasis on cloud security and observability. You will work closely with development teams to architect, deploy, and optimize systems that are not only reliable but also resilient and secure. On a Normal Day, You Will: Develop, manage, and optimize Terraform modules and deployments across multiple environments. Handle SRE operational duties including responding to pull requests and ensuring smooth continuous integration and delivery processes. Maintain and fine-tune applications for optimal performance, ensuring they meet specified requirements. Explore and experiment with new technologies through Proof-of-Concepts to enhance existing functionalities or discover new opportunities. Automate deployment, configuration, and operational processes to improve efficiency and accuracy. Collaborate with development teams to guide system architecture and design, focusing on reliability, efficiency, and scalability. Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all critical services are monitored effectively. Develop custom reliability tools and frameworks for use by engineering teams. Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough post-mortem analyses. Drive system and process efficiencies including capacity planning, configuration management, performance tuning, monitoring, and root cause analysis. Act as a consultant within the organization for best practices in infrastructure management and assist teams in effective infrastructure utilization. Play a key role in capacity planning to help teams prepare for scaling and growth.
You Have: In-depth knowledge of cloud service providers like Azure or AWS, with a professional or specialty level certification (security certification is a plus). Strong understanding of REST and/or Graph APIs. Background in DevSecOps or cloud security, with experience in cloud security posture management applications. Experience with state machines such as AWS Step Functions or Azure Logic Apps. Deep knowledge of telemetry and observability; experience with Prometheus, OpenTelemetry, or Dynatrace is highly desirable. Proficiency in Kubernetes, with CKA/CKAD certification being advantageous. Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments. Good programming skills in high-level languages, with a preference for Python; Go or any other compiled language is an advantage. Familiarity with observability tools like Grafana, Prometheus, and New Relic. Strong project management and organizational skills. An open mindset with the ability to quickly adapt to new technologies and learning practices.
Posted 2 months ago
6 - 10 years
8 - 12 Lacs
Bengaluru
Work from Office
About The Role: Strong experience in Zabbix admin. Strong experience in Linux. Good knowledge of performance monitoring tools. Knowledge of Splunk/Nagios architecture components such as Management Server, agents, cross-platform, and management pack configuration. Should be able to create custom rules and monitors. Experience in cloud/troubleshooting in monitoring. Microsoft infrastructure tool experience. Primary Skills: Experience with additional monitoring tools (Nagios, Prometheus, Grafana). Hands-on with DevOps tools: Git (Azure DevOps, GitHub, GitLab), CI/CD (Jenkins, GitHub Actions, GitLab CI/CD). Secondary Skills: Cloud platform experience (AWS, Azure). Exposure to Cribl for log processing and analytics.
Posted 2 months ago
14 - 21 years
37 - 55 Lacs
Chennai, Pune, Bengaluru
Hybrid
Job Description: Engineering Leader | Distinguished Architect SRE
Technical Skills:
• 20+ years of overall IT experience. Technical leader with hands-on experience in software development and/or SRE. Should have worked technically on at least a couple of end-to-end SRE assignments with an automation mindset and proven capabilities.
• Hands-on experience defining and implementing SRE metrics such as SLI, SLO, SLA, EB, and MTTX and applying them at enterprise level to optimize platform/product resiliency and availability. Real-time experience with MLOps, AIOps, and chaos practices is an added advantage.
• Continuously refine maturity to incorporate industry best practices, standards, and guidelines related to the client ecosystem (ways of working, tools and technologies, and platforms).
• Should have some development background (Java, .NET, or Python) with strong code-handling capabilities and automation using Python, or at least L3 support experience with applications and scaled infrastructure on any cloud technology.
• Hands-on experience with observability configuration: Prometheus/Splunk, Grafana, Datadog, Alertmanager/PagerDuty, ELK stack, API gateway platform/Kong, WebLogic/Tomcat/JBoss. Should be able to build and configure customized metric exporters and dashboards.
• Hands-on experience with SRE transformation, toil automation, or SRE implementation in large banks or financial organizations with legacy infrastructure is an advantage.
• Hands-on with any cloud technology (AWS/Azure/GCP) or maintaining and managing physical servers (mostly Linux)/clusters, and/or should have worked on large-scale data center operations at >20K TPS level.
• Good understanding of digital engineering with DevOps and the cloud technical space.
• Excellent verbal and written communication, and the ability to present technical concepts to both technical and non-technical groups. Bring alignment on the strategic goals of the program.
Posted 2 months ago
15 - 20 years
35 - 40 Lacs
Pune
Work from Office
Job Overview: 15+ years of experience in designing, implementing, and managing a comprehensive suite of tools to support large-scale software development projects for government agencies. This role requires deep expertise in development, deployment, monitoring, and management tools, with a focus on ensuring high efficiency, security, and compliance with government standards.
Responsibilities:
• Toolchain Design and Integration: Design and implement a robust toolchain for the entire software development lifecycle (SDLC), including development, testing, deployment, and monitoring. Integrate tools seamlessly into existing infrastructure to support continuous integration and continuous delivery (CI/CD).
• Tool Selection and Evaluation: Identify and evaluate tools that meet project requirements for source control, build automation, code quality, security, and performance. Perform proof of concept (PoC) evaluations to validate tool effectiveness and compatibility.
• Automation and Optimization: Automate development and operational workflows to enhance productivity and reduce manual effort. Optimize tool configurations and workflows for performance, scalability, and security.
• Security and Compliance: Ensure that all tools and processes comply with relevant government security standards and regulations (e.g., NIST, ISO, GDPR). Implement security best practices for tool usage, data protection, and access control.
• Performance Monitoring and Reporting: Develop and deploy tools for real-time monitoring of system performance, application health, and security. Generate reports and dashboards to provide insights into project progress, system health, and compliance status.
• Collaboration and Support: Work closely with development, operations, and security teams to ensure tools meet their needs and are effectively integrated. Provide technical guidance and support for tool-related issues and training for team members.
• Documentation and Knowledge Sharing: Create and maintain comprehensive documentation for tool usage, configurations, and best practices. Promote knowledge sharing and best practices across the team and organization.
• Vendor Management and Licensing: Evaluate and manage relationships with tool vendors, ensuring tools meet technical and compliance requirements. Oversee tool procurement, licensing, and maintenance to ensure cost-effective and compliant usage.
Qualifications: Proficiency in CI/CD tools such as Jenkins, GitLab CI, or CircleCI. Experience with configuration management tools like Ansible, Chef, or Puppet. Strong knowledge of monitoring tools like Prometheus, Grafana, or Splunk. Familiarity with code quality and security tools such as SonarQube, Fortify, or Black Duck. Expertise in containerization and orchestration tools like Docker and Kubernetes. Knowledge of cloud-based tools and services on AWS, Azure, or GCP.
Key Technologies:
• CI/CD: Jenkins, GitLab CI, CircleCI
• Version Control: Git, Bitbucket
• Configuration Management: Ansible, Chef, Puppet
• Monitoring: Prometheus, Grafana, Nagios, Splunk
• Code Quality: SonarQube, CodeClimate, Fortify
• Security: Black Duck, Checkmarx, OWASP ZAP
• Containerization: Docker, Kubernetes, OpenShift
• Cloud: AWS, Azure, Google Cloud Platform
Posted 3 months ago
4 - 8 years
7 - 17 Lacs
Bengaluru
Work from Office
About this role: Wells Fargo is seeking a Senior Systems Operations Engineer.
In this role, you will: Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area. Contribute to increasing system efficiencies and lowering the human intervention time on related tasks. Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability. Work with vendors and other technical personnel for problem resolution. Lead the team to meet technical deliverables while leveraging a solid understanding of technical process controls or standards. Collaborate with vendors and other technical personnel to resolve technical issues and achieve the highest levels of systems and infrastructure availability.
Required Qualifications: 4+ years of Systems Engineering or Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education.
Desired Qualifications: Advanced understanding of the application monitoring stack (logs, metrics, events, traces, alerts) and the ability to visualize and set up end-to-end observability (infra and app components). Strong experience in using industry-standard monitoring tools (AppD, Splunk, ELK, APM, Grafana, Prometheus, etc.). Experience in deploying applications to cloud platforms. Experience in using CI/CD tools like Jenkins, Gradle, Groovy, and Maven. Experience in CM tools like Ansible and Puppet. Proficient in one of the programming languages (Java or Python). Knowledge of web services. Experience working with Agile methodology. Proficient in multiple infrastructure technologies.
Job Expectations: Design, code, test, and deliver software to automate manual operations work. Partner with different application teams throughout the life cycle to understand their application infrastructure monitoring and apply site reliability principles to baseline and set up SLOs for critical components. Identify application patterns and analytics in support of better service level objectives. Design automated software and product upgrades, change management, and release management solutions. Other responsibilities extend to application deployment, change management, incident management, capacity upgrades, reporting, system integrations, and essentially ensuring the availability of a stable and performing platform used by development and technology across the firm. Design self-healing and resiliency patterns. Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents. Experience in running chaos experiments. Experience in measuring the reliability stack using SLI, SLO, and error budget. Hands-on experience in Linux. Database experience: hands-on in Oracle SQL, PL/SQL, MongoDB. DevOps experience. Very good experience with AutoSys. Knowledge of Ab Initio. Exposure to tools like ServiceNow and JIRA. Hands-on experience in Ansible, Python, and shell scripting. Familiarity with NDM and SFTP. Basics of networking.
Posted 3 months ago
4 - 6 years
7 - 17 Lacs
Hyderabad
Work from Office
About this role: Wells Fargo is seeking a Senior Systems Operations Engineer.
In this role, you will: Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area. Contribute to increasing system efficiencies and lowering the human intervention time on related tasks. Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability. Work with vendors and other technical personnel for problem resolution. Lead the team to meet technical deliverables while leveraging a solid understanding of technical process controls or standards. Collaborate with vendors and other technical personnel to resolve technical issues and achieve the highest levels of systems and infrastructure availability.
Required Qualifications: 4+ years of Systems Engineering or Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education.
Desired Qualifications: Advanced understanding of the application monitoring stack (logs, metrics, events, traces, alerts) and the ability to visualize and set up end-to-end observability (infra and app components). Strong experience in using industry-standard monitoring tools (AppD, Splunk, ELK, APM, Grafana, Prometheus, etc.). Experience in deploying applications to cloud platforms. Experience in using CI/CD tools like Jenkins, Gradle, Groovy, and Maven. Experience in CM tools like Ansible and Puppet. Proficient in one of the programming languages (Java or Python). Knowledge of web services. Experience working with Agile methodology. Proficient in multiple infrastructure technologies.
Job Expectations: Design, code, test, and deliver software to automate manual operations work. Partner with different application teams throughout the life cycle to understand their application infrastructure monitoring and apply site reliability principles to baseline and set up SLOs for critical components. Identify application patterns and analytics in support of better service level objectives. Design automated software and product upgrades, change management, and release management solutions. Other responsibilities extend to application deployment, change management, incident management, capacity upgrades, reporting, system integrations, and essentially ensuring the availability of a stable and performing platform used by development and technology across the firm. Design self-healing and resiliency patterns. Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents. Experience in running chaos experiments. Experience in measuring the reliability stack using SLI, SLO, and error budget. Hands-on experience in Linux. Database experience: hands-on in Oracle SQL, PL/SQL, MongoDB. DevOps experience. Very good experience with AutoSys. Knowledge of Ab Initio. Exposure to tools like ServiceNow and JIRA. Hands-on experience in Ansible, Python, and shell scripting. Familiarity with NDM and SFTP. Basics of networking.
Posted 3 months ago
8 - 11 years
20 - 30 Lacs
Pune
Hybrid
So, what’s the role all about? A Java developer is a software professional specializing in designing, developing, and maintaining applications and systems using the Java programming language. They play a critical role in building scalable, robust, and high-performing applications for a variety of industries, including finance, healthcare, technology, and e-commerce. How will you make an impact? Bachelor’s degree in computer science, Business Information Systems, or a related field, or equivalent work experience, is required. 8-11 years of experience in software development. Well-established technical problem-solving skills. Experience in Java, Spring Boot, and microservices. Experience with Kafka, Kinesis, KDA, and Apache Flink. Experience in Kubernetes operators, Grafana, and Prometheus. Experience with Snowflake or any DWH solution. Experience with AWS technology including EKS, EMR, S3, Kinesis, Lambda, Firehose, IAM, CloudWatch, etc. Excellent communication, problem-solving, and decision-making skills. Experience in databases. Experience in CI/CD, Git, GitHub Actions, and Jenkins-based pipeline deployments. Strong experience in SQL. Working knowledge of unit testing. Working knowledge of user stories or use cases. Working knowledge of design patterns or equivalent experience. Working knowledge of object-oriented software design. Team player. Have you got what it takes? Bachelor’s degree in computer science, Business Information Systems, or a related field, or equivalent work experience, is required. 8-11 years (SE) of experience in software development. Well-established technical problem-solving skills. Experience in Java, Spring Boot, and microservices. Experience with Kafka, Kinesis, KDA, and Apache Flink. Experience in Kubernetes operators, Grafana, and Prometheus. Experience with Snowflake or any DWH solution.
Experience with AWS technology including EKS, EMR, S3, Kinesis, Lambda, Firehose, IAM, CloudWatch, etc. You will have an advantage if you also have: Experience in big data. What’s in it for you? Join an ever-growing, market-disrupting, global company where the teams, comprised of the best of the best, work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr! Enjoy NICE-FLEX! At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 6692. Reporting into: Tech Manager. Role Type: Individual Contributor.
Posted 3 months ago
10 - 14 years
35 - 37 Lacs
Pune
Work from Office
Job Overview: 10+ years of experience in designing and implementing scalable, secure, and efficient network and storage solutions for large-scale software development projects. Deep technical expertise in network architecture, data storage technologies, and a strong understanding of government standards and compliance requirements. Responsibilities: Network Architecture Design: Design, implement, and maintain highly available and secure network architectures. Ensure network solutions meet performance, scalability, and security requirements. Optimize network infrastructure to support large-scale software applications and services. Storage Solutions Development: Architect and deploy scalable and resilient storage solutions (SAN, NAS, Object Storage). Design data storage strategies to meet high availability, disaster recovery, and backup requirements. Implement data lifecycle management practices to ensure data integrity and compliance. Infrastructure Integration: Integrate network and storage solutions with cloud environments (AWS, Azure, GCP). Ensure seamless integration of on-premises and cloud infrastructure. Collaborate with cloud architects to optimize hybrid cloud environments. Security and Compliance: Design and implement security measures for network and storage systems in line with government standards (NIST, ISO, GDPR). Conduct regular security assessments and audits. Ensure compliance with all relevant regulations and data protection laws. Performance Optimization: Analyze and optimize network and storage performance to meet project requirements. Implement monitoring solutions to ensure continuous performance improvement. Conduct capacity planning and scalability assessments. Collaboration and Support: Work closely with software development and IT operations teams to support project requirements. Provide technical guidance and support for network and storage-related issues. Mentor junior engineers and promote knowledge sharing within the team. 
Documentation and Reporting: Develop comprehensive documentation for network and storage architectures. Prepare reports on system performance, security, and compliance. Maintain up-to-date records of network and storage configurations. Vendor Management: Evaluate and manage relationships with network and storage vendors. Ensure that vendor solutions meet technical and compliance requirements. Oversee procurement and implementation of vendor solutions. Qualifications: Mandatory Minimum CCNA, CCNP certified. CCIE will be an advantage. Relevant certifications such as VMware VCP, AWS Certified Solutions Architect, or equivalent are a plus. Expertise in network technologies (e.g., routers, switches, firewalls, VPNs). Proficiency in storage technologies (e.g., SAN, NAS, SSDs, RAID). Strong knowledge of cloud platforms (AWS, Azure, GCP). Good to have Familiarity with data protection and security standards. Experience with network monitoring and management tools Key Technologies Knowledge Networking: Cisco, Juniper, F5, Palo Alto Storage: NetApp, EMC, HPE 3PAR, AWS S3, Azure Blob Storage Cloud: AWS, Azure, Google Cloud Platform Security: VPN, Firewalls, Encryption, Access Control Tools: Nagios, SolarWinds, Splunk, Prometheus.
Posted 3 months ago
6 - 10 years
18 Lacs
Bengaluru
Remote
Technical Support Engineer 3 (Night Shift). Role & Responsibilities: Provide Level 3 support for our Stream+ platform. Diagnose, troubleshoot, and resolve production issues, ensuring swift resolution to minimize customer impact. Conduct root cause analysis (RCA) for recurring issues and implement permanent fixes. Maintain and troubleshoot MS SQL Server databases, ensuring data integrity, availability, and performance. Collaborate with Level 1 and Level 2 support teams to escalate and resolve issues efficiently. Document fixes, enhancements, and issue resolutions to facilitate knowledge sharing and future reference. Assist in the release of hotfixes or patches in coordination with the development team. Ensure compliance with Service Level Agreements (SLAs) for response times and issue resolution. Share feedback with product and engineering teams regarding product supportability and customer pain points. This role requires working from 8:00 AM to 5:00 PM CST. Requirements & Qualifications: 5+ years of experience in a technical support role. Strong proficiency in at least one programming language: Ruby, Golang, Python, or JavaScript. Solid knowledge of Microsoft SQL (MS SQL) for database maintenance and troubleshooting. Strong understanding of REST APIs and experience in troubleshooting API-related issues. Experience with monitoring tools (e.g., Prometheus, Grafana, AWS CloudWatch). Proven experience in conducting root cause analysis (RCA) and resolving production issues. Familiarity with support tools (e.g., Jira) and processes for issue tracking and maintaining Service Level Agreements (SLAs). Excellent communication skills to effectively interact with customers and internal teams. Ability to work independently and resolve production issues in high-pressure environments. Previous experience in CST shifts or a support-oriented role is preferred.
Posted 3 months ago
15 - 24 years
25 - 30 Lacs
Pune
Hybrid
Job Purpose The Infrastructure Operations Lead is responsible for overseeing the day-to-day operations, maintenance, and support of IT infrastructure. This includes ensuring system availability, performance, security, and reliability across cloud and on-premises environments. The role also involves incident management, problem resolution, and operational process improvements to align with business and IT objectives. Ensure the availability, reliability, and performance of IT infrastructure, including on premises and cloud servers of Linux and Windows. Oversee the management of monitoring tools to track system health and prevent outages. Lead incident management efforts, including troubleshooting, root cause analysis, and resolution of infrastructure issues. Ensure compliance with IT service management (ITSM) processes, including change management, problem management, and service request fulfilment. Collaborate with internal teams and vendors to ensure SLAs are met. Ensure infrastructure security best practices are followed, including patching, vulnerability management, and access controls. Work closely with security teams to enforce compliance with security policies, audits, and regulatory requirements. Implement backup and disaster recovery strategies to ensure business continuity. Identify opportunities for automation in infrastructure provisioning, monitoring, and troubleshooting. Optimize operational processes by implementing best practices for DevOps, ITIL, and cloud infrastructure management by working with DevOps engineer and Infrastructure Technical Lead. Lead a team of system administrators, ensuring alignment with business goals. Provide technical guidance and mentorship to the team. Collaborate with cross-functional teams, including development, security, and cloud engineering, to align operational strategies. Maintain and improve documentation for infrastructure components, operational procedures, and incident reports. 
Provide regular reports on system performance, incident trends, and operational improvements to management. Provide support for AWS systems, including monitoring and resolution of issues. Optimize resources and work on resource tagging to allocate costs and to carefully plan budgeting, governance, and reporting. Participate in and support capacity planning and the development of long-term strategic goals for systems and software in conjunction with end users and department managers. Collaborate with other teams and team members to develop automation strategies and deployment processes. Provide after-hours support for infrastructure-related emergencies as well as occasional weekend maintenance. Develop and maintain documentation about the current environment setup, standard operating procedures, and best practices. DevOps knowledge is a plus. A multiskilled person is preferred, or the candidate must have the attitude to learn multiple technologies. Skills, Technical Competences & Behaviors: Certifications: VMware/Nutanix, MCSE, Citrix, AWS, and Azure. 15+ years of professional experience exclusively as a Windows and Linux administrator. 5+ years of experience as a team lead. Strong knowledge of IT infrastructure components, including networking, servers, storage, virtualization, and cloud platforms (AWS, Azure). Experience with monitoring tools (Zabbix, Prometheus, Datadog) and log management tools (Splunk, ELK). Familiarity with MS SQL Server and Windows clustering. Experience with automation tools (Ansible, Terraform, PowerShell, Python, Bash). Hands-on experience with ITSM tools such as ServiceNow for incident and change management.
Working experience in VMware vCenter and ESXi environments and in Nutanix Prism Element and Prism Central. Aggressively automates repeated tasks to allow the team to scale with the organization's growth. Experience working together with application owners, business units, and third parties to deliver shared goals. Hands-on solution design skills and the ability to objectively quality-assure third-party solution designs to ensure they meet business expectations. Fluent in English. Strong verbal and written communication skills to engage with technical and non-technical stakeholders. Bear personal responsibility and demonstrate quality awareness. Behave loyally and comply with rules, regulations, and legal requirements. Experience in mentoring and leading operational teams. Ability to enforce ITIL-based operational processes effectively. Ability to troubleshoot and resolve complex infrastructure issues. KPIs: Alignment with the business SLA agreement. Always-up infrastructure. Timely support for business requirements.
Posted 3 months ago
6 - 9 years
11 - 16 Lacs
Bengaluru
Work from Office
Your Impact: We are seeking a highly skilled and experienced Lead Software Engineer to design, implement, and manage robust and scalable CI/CD pipelines using GitLab CI and other DevOps tools. The ideal candidate will have a deep understanding of infrastructure as code (IaC), cloud platforms (AWS, GCP, Azure), and automation techniques to streamline deployment and infrastructure management. This role requires expertise in Terraform, Ansible, Kubernetes, and Python to enhance operational efficiency and security. What the role offers: Define and lead the architectural vision for Helm-based installers and upgrade frameworks for Kubernetes applications. Design and optimize Helm charts to streamline installation, upgrades, and rollbacks. Establish best practices for Kubernetes-based deployments, scalability, and fault tolerance. Lead the evaluation and adoption of new tools and technologies in the Kubernetes ecosystem. Oversee security, compliance, and performance considerations in installation and upgrade processes. Provide technical leadership and mentorship to engineers in the Helm/Kubernetes domain. Troubleshoot and resolve complex deployment and upgrade issues. Stay up to date with emerging trends and advancements in Kubernetes, Helm, and cloud-native technologies. What you need to succeed: Bachelor's or Master's degree in Computer Science, Information Technology, or a related field. 6 to 9 years of experience in software engineering or DevOps/CD. Strong understanding of cloud platforms: AWS, Azure, GCP, OpenShift. Proficiency in Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation. Hands-on experience with CI/CD pipelines, GitOps methodologies, and automation frameworks. DevOps tools: GitLab CI/CD, ArgoCD/FluxCD, Helm, Maven, Node.js, JFrog Artifactory, SonarQube. Database: Postgres. Experience with monitoring and logging tools like Prometheus and Grafana.
Automation C Scripting: Shell Script, Python Deep knowledge of security, networking, and performance optimization in Kubernetes environments. Strong problem-solving, leadership, and communication skills..
Posted 3 months ago
7 - 10 years
13 - 17 Lacs
Hyderabad
Work from Office
YOUR IMPACT: The Senior Site Reliability Engineer (SRE) will be responsible for ensuring the availability, reliability, and scalability of cloud infrastructure and services. This role focuses on automation, performance optimization, incident response, and CI/CD pipeline management to support highly available and resilient applications. The ideal candidate will bring deep expertise in AWS, Kubernetes, GitLab CI/CD, and Infrastructure as Code (IaC). WHAT THE ROLE OFFERS: Cloud Infrastructure & Reliability Engineering Architect, deploy, and maintain highly available and scalable cloud environments in AWS. Design and manage Kubernetes clusters (EKS) and containerized applications with Docker. Implement auto-scaling, load balancing, and fault tolerance for cloud services. Develop and optimize Infrastructure as Code (IaC) using Terraform, Tofu, or Ansible. CI/CD & Automation Design, implement, and maintain CI/CD pipelines using GitLab CI/CD and ArgoCD. Automate deployment workflows, infrastructure provisioning, and release management. Ensure secure, compliant, and automated software delivery across multiple environments. Monitoring, Incident Response & Performance Optimization Implement observability and monitoring using tools like CloudWatch, Prometheus, Grafana, ELK, or Datadog. Analyze system performance, detect anomalies, and optimize cloud resource utilization. Drive incident response and root cause analysis, ensuring fast recovery (MTTR) and minimal downtime. Establish Service Level Objectives (SLOs) and error budgets to maintain system health. Security & Compliance Implement security best practices, including IAM policies, encryption, network security, and vulnerability scanning. Automate patch management and security updates for cloud infrastructure. Ensure compliance with industry standards and regulations (SOC2, ISO27001, HIPAA, etc.). Collaboration & Leadership Work closely with DevOps, security, and development teams to drive reliability best practices. 
Lead blameless postmortems and continuously improve operational processes. Provide mentorship and training to junior engineers on SRE principles and cloud best practices. Participate in on-call rotations, ensuring 24/7 reliability of production services. WHAT YOU NEED TO SUCCEED: Bachelor's degree in Computer Science, Engineering, or equivalent experience. 7-10 years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Engineering. Expertise in AWS Cloud: Hands-on experience with EC2, VPC, RDS, S3, IAM, Lambda, and EKS. Strong Kubernetes knowledge: Hands-on experience with EKS, Helm charts, and cluster management. CI/CD experience: Proficiency in GitLab CI/CD, ArgoCD for automated software deployments. Infrastructure as Code (IaC): Experience with Terraform, Tofu. Monitoring & Logging: Familiarity with CloudWatch, Prometheus, Grafana, ELK, or Datadog. Scripting & Automation: Proficiency in Python, Shell scripting, or Golang. Incident Management & Reliability Practices: Experience with SLOs, SLIs, error budgets, and chaos engineering.
Posted 3 months ago
8 - 13 years
10 - 15 Lacs
Bengaluru
Work from Office
Java Tech Lead. Exp: 8 - 13 Yrs. CTC: 30 - 32 LPA. Loc: Pune, Hyd, Blr, Chennai and Vadodara. Shifts and Work Mode: 2 PM - 11 PM and Hybrid. Skillset: Java, Spring, Spring Boot, Microservices, REST API, MongoDB and Kafka / RabbitMQ / ActiveMQ. Required Skills: Java: Strong expertise in Java 8+ with hands-on experience in building enterprise applications. Spring Boot: Proficiency in developing RESTful APIs and microservices using Spring Boot. Java Batch: Experience with batch processing frameworks like Spring Batch. Experience with Angular is preferred. Oracle Database: In-depth knowledge of PL/SQL, query optimization, and database management. Development Tools: Familiarity with build tools (e.g., Maven), version control systems (e.g., Git), and CI/CD pipelines. Testing: Experience with unit testing (e.g., JUnit) and integration testing frameworks. Preferred Skills: Knowledge of cloud platforms (AWS, Azure, or GCP). Familiarity with Docker and Kubernetes for containerized deployments. Experience with monitoring tools like Splunk, New Relic, or Prometheus. Exposure to Agile/Scrum methodologies.
Posted 3 months ago
6 - 8 years
8 - 10 Lacs
Chennai
Work from Office
About The Role :: We are seeking a highly skilled and experienced Lead SRE Developer with a strong background in Java and Kotlin. The ideal candidate will lead a team of Java developers, ensuring high-quality software delivery and support for production environments. This role involves designing and implementing scalable Java solutions, collaborating with cross-functional teams, and driving innovation through the adoption of new technologies. Overall Responsibilities: Lead a team of Java developers and ensure high-quality software delivery. Develop and maintain Java-based applications and systems. Design and implement scalable and efficient Java solutions to meet business requirements. Collaborate with cross-functional teams to resolve technical issues and drive innovation. Provide support for production environments, guiding users to appropriate teams for issue resolution and incident escalation as needed. Technical Skills: Primary Skills: Expertise in SRE (Site Reliability Engineering) practices. Proficiency in Java (mandatory) and Kotlin (preferred). Strong understanding of OOPs concepts, concurrency, and exception handling. Hands-on experience in designing and building Java microservices using VERT.X and SPRING BOOT. Proficient in GitHub Actions for efficient code integration and deployment workflows. Expertise in Cloud-native technologies, including Kubernetes, Helm, and containerization. Strong debugging and troubleshooting skills. Knowledge of databases like Cassandra, Couchbase, and MongoDB. Experience with monitoring tools such as Micrometer, Prometheus, Elastic, Kibana, Grafana, and Splunk. Secondary Skills: Demonstrated ability to contribute to the development of complex applications or highly scalable solutions end-to-end in SDLC. Experience: At least 6-8 years of experience in Java development. Proven experience leading a team of Java developers. Previous experience in delivering Java-based solutions for large enterprises. 
Day-to-Day Activities: Lead and mentor a team of Java developers. Analyze business requirements and translate them into technical solutions. Develop, test, and deploy Java-based applications and systems. Troubleshoot and resolve technical issues in a timely manner. Collaborate with cross-functional teams to drive innovation and implement new technologies. Qualifications: Bachelor's degree in Computer Science, Computer Engineering, or a related field. Advanced certifications in Java development (e.g., Oracle Certified Professional, Java SE 11 Developer). Soft Skills: Excellent communication and leadership skills. Strong interpersonal and collaboration skills. Ability to work under pressure and meet tight deadlines. Positive attitude and strong work ethic. A commitment to continuous learning and professional development. SYNECHRON'S DIVERSITY & INCLUSION STATEMENT: Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative "Same Difference" is committed to fostering an inclusive culture promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more. All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant's gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
Posted 3 months ago
7 - 11 years
12 - 20 Lacs
Mumbai
Hybrid
Job Title: DevOps Engineer. Location: Mumbai. Work mode: Onsite. Notice Period: Immediate. iSource Services is hiring for one of their clients for the position of DevOps Engineer. Skills: DevOps (AWS, Jenkins, K8s, Prometheus, Splunk, Grafana), PHP Framework, OOP. Responsibilities: Security patches. QCR/Compliance. Bug fixes. L3 escalations including PDP dropin (FE) BF/Holiday. Special events preparation. Testing (e2e, performance). Release validation. Deployment on pre-prod environment. Monitoring and alerting changes. Monitor AWS resources, K8s clusters. On-call duties.
Posted 3 months ago
8 - 10 years
10 - 12 Lacs
Chennai
Work from Office
POC: Shipra. Java fullstack. Mandatory Skills: Java, Spring, Spring Boot, Azure SQL, Kafka, Kubernetes, Docker, Monitoring Tools (Grafana, Prometheus), Cloud Platforms (Azure, GCP). Good-to-Have Skills: Experience in CI/CD pipeline setup and DevOps practices. Max budget: 24 to 26 LPA. Location: Chennai only. Experience: 8 to 10 years.
Posted 3 months ago
8 - 10 years
10 - 12 Lacs
Bengaluru
Work from Office
Java backend. Experience: 8 to 10 years. Location: Bangalore. POC: Viswanath. Skills: Java, Spring, Spring Boot, Data Structures, Algorithms, OOPs, SQL, Kafka, Kubernetes, Docker, Monitoring Tools (Grafana, Prometheus), Cloud (Azure, GCP). Regular IDC Shift + Overlaps.
Posted 3 months ago
8 - 13 years
35 - 65 Lacs
Bengaluru
Hybrid
Say hello to possibilities. It's not every day that you consider starting a new career. We're RingCentral, and we’re happy that someone as talented as you is considering this role. First, a little about us: we’re the $2 billion global leader in cloud-based communications and collaboration software. We are fundamentally changing the nature of human interaction—giving people the freedom to connect powerfully and personally from anywhere, at any time, on any device. This is where you and your skills come in. We’re currently looking for: You will be a part of the team responsible for running our product and its cloud infrastructure. You will contribute to the product and infrastructure focusing on availability, maintainability, and scalability. You will apply the best practices of site reliability engineering, operational discipline, and automation. You should be motivated, organized, excited about technology and SaaS products, a thorough critical thinker, and relentless in code quality, scalability, latency, and platform stability. Our culture is motivational, constructive, and positive. We value teamwork, camaraderie, and collaboration. If you’re up for a fun challenge, we want to hear from you. To succeed in this role you must have experience in: Technology Stack: AWS, Kubernetes (EKS), Aurora RDS (PostgreSQL/MySQL), Kafka, Argo CD, Prometheus, Jenkins, GitLab CI, Terraform, Ansible, Python, Java, Ruby. 
Design, plan and implement a HA and cost-effective cloud infrastructure with an IaC approach Develop, scale, and maintain automated CI/CD process using the GitOps Increase service automation to improve maintainability, scalability, and engineering productivity Plan system capacity and develop tooling for product on-demand scaling Troubleshoot and resolve software and technical issues, participate in incidents resolution, and perform root cause analysis Participate in an on-call process Plan disaster recovery procedures and develop automation for fast and reliable service restoration Implement security & compliance requirements Interact with development and architecture teams to improve service observability and performance, eliminate logging and monitoring white spots, suggest architectural and process improvements Evaluate and adopt new cloud-native technologies Desired Qualifications: 8+ years of technical experience in the same or similar role supporting large-scale and high-load cloud-based production systems Experience in the development and support of public cloud infrastructure Hands-on experience in running HA applications and development of the CI/CD process in Kubernetes Proven programming skills in Python, Go or similar Good knowledge of Linux environment, TCP/IP, network routing, DNS Familiar with SRE principles, DevOps practices, and modern cloud-native landscape Accuracy, attention to details, ability to follow processes Good communication skills Experience with Contact Center, VoIP solutions is a HUGE plus Ability to read and troubleshoot Java code if needed is a plus Experience in SQL/NoSQL DB's or attitude to develop skills in this field is a plus What we offer: Mediclaim benefits Paid holidays Casual/Sick leave Privilege leave Bereavement leave Maternity & Paternity leave Wellness programs & coaching Employee referral bonus Professional development allowances Night shift allowances RingCentral’s Engineering team works on high-complexity projects 
that set the standard for performance and reliability at massive scale. What kind of scale? Millions of users today and hundreds of millions tomorrow. This is your chance to help imagine, develop and deliver products that raise the technological bar, and power human connections. If you’re a talented, ambitious, creative thinker, RingCentral is the perfect environment to join a world class team and bring your ideas to life. RingCentral’s work culture is the backbone of our success. And don’t just take our word for it: we are recognized as a Best Place to Work by Glassdoor, the Top Work Culture by Comparably and hold local BPTW awards in every major location. Bottom line: We are committed to hiring and retaining great people because we know you power our success. RingCentral offers on-site, remote and hybrid work options optimized for the ways we work and live now. About RingCentral: RingCentral, Inc. (NYSE: RNG) is a leading provider of business cloud communications and contact center solutions based on its powerful Message Video Phone™ (MVP™) global platform. More flexible and cost effective than legacy on-premises PBX and video conferencing systems that it replaces, RingCentral® empowers modern mobile and distributed workforces to communicate, collaborate, and connect via any mode, any device, and any location. RingCentral is headquartered in Belmont, California, and has offices around the world. RingCentral is an equal opportunity employer that truly values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Posted 3 months ago
5 - 8 years
12 - 19 Lacs
Mumbai
Work from Office
Key Responsibilities: 1. Platform Engineering: - Design, build, and maintain scalable and resilient platform solutions using Kubernetes, Docker, and other container orchestration tools. - Implement and manage Kafka clusters for real-time data streaming and event-driven architecture. - Collaborate with application teams to design and deploy microservices-based architectures, focusing on scalability, performance, and reliability. 2. Tooling & Automation: - Develop and maintain CI/CD pipelines to automate application deployment and infrastructure provisioning using tools like Jenkins, Git. - Create automation scripts and tooling to reduce manual intervention in operational tasks, leveraging languages such as Python, Bash. - Implement Infrastructure as Code (IaC) using Ansible to manage cloud and on-premises environments efficiently. 3. Observability & Monitoring: - Implement observability solutions using Grafana, Prometheus, and ELK Stack to monitor application performance, infrastructure health, and system reliability. - Develop dashboards, alerts, and runbooks to enable proactive incident management and quick response to service disruptions. - Conduct performance tuning and capacity planning to ensure optimal operation of platforms and applications. 4. Application Engineering Support: - Work closely with development teams to optimize application performance and troubleshoot production issues. - Implement service mesh solutions for microservices management, ensuring secure and efficient communication between services. - Assist in the design and implementation of scalable data pipelines and workflows using Kafka and other streaming technologies. 5. Security & Compliance: - Ensure platform security through effective access controls, secure deployment practices, and regular vulnerability assessments. - Collaborate with security teams to implement policies and tools that safeguard data and application integrity. 6. 
Collaboration & Documentation: - Document infrastructure, processes, and best practices to ensure knowledge sharing across teams. - Work in a cross-functional environment, collaborating with software developers, QA engineers, and other SREs to continuously improve system reliability. Qualifications: - Bachelor's degree in Computer Science, Engineering, or equivalent practical experience. - 5+ years of experience in SRE, DevOps, or Platform Engineering roles. - Strong knowledge of Kubernetes, Docker, and container orchestration platforms. - Proficiency in managing Kafka clusters and understanding data streaming technologies. - Experience with observability tools such as Grafana, Prometheus, and ELK Stack. - Hands-on experience with CI/CD pipelines and automation tools. - Expertise in scripting languages like Python and Bash. - Familiarity with cloud platforms (AWS, GCP, Azure) and Infrastructure as Code (IaC) tools like Ansible.
Posted 3 months ago
5 - 10 years
15 - 30 Lacs
Chennai
Work from Office
Role/Job Title: Senior DevOps Engineer Function/Department: Information Technology Job Purpose: We are seeking a highly skilled and experienced DevOps Engineer with a primary focus on AWS cloud services. The ideal candidate will be proficient in automated provisioning, cloud management, and possess expertise in tools such as Terraform and Packer. As a DevOps Engineer, you will play a crucial role in designing, implementing, and maintaining our cloud infrastructure to ensure optimal performance, reliability, and scalability. Roles and Responsibilities: AWS Cloud Management: Design, deploy, and manage AWS cloud infrastructure. Optimize and maintain cloud resources for performance and cost efficiency. Monitor and ensure the security of cloud-based systems. Automated Provisioning: Develop and implement automated provisioning processes for infrastructure deployment. Utilize tools like Terraform and Packer to automate and streamline the provisioning of resources. Infrastructure as Code (IaC): Champion the use of Infrastructure as Code principles. Collaborate with development and operations teams to define and maintain IaC scripts for infrastructure deployment and configuration. Collaboration and Communication: Work closely with cross-functional teams to understand project requirements and provide DevOps expertise. Communicate effectively with team members and stakeholders regarding infrastructure changes, updates, and improvements. Continuous Integration/Continuous Deployment (CI/CD): Implement and maintain CI/CD pipelines to automate software delivery processes. Ensure reliable and efficient deployment of applications through the development lifecycle. Performance Monitoring and Optimization: Implement monitoring solutions to track system performance, troubleshoot issues, and optimize resource utilization. Proactively identify opportunities for system and process improvements. Mandatory Skills: Proven experience as a DevOps Engineer or similar role, with a focus on AWS. 
Strong proficiency in automated provisioning and cloud management. Experience with Infrastructure as Code tools, particularly Terraform and Packer. Solid understanding of CI/CD pipelines and version control systems. Strong scripting skills (e.g., Python, Bash) for automation tasks. Excellent problem-solving and troubleshooting skills. Good interpersonal and communication skills for effective collaboration. Secondary Skills: AWS certifications (e.g., AWS Certified DevOps Engineer, AWS Certified Solutions Architect). Experience with containerization and orchestration tools (e.g., Docker, Kubernetes). Knowledge of microservices architecture and serverless computing. Familiarity with monitoring and logging tools (e.g., CloudWatch, ELK stack).
Posted 3 months ago
4 - 8 years
40 - 65 Lacs
Bengaluru, Bangalore Rural
Hybrid
Our mission is to create transformative, innovative, and personalized experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and the workloads running on them, are hosted both in datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Key Job Responsibilities and Duties: The core premise for the SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here. 
You will impact millions of people all over the globe with your creative solutions. You work in one of the biggest e-commerce companies in the world. You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers. You will have the opportunity to collaborate with many of the world's leading SREs. You will be free to launch your own ideas and solutions within our sophisticated production environment. Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka, etc. What you'll be doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role). What you'll bring: Solid experience in at least one programming language. 
Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 3 months ago
5 - 10 years
20 - 35 Lacs
Bengaluru
Work from Office
Airties is seeking an SRE for its AWS cloud-based Wi-Fi monitoring and optimization system. This position involves hands-on deployment, administration, maintenance, and support of the system, and data extraction for analysis. Your deliverables will enable our product and engineering teams to spin up, maintain, and monitor the necessary infrastructure they need to run our applications and services. As our Site Reliability Engineer (AWS), you'll get the opportunity to choose and implement a variety of technologies that will help us improve and streamline our infrastructure and processes. What you will do: Define and monitor SLOs and SLIs for critical services to ensure they meet performance and reliability targets. Regularly review and adjust these metrics as necessary. Lead and participate in incident response activities, including identifying, investigating, and resolving incidents to minimize impact on service availability and performance. Conduct post-incident reviews (postmortems) to identify root causes and implement preventative measures. Analyze system performance metrics and forecast capacity requirements to ensure adequate resources are available to support current and future workloads. Identify opportunities for performance optimization and efficiency improvements. Continuously evaluate and improve processes, tools, and infrastructure to enhance reliability, efficiency, and scalability. Stay up-to-date with industry trends, emerging technologies, and best practices, and drive innovation within the organization. Monitor system health and performance using monitoring tools and alerting systems, and respond promptly to alerts and incidents. Drive efficiency by automating repetitive tasks and processes. Evaluate and implement technology options for managing our enterprise SaaS products in the cloud. Enhance our platform by identifying areas for improvement based on monitoring data. 
Ensure robust security practices by leveraging industry best practices and available tools. Regularly assess and enhance security measures. Collaborate with security teams to implement and maintain compliance standards. Be the go-to expert for AWS services. Participate in design discussions related to AWS architecture. Optimize AWS resources, cost, and performance. Work closely with the development team to create a development environment that fosters productivity and innovation. Propose and drive adoption of new solutions that enhance our platform. Diagnose and resolve complex system and application issues promptly. What you should ideally bring: Hold a Bachelor of Science (BSc) degree in Engineering or a related field. Minimum 3 years of relevant experience in Platform Engineering, SRE, and/or DevOps in production environments. Expertise in AWS Cloud setup with 3+ years of hands-on experience. Proven track record of owning the uptime of distributed cloud-based systems. Possess at least 3 years of experience with scripting languages (Bash, Python, NodeJS, Ruby, or PHP) and related automation projects. Proficiency in “Infrastructure-as-Code” tools such as CloudFormation, Terraform, Chef, Ansible, and Puppet. Experience in building and using Observability frameworks for a microservice-based distributed AWS cloud setup with tools such as Prometheus, Grafana, CloudWatch, etc. Proficient in setting up and managing CI/CD pipelines and deployment tools (e.g., Jenkins, Git, GitHub, etc.). Experienced in a 24x7 support model for cloud uptime and maintenance activities. Strong spoken and written English communication skills. Self-driven, responsible, eager to learn, and proactive. Independent, goal-oriented, and proactive attitude. Disciplined and effective in remote work environments. Nice to Have: Comprehensive understanding of networking concepts (layers, firewalls, DNS, VPN, etc.) 
and how to build secure infrastructure and an awareness of common server security vulnerabilities. Have experience in designing and building scalable ETL infrastructure Have experience working in a distributed team(s) environment Got SaaS product experience, especially in a dynamic environment Used to own and provide 24/7 SaaS product support, running on a major vendor cloud AWS Cloud certification(s) Perks and benefits
Posted 3 months ago
2 - 5 years
8 - 18 Lacs
Navi Mumbai
Hybrid
Incident Handling & Resolution, coordinating resources, and ensuring swift resolution. Prioritize and categorize incidents based on impact and urgency. Escalate issues as needed to ensure appropriate levels of attention. APM Tools
Posted 3 months ago
Prometheus is a popular monitoring and alerting tool used in the field of DevOps and software development. In India, the demand for professionals with expertise in Prometheus is on the rise. Job seekers looking to build a career in this field have a promising outlook in the Indian job market.
These cities are known for their vibrant tech industry and have a high demand for professionals skilled in Prometheus.
The salary range for Prometheus professionals in India varies based on experience levels. Entry-level positions can expect to earn around ₹5-8 lakhs per annum, whereas experienced professionals can earn up to ₹15-20 lakhs per annum.
A typical career path in Prometheus may include roles such as:
- Junior Prometheus Engineer
- Prometheus Developer
- Senior Prometheus Engineer
- Prometheus Architect
- Prometheus Consultant
As professionals gain experience and expertise, they can progress to higher roles with increased responsibilities.
In addition to Prometheus, professionals in this field are often expected to have knowledge and experience in:
- Kubernetes
- Docker
- Grafana
- Time series databases
- Linux system administration
Having a strong foundation in these related skills can enhance job prospects in the Prometheus domain.
As you explore opportunities in the Prometheus job market in India, remember to continuously upgrade your skills and stay updated with the latest trends in monitoring and alerting technologies. With dedication and preparation, you can confidently apply for roles in this dynamic field. Good luck!