Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and uptime of critical services for our client's team. Your primary responsibilities will revolve around Kubernetes administration, CentOS server management, Java application support, incident handling, and change management. The ideal candidate for this role should have a solid background in ArgoCD for Kubernetes management, Linux proficiency, basic scripting skills, and familiarity with modern monitoring, alerting, and automation tools. We are seeking a self-motivated individual with strong communication skills, both verbal and written, who can work effectively both independently and collaboratively. Your daily tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be responsible for conducting routine system and application maintenance tasks following standard operating procedures to prevent and resolve issues promptly. Additionally, you will be in charge of responding to and managing incidents, facilitating post-mortem meetings, conducting root cause analysis, and ensuring timely issue resolution. Furthermore, you will monitor production systems, applications, and overall performance, utilizing tools to detect abnormal behaviors in software and collect relevant information for developers to understand and address the underlying causes. Security checks, policy and procedure documentation, script/code writing for tool and service development, post-mortem learning, and administration work on tools like JIRA and New Relic are also part of your responsibilities. In terms of technical skills, you should have at least 5 years of experience in a SaaS and Cloud environment. Proficiency in Kubernetes cluster administration, Linux scripting, database systems (MySQL, DB2), Linux (CentOS / RHEL) administration, change management procedures, on-call responsibilities, deployment management using Jenkins, monitoring tools (e.g., New Relic, Splunk, Nagios), log aggregation tools (e.g., Splunk, Loki, Grafana), and scripting knowledge in at least one language is essential. Experience with API programming and integrating tools such as Jira, Slack, xMatters/PagerDuty will be advantageous for this role.,
Posted 1 day ago
2.0 - 6.0 years
0 Lacs
ghaziabad, uttar pradesh
On-site
At RightCrowd, we are revolutionizing physical access control with SmartAccess, a next-generation platform that redefines how people interact with security systems. We are transforming an outdated industry into a seamless, futuristic experience. Imagine doors opening effortlessly, just like in Star Trek! Our innovative platform powers cutting-edge solutions that enhance the daily experiences of employees, visitors, and users. Trusted by some of the world's largest organizations, including top tech companies, our products are making a global impact. We are not looking for the perfect candidate with a flawless resume. Instead, we value curiosity, a willingness to learn, and a commitment to making a difference. If you are excited about tackling challenges, growing your skills, and contributing to innovative solutions, we'd love to hear from you, even if you do not meet every single requirement. To enhance existing features and develop new, groundbreaking solutions, we are looking for a passionate Full Stack Software Engineer to join our remote team. Our team has its roots in a Belgian startup, and we still carry the startup spirit within us. We strive to maintain a small team size and minimize corporate overhead. In essence, we offer a high-responsibility, high-expectation environment with cutting-edge technology, free from unnecessary rules and constraints. **Key Responsibilities:** - Develop and maintain our web interfaces. - Review and give feedback on use cases, UI and UX design. - Contribute to the development of our backend services. - Support and evolve our cloud-native platform and infrastructure. - Perform development testing to ensure high-quality deliverables. - Assist in requirements gathering & architectural decision-making and provide feedback to shape the product roadmap. - Create and maintain documentation while continuously sharing knowledge with the team and the broader company. - Assist in third-line support and handle customer support requests when needed. - Eager learner. We don't expect anyone to already know everything. **Requirements:** - Fluency in English, clear communicator - A commitment to lifelong learning - Proven 2-4 years of experience in software development within complex environments - Strong knowledge and experience in: - NodeJS and related frameworks - TypeScript - React - Unix systems and networking - Containerization, Docker - Excellent debugging and problem-solving skills - Analytical, intelligent and well-organized - Flexible, hands-on and comfortable in a fast-paced environment *Bonus points if you have experience with any of the following:* - Terraform - Containerization, Docker - Unix systems and networking - Good understanding of and experience with Kubernetes & GitOps - Observability (metrics, logs, and tracing) **Why Join Us ** - Be part of a company that is a leader in the safety, security, and compliance solution space. - Opportunity to work on innovative products that have a real impact on safety and security. - Collaborative and supportive work environment with opportunities for professional growth and development. - Competitive salary and benefits package. Ready to make an impact Apply now to join our team!,
Posted 2 days ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be ensuring the reliability and uptime of critical services. This will involve a strong focus on Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for someone who is self-motivated, possesses excellent communication skills (both oral and written), and can work both independently and collaboratively. Your main tasks will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will also be responsible for conducting routine tasks for system and application maintenance, following SOPs to correct and prevent issues. In addition, you will respond to and manage running incidents, conduct post-mortem meetings, perform root cause analysis, and ensure timely resolution. Furthermore, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collect information to help developers understand the root causes of problems. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary to develop tools and services, and learning from post-mortems to prevent new incidents are also part of your responsibilities. Technical skills required for this role include 5+ years of experience working in a SaaS and Cloud environment, administration of Kubernetes clusters with ArgoCD, Linux scripting for automation, experience with database systems like MySQL and DB2, Linux administration skills, understanding of change management procedures, on-call responsibilities, experience with managing deployments using Jenkins, and familiarity with monitoring tools like New Relic, Splunk, and Nagios. Additionally, experience with log aggregation tools like Splunk, Loki, or Grafana, strong scripting knowledge in at least one language, and experience with API programming and integrating tools such as Jira, Slack, and xMatters/PagerDuty are preferred. This is an exciting opportunity for a motivated individual with the right skill set to make a significant impact on our client's team.,
Posted 3 days ago
2.0 - 4.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
Position: Engineering Support Analyst As an Engineering Support Analyst, you will play a critical role in ensuring the stability and performance of essential business systems. Acting as a software detective, you will identify, investigate, and resolve issues across various system components. Your responsibilities will include triaging bugs, escalating tickets with detailed context, responding to system alerts, and initiating On-Call procedures when necessaryall while maintaining clear and effective communication. Key Responsibilities Provide technical support for critical business systems, ensuring timely issue identification and resolution. Collaborate with Traders, Developers, DevOps, and SRE teams to maintain seamless system operations. Conduct root cause analysis and implement preventive measures to mitigate recurring issues. Monitor system alerts and proactively address incidents to minimize downtime. Escalate issues with comprehensive documentation to ensure swift resolution. Offer coverage for global teams, including those in Australia (AEDT) and Europe (CET). Continuously drive improvements in system reliability and support processes. Key Accountabilities Deliver high-quality support to global stakeholders. Resolve incidents efficiently and effectively. Leverage monitoring tools to detect and respond to issues proactively. Contribute to continuous improvement initiatives and innovation in support practices. Preferred Experience & Skills 23 years of experience in technical support for critical business systems. Strong analytical and problem-solving abilities. Excellent verbal and written communication skills for effective collaboration with global teams. Solid understanding of incident and problem management principles. Experience with server stack and website support is a plus. Proficiency in debugging, issue analysis, and resolution. Technical Knowledge Familiarity with monitoring and observability tools such as: Grafana , Prometheus , Loki , Tempo Kubernetes , Docker Linux , Windows Kafka , Postgres Experience in building Grafana dashboards that integrate metrics, logs, and traces for proactive error detection. Testing experience is an added advantage. Education & Certifications A tertiary qualification in Information Technology or a related field is highly desirable. Show more Show less
Posted 3 days ago
7.0 - 9.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl We are always moving forward always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The Role Join us as a Site Reliability Engineer (SRE) and embark on an exciting journey of ensuring reliability, resiliency, and innovation in our information systems and ecosystems. As an SRE at Kyndryl, you&aposll be at the forefront of driving continuous improvement and delivering exceptional service to our customers. Your role goes beyond traditional engineering, as you&aposll have the opportunity to analyze business needs, tackle complex problems, and provide strategic advice and designs. You&aposll be involved in every stage of the software lifecycle, from building and testing to deploying changes and maintaining robust systems. We&aposre looking for a true visionary who can think strategically and help shape the future of our services. Your expertise in building trusted relationships with customers and partnering with them for success will be instrumental in driving our growth. As an SRE, you&aposll have the unique opportunity to work on end-to-end services, spanning customer sites and platforms. Collaboration and proactivity are key as you work alongside a talented team of professionals, eager to make a difference. You&aposll embrace an entrepreneurial mindset, taking ownership of your responsibilities and constantly seeking innovative solutions. With an unwavering focus on quality, robustness, and security, you&aposll be a driving force in implementing cutting-edge tools that enhance our operations, improve reliability, and gather valuable feedback on our platforms. Your ability to identify and mitigate common operational issues will play a crucial role in delivering seamless experiences to our customers. If you&aposre passionate about pushing the boundaries of technology, thrive in a collaborative environment, and are motivated by the opportunity to shape the future of reliability engineering, then we want to hear from you. Join our team and be part of a dynamic and forward-thinking organization that values innovation and excellence in everything we do. Your Future at Kyndryl Kyndryl has a global footprint, which means that as a Site Reliability Engineer at Kyndryl you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential offering a wide range of professional and personal growth opportunities that you wont find anywhere else. Who You Are Youre good at what you do and possess the required experience to prove it. However, equally as important you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused someone who prioritizes customer success in their work. And finally, youre open and borderless naturally inclusive in how you work with others. Required Technical And Professional Experience MS SQL with 7+ years of experience in operational management, including incident management and escalations. Oversee maintenance and optimization of various databases, ensuring reliability, performance, and availability. Service Recovery Management System to recovery customer IT service(s) in response to severity incidents. Engage/provide subject matter expertise and create & lead recovery plan. Handle customer communication (if required) & strong troubleshooting and problem solving approach, performance tuning and strong architectural knowledge. Conduct performance tuning activities, analyze database metrics, and make recommendations for improvement. Lead troubleshooting and resolution of database-related issues, conducting root cause analysis and implementing preventive measures. Review RCA documents for quality check & learnings & Mentor and provide guidance to team members, fostering their professional development and effectiveness. Hypercare support to troubled accounts to ensure stability of IT operations. Conduct Technical Heath Assessment (THA) to support service availability, service reliability and service stability. Preferred Technical And Professional Experience Degree in Computer Science, Engineering, or other highly technical, scientific discipline. Expertise with Ansible, Terraform, and Python. Experience with distributed technologies as well as dynamic resource management frameworks such as Kubernetes. Expertise in leveraging open-source tooling such as Prometheus, Grafana, or Loki. Being You Diversity is a whole lot more than what we look like or where we come from, its how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. But were not doing it single-handily: Our Kyndryl Inclusion Networks are only one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you and everyone next to you the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. Thats the Kyndryl Way. What You Can Expect With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate, build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees and support you and your family through the moments that matter wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together, we will all succeed. Get Referred! If you know someone that works at Kyndryl, when asked How Did You Hear About Us during the application process, select Employee Referral and enter your contact&aposs Kyndryl email address. Show more Show less
Posted 4 days ago
2.0 - 6.0 years
0 Lacs
chennai, tamil nadu
On-site
Job Description: Explore your next opportunity at a Fortune Global 500 organization and envision innovative possibilities as you experience a rewarding culture and work with talented teams that help you become better every day. If you have the unique combination of skill and passion to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level at UPS. Job Summary: As a member of the UPS team, you will provide input, support, and perform full systems life cycle management activities, including analyses, technical requirements, design, coding, testing, and implementation of systems and applications software. You will participate in component and data architecture design, technology planning, and testing for Applications Development (AD) initiatives to meet business requirements. Collaboration with teams and support for emerging technologies to ensure effective communication and achievement of objectives will be a key aspect of this role. Your expertise will be utilized to provide knowledge and support for applications development, integration, and maintenance, with input to department and project teams on decisions supporting projects. Responsibilities: - Experience developing with various technologies including front end, APIs/services/backend, database, MQ/Messaging, HTML/JavaScript, .NET, .NET Core, OpenShift, Azure DevOps Server/TFS, GIT, Jenkins - CI/CD, SonarQube, Netsparker, Dynatrace, Grafana/Loki - Security compliance - Experience with Restful services and CI/CD pipelines - Proficiency in Object-Oriented Analysis & Design - Familiarity with Agile and Scrum concepts - Excellent written and verbal communication skills - Strong problem-solving and debugging skills Qualifications: - 2-4 years of development experience using .Net, Angular, and frontend technologies - Bachelor's Degree or International equivalent in Computer Science, Information Systems, Mathematics, Statistics, or related field (Preferred) Employee Type: Permanent UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.,
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As a DevOps Engineer at NTT DATA Business Solutions, your role involves implementing and maintaining cloud infrastructure to ensure the smooth operation of the environment. You will be responsible for evaluating new technologies in infrastructure automation and cloud computing, looking for opportunities to enhance performance, reliability, and automation. Additionally, you will provide DevOps capability to team members and customers, perform code deployments, and manage release activities. Your responsibilities will also include resolving incidents and change requests, documenting solutions, and communicating them to users. You will work on optimizing existing solutions, diagnosing, troubleshooting, and resolving issues to ensure the smooth operation of services. Demonstrating a proactive attitude and aptitude for taking ownership of your work and collaborating with team members will be crucial. To excel in this role, you are required to have a Bachelor's degree in IT, computer science, computer engineering, or a related field, along with a minimum of 6 years of overall experience with at least 3 years as a DevOps Engineer. Advanced experience with Cloud Infrastructure and Cloud Services, particularly on Microsoft Azure, is essential. You should also have expertise in container orchestration (Kubernetes, Docker, Helm), Linux scripting (Bash, Python), log and metrics management (ELK Stack), monitoring tools (Prometheus, Loki, Grafana, Dynatrace), and infrastructure as code (Terraform). Furthermore, you must be proficient in continuous integration/continuous delivery tools (Gitlab CI, Jenkins, Nexus), infrastructure security principles, Helm, CI/CD pipelines configuration, and DevOps tools like Jenkins, SonarQube, Nexus, etc. Exposure to SDLC and Agile processes, SSO integrations, and AI tools is desirable. In addition to technical skills, you should possess strong attitude, soft, and communication skills. Experience in handling technically critical situations, driving expert teams, and providing innovative solutions is essential. Critical thinking, a DevOps mindset, and customer-centric thinking are key attributes for this role. Proficiency in English (written and spoken) is mandatory, while knowledge of other languages such as German or French is a plus. If you are looking to join a dynamic team at NTT DATA Business Solutions and transform SAP solutions into value, this opportunity is for you. Get empowered by our innovative and collaborative work environment. For further inquiries regarding this position, please contact the Recruiter, Pragya Kalra, at Pragya.Kalra@nttdata.com. Join us in our mission to deliver cutting-edge IT solutions and become a part of our global success story!,
Posted 1 week ago
3.0 - 7.0 years
3 - 8 Lacs
Noida
Work from Office
We are seeking a skilled and proactive Observability Engineer to join our team. In this role, you will be responsible for configuring and implementing observability solutions, setting up performance monitoring systems and creating actionable insights to enhance the reliability, capacity, and scalability of our infrastructure. Technical Skills: Extensive knowledge and experience of Performance monitoring/Observability tool using Prometheus and Grafana Experience on Observability tools configuration, implementation, alerts setup and integrations Possess knowledge on SRE, KPIs/SLOs/Metrics for monitoring the health of application & infrastructure components Have hands on knowledge of alerting, incidents creation and dashboard creations. Hands on experience in creating single pane of view for IT & Business visualization Work with dev, platform engineering to finalize business KPI's logic for Observability Pilot, recommendations, solutions & establish success criteria. Execute business observability pilot for any identified critical user journeys. Derive implementation roadmap with milestones & continuous improvement opportunities. Role & responsibilities : Gather Performance Monitoring Requirements. Conduct system performance engineering to ensure system reliability, capacity and scalability. Generate monitoring reports for IT stakeholders review. Analyze root causes of performance issues and provide corrective actions. Suggest techniques to improve monitoring efficiency. Preferred Tools: Grafana, Prometheus, Loki, New Relic, Data Dog, App Dynamics, Tempo, Mimir etc. Why Join Us? Be part of a forward-thinking company that values reliability, efficiency, and user experience. Work in a collaborative environment that encourages continuous learning and professional growth. Competitive salary and benefits package, with opportunities for career advancement.
Posted 1 week ago
2.0 - 4.0 years
6 - 10 Lacs
Chennai, Bengaluru
Work from Office
Location: Bangalore, India Experience: 2 to 4 Years Employment Type: Full-Time Job Description: We are looking for a skilled DevOps Engineer with hands-on experience in GitLab to join our team in Bangalore. The ideal candidate should have a strong understanding of CI/CD pipelines, infrastructure automation, and cloud technologies. If you are passionate about DevOps and want to work in a dynamic and fast-paced environment, we would love to hear from you! Key Responsibilities: Customer Engagement & Implementation: Work directly with enterprise customers to understand their DevOps landscape and GitLab implementation needs. Lead the design, installation, and configuration of GitLab Self-Managed (OnPrem) environments across cloud and on-premise infrastructure. Translate customer requirements into scalable GitLab deployment architectures. CI/CD Pipeline Enablement: Architect and set up secure and scalable GitLab CI/CD pipelines aligned with customer release workflows. Integrate GitLab with third-party tools such as Kubernetes, Docker, Terraform, Jenkins, and Prometheus. Automation & Infrastructure as Code (IaC): Leverage Ansible, Terraform, and Helm charts for environment provisioning and GitLab automation. Manage GitLab runners and their configuration across distributed infrastructures. Monitoring & Optimization: Implement observability using tools like Prometheus, Loki, Grafana, and GitLab metrics dashboards. Optimize performance, ensure high availability (HA), backup, disaster recovery (DR), and auto-scaling. Knowledge Transfer & Documentation: Deliver technical documentation, operational runbooks, and knowledge transfer sessions for client upskilling. Assist clients in building internal GitLab usage guidelines, governance models, and compliance checks. Collaboration & Support: Coordinate closely with DevOps, Development, Support, and Infrastructure teams to ensure smooth rollouts and version upgrades. Troubleshoot GitLab issues including user management, access controls, LDAP/SAML integration, and runner performance Required Skills & Experience: 2 to 5 years of hands-on experience in DevOps engineering, preferably in customer-facing roles. Proven expertise in GitLab Self-Managed (OnPrem) setup, configuration, upgrade, and maintenance . Strong experience with CI/CD tools , Docker, Kubernetes, and cloud platforms (Azure, AWS, GCP). Proficiency in Infrastructure-as-Code using Terraform, Ansible, and Helm. Experience in monitoring stacks: Prometheus, Loki, Grafana, and OpenTelemetry . Working knowledge of scripting (e.g., Python, Bash ) and Linux system administration. Experience implementing GitLab RBAC, GitOps principles, and GitLab security scans is a plus Preferred Qualifications: Bachelors degree in Computer Science, Information Technology, or a related field. GitLab Certified Associate or GitLab CI/CD Specialist certification is a plus. Exposure to Agile/Scrum practices and experience leading technical deliverables. Experience in customer environments requiring high uptime and regulatory compliance. Why Join Us? • Opportunity to work on cutting-edge DevOps technologies. • Collaborative and innovative work environment. • Competitive salary and benefits. • Career growth and learning opportunities. If you are an experienced DevOps Engineer with GitLab expertise and are ready to join immediately, apply now!
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
You will be joining our client's team as a Site Reliability Engineer, where your main responsibility will be to ensure the reliability and uptime of critical services. Your focus will include Kubernetes administration, CentOS servers, Java application support, incident management, and change management. The ideal candidate for this role will have strong experience with ArgoCD for Kubernetes management, Linux skills, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for a self-motivated individual with excellent communication skills, both oral and written, who can work effectively both independently and collaboratively. Your responsibilities will include monitoring, maintaining, and managing applications on CentOS servers to ensure high availability and performance. You will be conducting routine tasks for system and application maintenance and following SOPs to correct or prevent issues. Responding to and managing running incidents, including post-mortem meetings, root cause analysis, and timely resolution will also be part of your responsibilities. Additionally, you will be monitoring production systems, applications, and overall performance, using tools to detect abnormal behaviors in the software and collecting information to help developers understand the issues. Security checks, running meetings with business partners, writing and maintaining policy and procedure documents, writing scripts or code as necessary, and learning from post-mortems to prevent new incidents are also key aspects of the role. Technical skills required for this position include: - 5+ years of experience in a SaaS and Cloud environment - Administration of Kubernetes clusters, including management of applications using ArgoCD - Linux scripting to automate routine tasks and improve operational efficiency - Experience with database systems like MySQL and DB2 - Experience as a Linux (CentOS / RHEL) administrator - Understanding of change management procedures and enforcement of safe and compliant changes to production environments - Knowledge of on-call responsibilities and maintaining on-call management tools - Experience with managing deployments using Jenkins - Prior experience with monitoring tools like New Relic, Splunk, and Nagios - Experience with log aggregation tools such as Splunk, Loki, or Grafana - Strong scripting knowledge in one of Python, Ruby, Bash, Java, or GoLang - Experience with API programming and integrating tools like Jira, Slack, xMatters, or PagerDuty If you are a dedicated professional who thrives in a high-pressure environment and enjoys working on critical services, this opportunity could be a great fit for you.,
Posted 1 week ago
3.0 - 8.0 years
6 - 12 Lacs
Gurugram
Work from Office
Location: NCR Team Type: Platform Operations Shift Model: 24x7 Rotational Coverage / On-call Support (L2/L3) Team Overview The OpenShift Container Platform (OCP) Operations Team is responsible for the continuous availability, health, and performance of OpenShift clusters that support mission-critical workloads. The team operates under a tiered structure (L2, L3) to manage day-to-day operations, incident management, automation, and lifecycle management of the container platform. This team is central to supporting stakeholders by ensuring the container orchestration layer is secure, resilient, scalable, and optimized. L2 OCP Support & Platform Engineering (Platform Analyst) Role Focus: Advanced Troubleshooting, Change Management, Automation Experience: 3–6 years Resources : 5 Key Responsibilities: Analyze and resolve platform issues related to workloads, PVCs, ingress, services, and image registries. Implement configuration changes via YAML/Helm/Kustomize. Maintain Operators, upgrade OpenShift clusters, and validate post-patching health. Work with CI/CD pipelines and DevOps teams for build & deploy troubleshooting. Manage and automate namespace provisioning, RBAC, NetworkPolicies. Maintain logs, monitoring, and alerting tools (Prometheus, EFK, Grafana). Participate in CR and patch planning cycles. L3 – OCP Platform Architect & Automation Lead (Platform SME) Role Focus: Architecture, Lifecycle Management, Platform Governance Experience: 6+ years Resources : 2 Key Responsibilities: Own lifecycle management: upgrades, patching, cluster DR, backup strategy. Automate platform operations via GitOps, Ansible, Terraform. Lead SEV1 issue resolution, post-mortems, and RCA reviews. Define compliance standards: RBAC, SCCs, Network Segmentation, CIS hardening. Integrate OCP with IDPs (ArgoCD, Vault, Harbor, GitLab). Drive platform observability and performance tuning initiatives. Mentor L1/L2 team members and lead operational best practices. Core Tools & Technology Stack Container Platform: OpenShift, Kubernetes CLI Tools: oc, kubectl, Helm, Kustomize Monitoring: Prometheus, Grafana, Thanos Logging: Fluentd, EFK Stack, Loki CI/CD: Jenkins, GitLab CI, ArgoCD, Tekton Automation: Ansible, Terraform Security: Vault, SCCs, RBAC, NetworkPolicies
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As an engineer joining Zinier's Customer Engineering team, you will be focusing on a low-code platform. Your role will involve debugging, analyzing JavaScript code, optimizing queries, solving customer-facing issues, and automating routine tasks. You will be responsible for investigating and resolving customer-reported issues in a JavaScript + JSON low-code environment. This includes identifying and fixing bugs, implementing enhancements to enhance product performance, reliability, and usability, and supporting customers globally. Additionally, you will create and maintain documentation related to program development, logic, coding, testing, and changes. Collaboration with cross-functional teams is a key aspect of this role. You will partner with customer success, solution/engineering teams to address issues promptly, provide feedback from field operations to enhance product robustness, and participate in continuous improvement cycles. You should have the ability to drive outcomes, meet delivery milestones, and coordinate effectively across multiple teams. The required skills for this role include a minimum of 3 years of experience in Solution Development or Engineering roles, a strong understanding of JavaScript, JSON handling, and API interactions, proficiency in SQL with the ability to debug query bottlenecks, familiarity with observability stacks like Grafana, Loki, Tempo, and knowledge of AWS. Desirable skills include exposure to the Field Service Management domain, experience in products with workflows, debugging algorithms related to scheduling, or working on backend systems. Joining Zinier offers a unique opportunity to work closely with Solution Architects, influence Product blueprints, and collaborate across the full tech stack. You will have the chance to work on debugging backend services in Java, Spring Boot, explore front-end interfaces in React, and contribute to mobile UI development. Additionally, you will build internal tools, address production issues, and contribute to engineering stability while learning from experienced platform, product, and solution engineers. Being part of a high-impact team at Zinier means bridging engineering and customer experience to enhance product quality and customer trust. The company values learning, ownership, and long-term growth, providing you with a rewarding environment to grow your skills and expertise.,
Posted 1 week ago
2.0 - 6.0 years
0 Lacs
telangana
On-site
You will be joining our team as a System Development Engineer focusing on the Hybrid Scientific Computing Stack. A strong background in computer science and software development is required for this role, and knowledge of quantum computing would be an added advantage. Your responsibilities will include working on backend services such as FastAPI, Celery, OAuth, PostgreSQL, and Redis. You will also be involved in hybrid job orchestration using tools like Celery, RabbitMQ, Slurm, and Kubernetes. Containerized workflows using Docker, Singularity, and Helm will be part of your tasks. Monitoring and observability tasks will involve tools like Prometheus, Grafana, Loki, and Flower. Cloud-based deployment on platforms like GCP, AWS, and Azure, as well as secure on-prem server management, will also be within your purview. Additionally, you will work on scientific environments involving CUDA, Qiskit, Conda, GROMACS, and Lmod. To qualify for this position, you should hold a minimum Bachelor's Degree in Computer Science or related fields and have at least 2 years of professional work experience in full-stack systems engineering. Proficiency in Python (FastAPI/Celery), Linux (Ubuntu/Debian), and DevOps is required. Familiarity with cloud-native tools like Docker, Kubernetes, Helm, and GitHub Actions is essential. Experience with Slurm, GPU resource allocation, and secure job execution will be beneficial. Any familiarity with quantum SDKs such as Qiskit, PennyLane, and Cirq will be considered a bonus.,
Posted 2 weeks ago
2.0 - 6.0 years
0 Lacs
telangana
On-site
You will be joining our team as a Systems Development Engineer for the Hybrid Scientific Computing Stack. A strong background in computer science and software development is essential for this role, with knowledge of quantum computing considered a valuable asset. Your responsibilities will include managing backend services such as FastAPI, Celery, OAuth, PostgreSQL, and Redis. You will also be involved in hybrid job orchestration using tools like Celery, RabbitMQ, Slurm, and Kubernetes, as well as working on containerized workflows with Docker, Singularity, Helm, and Kubernetes. Monitoring and observability tasks will involve tools like Prometheus, Grafana, Loki, and Flower. Additionally, you will be responsible for cloud-based deployment on platforms like GCP, AWS, and Azure, as well as secure on-prem server management including GPU/CPU scheduling, RBAC, and SSH-only access. Familiarity with scientific environments such as CUDA, Qiskit, Conda, GROMACS, and Lmod will also be part of your role. To qualify for this position, you should hold a minimum Bachelor's Degree in Computer Science or related fields and have at least 2 years of professional work experience in full-stack systems engineering. Proficiency in Python (FastAPI/Celery), Linux (Ubuntu/Debian), and DevOps is required. You should also be familiar with cloud-native tools like Docker, Kubernetes, Helm, and GitHub Actions. Experience with Slurm, GPU resource allocation, and secure job execution will be beneficial. Any familiarity with quantum SDKs such as Qiskit, PennyLane, and Cirq will be considered a bonus.,
Posted 2 weeks ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
As a Senior Software DevOps Engineer, you will lead the design, implementation, and evolution of telemetry pipelines and DevOps automation that enable next-generation observability for distributed systems. You will blend a deep understanding of Open Telemetry architecture with strong DevOps practices to build a reliable, high-performance, and self-service observability platform across hybrid cloud environments (AWS & Azure). Your mission is to empower engineering teams with actionable insights through rich metrics, logs, and traces, while championing automation and innovation at every layer. You will be responsible for: Observability Strategy & Implementation: Architect and manage scalable observability solutions using OpenTelemetry (OTel), encompassing Collectors, Instrumentation, Export Pipelines, Processors & Extensions for advanced enrichment and routing. DevOps Automation & Platform Reliability: Own the CI/CD experience using GitLab Pipelines, integrating infrastructure automation with Terraform, Docker, and scripting in Bash and Python. Build resilient and reusable infrastructure-as-code modules across AWS and Azure ecosystems. Cloud-Native Enablement: Develop observability blueprints for cloud-native apps across AWS (ECS, EC2, VPC, IAM, CloudWatch) and Azure (AKS, App Services, Monitor). Optimize cost and performance of telemetry pipelines while ensuring SLA/SLO adherence for observability services. Monitoring, Dashboards, and Alerting: Build and maintain intuitive, role-based dashboards in Grafana, New Relic, enabling real-time visibility into service health, business KPIs, and SLOs. Implement alerting best practices integrated with incident management systems. Innovation & Technical Leadership: Drive cross-team observability initiatives that reduce MTTR and elevate engineering velocity. Champion innovation projects including self-service observability onboarding, log/metric reduction strategies, AI-assisted root cause detection, and more. Mentor engineering teams on instrumentation, telemetry standards, and operational excellence. Requirements: - 6+ years of experience in DevOps, Site Reliability Engineering, or Observability roles - Deep expertise with OpenTelemetry, including Collector configurations, receivers/exporters (OTLP, HTTP, Prometheus, Loki), and semantic conventions - Proficient in GitLab CI/CD, Terraform, Docker, and scripting (Python, Bash, Go). Strong hands-on experience with AWS and Azure services, cloud automation, and cost optimization - Proficiency with observability backends: Grafana, New Relic, Prometheus, Loki, or equivalent APM/log platforms - Passion for building automated, resilient, and scalable telemetry pipelines - Excellent documentation and communication skills to drive adoption and influence engineering culture Nice to Have: - Certifications in AWS, Azure, or Terraform - Experience with OpenTelemetry SDKs in Go, Java, or Node.js - Familiarity with SLO management, error budgets, and observability-as-code approaches - Exposure to event streaming (Kafka, RabbitMQ), Elasticsearch, Vault, Consul,
Posted 2 weeks ago
1.0 - 5.0 years
0 Lacs
chandigarh
On-site
You will be a part of our team as a Junior DevOps Engineer, where you will contribute to building, maintaining, and optimizing our cloud-native infrastructure. Your role will involve collaborating with senior DevOps engineers and development teams to automate deployments, monitor systems, and ensure the high availability, scalability, and security of our applications. Your key responsibilities will include managing and optimizing Kubernetes (EKS) clusters, Docker containers, and Helm charts for deployments. You will support CI/CD pipelines using tools like Jenkins, Bitbucket, and GitHub Actions, and help deploy and manage applications using ArgoCD for GitOps workflows. Monitoring and troubleshooting infrastructure will be an essential part of your role, utilizing tools such as Grafana, Prometheus, Loki, and OpenTelemetry. Working with various AWS services like EKS, ECR, ALB, EC2, VPC, S3, and CloudFront will also be a crucial aspect to ensure reliable cloud infrastructure. Automating infrastructure provisioning using IaC tools like Terraform and Ansible will be another key responsibility. Additionally, you will assist in maintaining Docker image registries and collaborate with developers to enhance observability, logging, and alerting while adhering to security best practices for cloud and containerized environments. To excel in this role, you should have a basic understanding of Kubernetes, Docker, and Helm, along with familiarity with AWS cloud services like EKS, EC2, S3, VPC, and ALB. Exposure to CI/CD tools such as Jenkins, GitHub/Bitbucket pipelines, basic scripting skills (Bash, Python, or Groovy), and knowledge of observability tools like Prometheus, Grafana, and Loki will be beneficial. Understanding GitOps (ArgoCD) and infrastructure as code (IaC), experience with Terraform/CloudFormation, and knowledge of Linux administration and networking are also required skills. This is a full-time position that requires you to work in person. If you are interested in this opportunity, please feel free to reach out to us at +91 6284554276.,
Posted 2 weeks ago
10.0 - 15.0 years
20 - 30 Lacs
Mumbai, Powai
Work from Office
Notice period : Immediate to 30 days, currently serving Notice period Job Responsibilities: Engineer and automate various database platforms and services. Assist in the ongoing process of rationalizing the technology and usage of databases. Participate in the creation and implementation of operational policies, procedures & documentation. Database Administration and Production support for databases hosted on private cloud across all regions. Database version Upgrades and Security patching. Performance Tuning. Database replication administration. Collaborate with development teams and utilize coding skills to design and implement database solutions for new and existing applications. Willing to work in the weekend and non-of f ice hours as part of wider scheduled support group. Willingness to learn and adapt to new technologies and methodologies. Required Skills Mandatory The candidate must have the following skills and experience: 10 + years of experience in MSSQL DBA administration Proven ability to navigate Linux operating systems and utilize command -line tools prof iciently. Clear understanding on MS SQL availability group Exposure in scripting languages like Python and automation tools like Ansible. Have a proven effective and efficient troubleshooting skill set. Ability to cope well under pressure. Strong Organization Skills and Practical Sense Quick and Eager to Learn and explore both Technical and Semi -Technical work types Engineering Mindset Preferred Skills Experience / Knowledge of the following will be added advantage (but not mandatory): Experience in MySQL and Oracle Experience in Infrastructure Automation Development Experience with monitoring systems and log management/reporting tools (e.g.Loki, Grafana, Splunk).
Posted 2 weeks ago
7.0 - 12.0 years
10 - 15 Lacs
Pune
Work from Office
Sarvaha would like to welcome a skilled Observability Engineer with a minimum of 7 years of experience to contribute to designing, deploying, and scaling our monitoring and logging infrastructure on Kubernetes. In this role, you will play a key part in enabling end-to-end visibility across cloud environments by processing Petabyte data scales, helping teams enhance reliability, detect anomalies early, and drive operational excellence. Sarvaha is a niche software development company that works with some of the best funded startups and established companies across the globe. What Youll Do : - Configure and manage observability agents across AWS, Azure & GCP. - Use IaC techniques and tools such as Terraform, Helm & GitOps, to automate deployment of Observability stack. - Experience with different language stacks such as Java, Ruby, Python and Go. - Instrument services using OpenTelemetry and integrate telemetry pipelines. - Optimize telemetry metrics storage using time-series databases such as Mimir & NoSQL DBs. - Create dashboards, set up alerts, and track SLIs/SLOs. - Enable RCA and incident response using observability data. - Secure the observability pipeline. You Bring : - BE/BTech/MTech (CS/IT or MCA), with an emphasis in Software Engineering. - Strong skills in reading and interpreting logs, metrics, and traces. - Proficiency with LGTM (Loki, Grafana, Tempo, Mimir) or similar stack, Jaeger, Datadog, Zipkin, InfluxDB etc. - Familiarity with log frameworks such as log4j, lograge, Zerolog, loguru etc. - Knowledge of OpenTelemetry, IaC, and security best practices. - Clear documentation of observability processes, logging standards & instrumentation guidelines. - Ability to proactively identify, debug, and resolve issues using observability data. - Focused on maintaining data quality and integrity across the observability pipeline.
Posted 2 weeks ago
10.0 - 14.0 years
0 Lacs
karnataka
On-site
As a Senior Software DevOps Engineer, you will be responsible for leading the design, implementation, and evolution of telemetry pipelines and DevOps automation to enable next-generation observability for distributed systems. Your main focus will be on leveraging a deep understanding of Open Telemetry architecture along with strong DevOps practices to construct a reliable, high-performance, and self-service observability platform that spans hybrid cloud environments such as AWS and Azure. Your primary goal will be to provide engineering teams with actionable insights through rich metrics, logs, and traces while promoting automation and innovation at all levels. In your role, you will be involved in the following key activities: Observability Strategy & Implementation: - Design and manage scalable observability solutions using OpenTelemetry (OTel), including deploying OTel Collectors for ingesting and exporting telemetry data, guiding teams on instrumentation best practices, building telemetry pipelines for data routing, and utilizing processors and extensions for advanced enrichment and routing. DevOps Automation & Platform Reliability: - Take ownership of the CI/CD experience using GitLab Pipelines, integrate infrastructure automation with Terraform, Docker, and scripting in Bash and Python, and develop resilient and reusable infrastructure-as-code modules across AWS and Azure ecosystems. Cloud-Native Enablement: - Create observability blueprints for cloud-native applications on AWS and Azure, optimize cost and performance of telemetry pipelines, and ensure SLA/SLO adherence for observability services. Monitoring, Dashboards, and Alerting: - Build and maintain role-based dashboards in tools like Grafana and New Relic for real-time visibility into service health and business KPIs, implement alerting best practices, and integrate with incident management systems. Innovation & Technical Leadership: - Drive cross-team observability initiatives to reduce MTTR and enhance engineering velocity, lead innovation projects such as self-service observability onboarding and AI-assisted root cause detection, and mentor engineering teams on telemetry standards and operational excellence. Qualifications and Skills: - 10+ years of experience in DevOps, Site Reliability Engineering, or Observability roles - Deep expertise with OpenTelemetry, GitLab CI/CD, Terraform, Docker, and scripting languages (Python, Bash, Go) - Hands-on experience with AWS and Azure services, cloud automation, and cost optimization - Proficiency with observability backends such as Grafana, New Relic, Prometheus, and Loki - Strong passion for building automated, resilient, and scalable telemetry pipelines - Excellent documentation and communication skills to drive adoption and influence engineering culture Nice to Have: - Certifications in AWS, Azure, or Terraform - Experience with OpenTelemetry SDKs in Go, Java, or Node.js - Familiarity with SLO management, error budgets, and observability-as-code approaches - Exposure to event streaming technologies (Kafka, RabbitMQ), Elasticsearch, Vault, and Consul,
Posted 2 weeks ago
2.0 - 6.0 years
0 Lacs
pune, maharashtra
On-site
You will be joining as a talented SDE1 - DevOps Engineer with the exciting opportunity to contribute towards building a top-notch DevOps infrastructure that can scale to accommodate the next 100M users. As an ideal candidate, you will be expected to tackle a variety of challenges with enthusiasm and take full ownership of your responsibilities. Your main responsibilities will include running a highly available Cloud-based software product on AWS, designing and implementing new systems in close collaboration with the Software Development team, setting up and maintaining CI/CD systems, and automating the deployment of software. You will also be tasked with continuously enhancing the security posture and operational efficiency of the Amber platform, as well as optimizing the operational costs. To excel in this role, you should possess 2-3 years of experience in a DevOps / SRE role, with a minimum of 2 years. You must have hands-on experience with AWS services such as ECS, EKS, RDS, Elasticache, and CloudFront, as well as familiarity with Google Cloud Platform. Proficiency in Infrastructure as Code tools like Terraform, CI/CD tools like Jenkins and GitHub Actions, and scripting languages such as Python and Bash is essential. Additionally, you should have a strong grasp of SCM in GitHub, networking concepts, and experience with observability and monitoring tools like Grafana, Loki, Prometheus, and ELK. Prior exposure to On-Call Rotation and mentoring junior DevOps Engineers would be advantageous. While not mandatory, knowledge of NodeJS and Ruby, including their platforms and workflows, would be considered a plus for this role.,
Posted 2 weeks ago
3.0 - 6.0 years
12 - 22 Lacs
Gurugram, Bengaluru, Mumbai (All Areas)
Work from Office
In the role of a DevOps Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and CI/CD pipelines necessary to support our Generative AI projects. Furthermore, you will have the opportunity to critically assess and influence the engineering design, architecture, and technology stack across multiple products, extending beyond your immediate focus. - Design, deploy, and manage scalable, reliable, and secure Azure cloud infrastructure to support Generative AI workloads. - Implement monitoring, logging, and alerting solutions to ensure the health and performance of AI applications. - Optimize cloud resource usage and costs while ensuring high performance and availability. - Work closely with Data Scientists and Machine Learning Engineers to understand their requirements and provide the necessary infrastructure and tools. - Automate repetitive tasks, configuration management, and infrastructure provisioning using tools like Terraform, Ansible, and Azure Resource Manager (ARM). - Utilize APM (Application Performance Monitoring) to identify and resolve performance bottlenecks Maintain comprehensive documentation for infrastructure, processes, and workflows. Must Have Skills: - Extensive knowledge of Azure services: Kubernetes, Azure App Service, Azure API management(APIM), Application gateway, AAD, GitHub Action, Istio, Datadog, Proficiency in containerization and orchestration tools such as (Jenkins, GitLab CI/CD, Azure DevOps) - Knowledge of API management platforms like APIM for API governance, security, and lifecycle management. - Expertise in monitoring and observability tools like Datadog, loki, grafana, prometheus for comprehensive monitoring, logging, and alerting solutions. Good scripting skills (Python, Bash, PowerShell). - Experience with infrastructure as code (Terraform, ARM Templates). - Experience in optimizing cloud resource usage and costs utilizing insights from Azure cost and monitor metrics.
Posted 3 weeks ago
5.0 - 8.0 years
6 - 16 Lacs
Hyderabad, Bengaluru, Mumbai (All Areas)
Work from Office
Job Title : DevOps Engineer Location: Mumbai/Bangalore/Chennai/Delhi NCR/Hyderabad Experience Required: 5+ Years Job Description Key Responsibilities: • Implement and maintain the cloud infrastructure • Ensure the smooth operation of environment • Evaluate new technologies in the field of infrastructure automation and cloud computing • Look for opportunities to improve performance, reliability and automation • Provide DevOps capability to team mates and customers • Perform code deployments • Release management activities • Resolve incidents and change requests • Document Solutions and communicate it to the users • Perform optimizations on existing solutions • Diagnose, troubleshoot, and resolve ensuring smooth operation of services. • Shows attitude and aptitude for owning responsibility of own work done and collaborate with other team member in their activities • Updates job knowledge by self-learning or participating in learning initiatives provided by organization Required Skills & Qualifications: • Bachelors degree in IT, computer science, computer engineering, or similar • 6 years of Overall experience with 3+years as Devops Engineer • Advanced experience with Cloud Infrastructure / Cloud Services (preferable on Microsoft Azure) • Container Orchestration (Kubernetes, docker ,Helm) • Experience with Linux incl. Scripting (Bash, Python) • Log and metrics management (ELK Stack), Monitoring (Prometheus,loki,Grafana,dynatrace) • Infrastructure as code / Deployment and configuration automation ( Terraform) • Continuous Integration / Continuous Delivery (Gitlab CI, Jenkins, Nexus etc.) • Infrastructure Security Principles • Advance experience in Helm and CI/CD pipelines • Advance Experience in configuration of DevOps Tools such as Jenkins ,sonarqube, Nexus etc • Exposure to SDLC & Agile process • Experience with SSO integrations • Knowledgeable on AI tools & efficient usage in day to day work • Attitude, Soft & Communication Skills • Experience in handling technically critical escalated situations, drive team of experts & come-up with best-in-class workarounds / solutions • Critical thinking generated by observation, experience, reflection, reasoning, and communication. • DevOps mindset (you build it, you run it; taking e2e responsibility and accountability) • Able to demonstrate how customer centric thinking is expressed and reinforced through the digital product design processes • Fluent English (written and spoken) is a must, other languages (e.g. German, French, etc.) are a plus. Nice to Have: • Knowledge in python • Databases (e.g. PostgreSQL, Elasticsearch)
Posted 3 weeks ago
3.0 - 6.0 years
15 - 25 Lacs
Bengaluru
Work from Office
The Opportunity Are you a self-starter with a strong background in UI development, automation, and cloud technologies, who thrives in a collaborative environment? If so, youll find an exciting opportunity on our team, where youll engage in innovative projects, deliver impactful demos, and work closely with diverse experts to drive real-world customer outcome solutions. This team strives to promote continuous learning and growth in a flexible and supportive culture. About the Team The team for this role is part of the Solutions & Performance Engineering organization within R&D at Nutanix, a global organization which operates out of various geographic locations. The team is known for its collaborative culture, where innovation and continuous learning are highly valued. The mission of the Solutions & Performance Engineering team is to engage customers on their technological and business challenges and leverage advanced technologies to develop impactful solutions, and provide efficient, seamless automation processes for clients worldwide. Your Role We are seeking a highly skilled Front-End Engineer to design, build, and optimize user interfaces with a focus on scalability and efficiency , that empower our engineering teams with deep insights into system performance. This role is ideal for someone with strong React.js expertise, a passion for building high-performing UIs, and a problem-solving mindset. Youll work closely with backend engineers and infrastructure teams to develop dashboards, integrate with APIs, and automate the visualization of complex data. Your work will help drive decisions, detect performance regressions, and streamline infrastructure automation workflows . 1. UI/UX Design & Front-End Development Build scalable and responsive front-end applications using React.js . Optimize UI/UX by managing cookies, caching , and performance tuning for large-scale apps (1,000+ pages). Revamp and modernize legacy front-end codebases for better maintainability and performance. Integrate with microservices-based backend architectures to ensure seamless data flow. Collaborate with design teams to create intuitive and visually appealing user interfaces. 2. Data Visualization & Insights Generation Develop interactive dashboards to visualize system performance trends and analytics. Work with APIs and performance benchmarks to translate backend data into actionable visual insights. Collaborate with backend engineers to define and optimize API contracts for UI needs. Utilize tools like Figma for UI design and translate wireframes into high-quality front-end components. What You Will Bring Required Skills & Experience: Proficiency in React.js , JavaScript, and front-end architecture. Strong experience with UI/UX design principles and tools such as Figma . Familiarity with REST APIs and microservices integration. Version control with Git ; experience in CI/CD pipelines , Docker , and Kubernetes . Experience building UIs that scale and perform efficiently under large data loads. Soft Skills & Qualities: Problem Solver: Can troubleshoot complex issues and design innovative, scalable solutions. Effective Communicator: Comfortable explaining technical concepts to both engineers and non-technical stakeholders. Team Player: Works well across teams and contributes to a collaborative, solution-oriented environment. Self-Starter: Independent learner who adapts quickly to new technologies and challenges. Detail-Oriented: Produces high-quality, efficient, and reliable code. Accountable: Takes ownership of tasks and delivers end-to-end solutions. Organized: Strong time management and prioritization skills in fast-paced environments. Preferred / Bonus Skills: Experience with distributed systems and cloud-native architectures . Familiarity with observability tools (e.g., Prometheus, Grafana, Loki, Jaeger, ELK stack). Background in cloud infrastructure automation using AWS, Azure, GCP, or OpenStack. Hands-on with infrastructure as code and workload orchestration tools like Terraform , Ansible , or Kubernetes
Posted 3 weeks ago
3.0 - 6.0 years
10 - 14 Lacs
Bengaluru
Hybrid
Hi all , we are looking for a role DevOps Engineer experience : 3 - 6 years notice period : Immediate - 15 days location : Bengaluru Description: Job Title: DevOps Engineer with 4+ years experience Job Summary We're looking for a dynamic DevSecOps Engineer to lead the charge in embedding security into our DevOps lifecycle. This role focuses on implementing secure, scalable, and observable cloud-native systems, leveraging Azure, Kubernetes, GitHub Actions, and security tools like Black Duck, SonarQube, and Snyk. Key Responsibilities • Architect, deploy, and manage secure Azure infrastructure using Terraform and Infrastructure as Code (IaC) principles • Build and maintain CI/CD pipelines in GitHub Actions, integrating tools such as Black Duck, SonarQube, and Snyk • Operate and optimize Azure Kubernetes Service (AKS) for containerized applications • Configure robust monitoring and observability stacks using Prometheus, Grafana, and Loki • Implement incident response automation with PagerDuty • Manage and support MS SQL databases and perform basic operations on Cosmos DB • Collaborate with development teams to promote security best practices across SDLC • Identify vulnerabilities early and respond to emerging security threats proactively Required Skills • Deep knowledge of Azure Services, AKS, and Terraform • Strong proficiency with Git, GitHub Actions, and CI/CD workflow design • Hands-on experience integrating and managing Black Duck, SonarQube, and Snyk • Proficiency in setting up monitoring stacks: Prometheus, Grafana, and Loki • Familiarity with PagerDuty for on-call and incident response workflows • Experience managing MSSQL and understanding Cosmos DB basics • Strong scripting ability (Python, Bash, or PowerShell) • Understanding of DevSecOps principles and secure coding practices • Familiarity with Helm, Bicep, container scanning, and runtime security solutions
Posted 1 month ago
2.0 - 4.0 years
4 - 9 Lacs
Bengaluru
Work from Office
Skills Required: Technical areas (hands-on experience in academic projects/internships) Experience with Kubernetes, Jenkins, Gitlab, Github, CI/CD, Terraform, Linux, Bash, Python, AWS, GCP, GKE, and EKSUnderstanding of Public/Private/Hybrid Cloud Solutions. Own the responsibility for platform management, supporting services, and all related tooling and automation. Proficient in cloud-native technologies, automation, and containerization. Experience in setting up and managing cloud infrastructure and services for a wide range of Applications. Some experience in ReactJS / NodeJS, PHP, Phyton and UNIX shell,so a background in system- oriented languages is important. Managing and deploying cloud-native applications on Kubernetes clusters, Setting CI/CD pipelines in (Jenkins, Gitlab, Github), Databases Migration (MySQL, Postgresql, Cassandra), Setting up Monitoring (Grafana, Loki, Prometheus, Mimir, ELK Stack). Certified in Kubernetes and Jenkins.Experienced in using Terraform to automate infrastructure provisioning. We are looking for bright, passionate, and dedicated people with clearly demonstrated initiative and a history of success in their past positions to join our growing team.
Posted 1 month ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough