Home
Jobs
Companies
Resume

628 Prometheus Jobs - Page 7

Filter
Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 10.0 years

22 - 26 Lacs

Hyderabad

Work from Office

Naukri logo

Skillsoft is the global leader in eLearning. Trusted by the world's leading organizations, including 65% of the Fortune 500. Our 100,000+ courses, videos and books are accessed over 100 million times every month, across more than 100 countries. At Skillsoft, we believe knowledge is the fuel for innovation and innovation is the fuel for business growth. Join us in our quest to democratize learning and help individuals unleash their edge. Are you ready to shape the future of learning through cutting-edge AI? As a Principal AI/Machine Learning Engineer at Skillsoft, you’ll dive into the heart of innovation, crafting intelligent systems that empower millions worldwide. From designing generative AI solutions to pioneering agentic workflows, you’ll collaborate with multiple teams to transform knowledge into a catalyst for growth—unleashing your edge while helping others do the same. Join us in redefining eLearning for the world’s leading organizations! Responsibilities: Hands-on AI/ML software engineer Prompt engineering, agentic workflow development and testing Work with product owners to understand requirements and guide new features Collaborate to identify new feature impacts Evaluate new AI/ML technology advancements and socialize finding Research, prototype, and select appropriate COTS and develop in-house AI/ML technology Consult with external partners to review and guide development and integration of AI technology Collaborate with teams to design, and guide AI development, and enhancements Document designs and implementation to ensure consistency and alignment with standards Create documentation including system and sequence diagrams Create appropriate data pipelines for AI/ML training and inference Analyze, curate, cleanse, and preprocess data Utilize and apply generative AI to increase productivity for yourself and the organization Periodically explore new technologies and design patterns with proof-of-concept Participate in developing best practices and improving operational processes Present research and work to socialize and share knowledge across the organization Contribute to patentable AI innovations Environment, Tools & Technologies: Agile/Scrum Operating Systems – Mac, Linux JavaScript, Node.js, Python PyTorch, Tensorflow, Keras, OpenAI, Anthropic, and friends Langchain, Langgraph, etc. APIs GraphQL, REST Docker, Kubernetes Amazon Web Services (AWS), MS Azure SQL: Postgres RDS NoSQL: Cassandra, Elasticsearch (VectorDb) Messaging – Kafka, RabbitMQ, SQS Monitoring – Prometheus, ELK GitHub, IDE (your choice) Skills & Qualifications: (8+ years experience) Experience with LLMs and fine-tuning models Development experience including unit testing Design and documentation experience of new APIs, data models, service interactions Familiarity with and ability to explain: o system and API security techniques o data privacy concerns o microservices architecture o vertical vs horizontal scaling o Generative AI, NLP, DNN, auto-encoders, etc. Attributes for Success: Proactive, Independent, Adaptable Collaborative team player Customer service minded with an ownership mindset Excellent analytic and communication skills Ability and desire to coach and mentor other developers Passionate, curious, open to new ideas, and ability to research and learn new technologies

Posted 2 weeks ago

Apply

8.0 - 10.0 years

40 - 45 Lacs

Hyderabad

Work from Office

Naukri logo

Skillsoft is the global leader in eLearning. Trusted by the world's leading organizations, including 65% of the Fortune 500. Our 100,000+ courses, videos and books are accessed over 100 million times every month, across more than 100 countries. At Skillsoft, we believe knowledge is the fuel for innovation and innovation is the fuel for business growth. Join us in our quest to democratize learning and help individuals unleash their edge. Are you ready to shape the future of learning through cutting-edge AI? As a Principal AI/Machine Learning Engineer at Skillsoft, you’ll dive into the heart of innovation, crafting intelligent systems that empower millions worldwide. From designing generative AI solutions to pioneering agentic workflows, you’ll collaborate with multiple teams to transform knowledge into a catalyst for growth—unleashing your edge while helping others do the same. Join us in redefining eLearning for the world’s leading organizations! Responsibilities: Hands-on AI/ML software engineer Prompt engineering, agentic workflow development and testing Work with product owners to understand requirements and guide new features Collaborate to identify new feature impacts Evaluate new AI/ML technology advancements and socialize finding Research, prototype, and select appropriate COTS and develop in-house AI/ML technology Consult with external partners to review and guide development and integration of AI technology Collaborate with teams to design, and guide AI development, and enhancements Document designs and implementation to ensure consistency and alignment with standards Create documentation including system and sequence diagrams Create appropriate data pipelines for AI/ML training and inference Analyze, curate, cleanse, and preprocess data Utilize and apply generative AI to increase productivity for yourself and the organization Periodically explore new technologies and design patterns with proof-of-concept Participate in developing best practices and improving operational processes Present research and work to socialize and share knowledge across the organization Contribute to patentable AI innovations Environment, Tools & Technologies: Agile/Scrum Operating Systems – Mac, Linux JavaScript, Node.js, Python PyTorch, Tensorflow, Keras, OpenAI, Anthropic, and friends Langchain, Langgraph, etc. APIs GraphQL, REST Docker, Kubernetes Amazon Web Services (AWS), MS Azure SQL: Postgres RDS NoSQL: Cassandra, Elasticsearch (VectorDb) Messaging – Kafka, RabbitMQ, SQS Monitoring – Prometheus, ELK GitHub, IDE (your choice) Skills & Qualifications: (8+ years experience) Experience with LLMs and fine-tuning models Development experience including unit testing Design and documentation experience of new APIs, data models, service interactions Familiarity with and ability to explain: o system and API security techniques o data privacy concerns o microservices architecture o vertical vs horizontal scaling o Generative AI, NLP, DNN, auto-encoders, etc. Attributes for Success: Proactive, Independent, Adaptable Collaborative team player Customer service minded with an ownership mindset Excellent analytic and communication skills Ability and desire to coach and mentor other developers Passionate, curious, open to new ideas, and ability to research and learn new technologies

Posted 2 weeks ago

Apply

6.0 - 8.0 years

13 - 17 Lacs

Noida, Hyderabad, Chennai

Hybrid

Naukri logo

Role & responsibilities: Design, deploy, and maintain AWS infrastructure using infrastructure as code (IAC) using tools such as Terraform and CloudFormation Build and deploy applications in a repetitive and automated way Design and implement serverless architecture using AWS services such as Lambda, API Gateway, DynamoDB, S3, and others Monitor, troubleshoot, and optimize performance of cloud-based applications using monitoring and analytics tools such as New Relic, Grafana and Prometheus Collaborate with development teams to ensure the reliability, scalability, and security of our systems Automate processes using CI/CD tools such as Azure DevOps, TeamCity or Jenkins. Implement security best practices and ensure compliance with regulatory requirements Continuously improve our infrastructure and processes to meet evolving business needs and technology trends Mandatory Skills: 6+ years of experience in a DevOps role, with a focus on AWS services and infrastructure as code Experience with Terraform or other IaC tools such as CloudFormation or CDK Strong understanding of serverless architectures, microservices, and containerization using Kubernetes or other container orchestration tools Experience with monitoring and analytics tools such as Grafana, Prometheus, and New Relic Familiarity with CI/CD tools such as Azure DevOps, Jenkins, GitLab, or CircleCI Proficient in at least one scripting language (Bash, Python, JavaScript) Proficiency with Linux administration/engineering Deep understanding of cloud-scale and micro/macro-services architectures, experience in operating high performance, highly scalable, and fault-tolerant multi-tenant SaaS based applications. Strong problem-solving skills and the ability to troubleshoot issues in a complex environment. Excellent communication and collaboration skills to work effectively with cross-functional teams. A passion for continuous learning and keeping up with the latest technology trends in the DevOps and cloud computing space. Preferred candidate profile: Looking for immediate joiners minimum 15days PF history mandatory for all companies

Posted 2 weeks ago

Apply

4.0 - 8.0 years

5 - 15 Lacs

Bengaluru

Work from Office

Naukri logo

Azure Monitor, Application Insights, Log Analytics Prometheus / Datadog / Dynatrace Grafana, Power BI Python, REST API Required Skills Network Watcher, Databricks Logs, System tables, REST API Bash, Powershell

Posted 2 weeks ago

Apply

10.0 - 15.0 years

12 - 17 Lacs

Hyderabad

Work from Office

Naukri logo

DevOps Manager - J49058 Job Summary We are looking for an experienced DevOps Manager with 10+ years of experience to lead our DevOps initiatives across AWS and GCP platforms. The ideal candidate will have expertise in cloud migration, CDN deployment, infrastructure automation, and stakeholder reporting. This role requires managing a team of 6 mid-level DevOps engineers and ensuring high availability, security, and scalability of our cloud infrastructure. Key Responsibilities Cloud & Infrastructure Management Manage and optimize cloud infrastructure on AWS and GCP. Lead cloud migration projects from on-premise or other cloud environments. Deploy and manage CDN solutions for improved performance and scalability. Ensure cost optimization, high availability, and disaster recovery best practices. Infrastructure Automation & CI/CD Implement Infrastructure as Code (IaC) using Terraform, Ansible, or similar tools. Automate deployment pipelines using CI/CD tools (GitHub Actions, Jenkins, AWS DevOps, or Google Cloud Build). Drive DevOps best practices, including containerization (Docker, Kubernetes) and serverless architectures. Monitoring, Security & Compliance Set up logging, monitoring, and alerting using tools like Prometheus, Grafana, AWS Monitor, and GCP Stackdriver. Ensure security best practices, including identity management, access controls, and compliance with industry standards. Conduct periodic security audits and vulnerability assessments. Stakeholder Communication & Reporting Prepare and send detailed reports on system performance, uptime, cost, and incidents to all stakeholders. Work closely with engineering, product, and security teams to align DevOps strategies with business goals. Maintain documentation for infrastructure, processes, and best practices. Team Leadership & Collaboration Lead and mentor a team of 6 mid-level DevOps/SRE engineers. Conduct training and knowledge-sharing sessions to upskill the team. Establish KPIs and performance metrics to track team progress and efficiency. Required Skills & Experience 10+ years of DevOps experience, with at least 3 years in a leadership role. Hands-on experience with AWS and GCP cloud platforms. Expertise in cloud migration & CDN deployment (e.g., Cloudflare, Akamai, CloudFront, AWS CDN). Strong knowledge of Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation. Experience with CI/CD pipelines (AWS DevOps, Jenkins, GitHub Actions, GCP Cloud Build). Proficiency in Kubernetes, Docker, and container orchestration. Strong monitoring and logging skills using AWS Monitor, GCP Stackdriver, Prometheus, Grafana, Splunk. Excellent communication skills for stakeholder reporting and cross-functional collaboration. Ability to lead and mentor a team, ensuring high efficiency and skill growth. Nice to Have Experience with multi-cloud environments (AWS, GCP, ). Knowledge of serverless architectures (AWS Functions, Google Cloud Functions). Familiarity with FinOps for cloud cost management. Location & Work Mode Location:Hyderabad/Bangalore Work Mode: Office Why Join Us- Opportunity to work with cutting-edge cloud technologies and automation. Lead a talented team and drive impactful cloud transformation projects. Competitive salary, benefits, and career growth opportunities. Required Candidate profile Candidate Experience Should Be : 10 To 15 Candidate Degree Should Be : BA,BBA,BBA/BMS,BBI,BCA,BCom,BCS,BDES,BE-Comp/IT,BEd,BE-Other,BFA,BFM,BIS,BIT,BMS,BSc-Comp/IT,BSc-Other,BTech-Comp/IT,BTech-Other,CA,CS,DCA,DCS,DE-Comp/IT,DE-Other,Diploma,ICWA,LLB,MA,MBA,MBBS,MCA,MCM,MCom,MCS,ME-Comp/IT,ME-Other,MIS,MIT,MMS,MSc-Comp/IT,MS-Comp/IT,MSc-Other,MS-Other,MTech-Comp/IT

Posted 2 weeks ago

Apply

7.0 - 12.0 years

27 - 35 Lacs

Pune

Work from Office

Naukri logo

Key Responsibilities: OpenShift Platform Engineering: Deploy, manage, and maintain applications on OpenShift Container Platform. Configure and manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Manage Red Hat Data Grid deployments and integrations. Support OCP cluster upgrades, patching, and troubleshooting. CI/CD Implementation & Automation: Design, implement, and manage CI/CD pipelines using Jenkins and Argo CD. Ensure seamless code integration, testing, and deployment processes with development teams. Infrastructure as Code (IaC): Automate infrastructure provisioning with tools like Terraform and Ansible. Manage hybrid infrastructure across on-prem and public clouds (AWS, Azure, or GCP). Monitoring & Performance Optimization: Implement and manage observability stacks (Prometheus, Grafana, ELK, etc.) for OCP and underlying services. Proactively identify and resolve system performance bottlenecks. Security & Compliance: Enforce security best practices in containerized and cloud environments. Conduct vulnerability assessments and ensure compliance with industry standards. Collaboration & Support: Collaborate with developers, QA, and IT teams to optimize DevOps workflows. Provide ongoing support and incident response for production and non-production environments. Required Skills & Qualifications: Technical Skills: Strong hands-on experience with OpenShift (v4.x) administration and operations. Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Deep understanding of Kubernetes, Docker, and container orchestration. Experience with Red Hat Data Grid or other in-memory data grids. Skilled in IaC tools: Terraform, Ansible, CloudFormation. Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk). Proficient in scripting languages: Bash, Python, or Shell. Soft Skills: Excellent problem-solving and analytical skills. Strong communication and collaboration abilities across cross-functional teams. Candidates should be able to work independently. Candidate should be able to provide solution based on customer requirements and work with customers DevOps team during the project implementation.

Posted 2 weeks ago

Apply

5.0 - 9.0 years

12 - 15 Lacs

Pune

Hybrid

Naukri logo

We are hiring "Sr. Devops Engineer" for one of out Product based MNC @Pune EX-5-10 Years Mode-Permanent Work Mode-Hybrid Mandatory Siklls- *Experience in monitoring and troubleshooting of infratsructure and applications. Experience with Cloud platform - AWS / Azure / Google Scripting languages like Bash, Python, or Perl. Hands-on with CI/CD tools (Jenkins, GitHub / GitLab). Familiarity with Docker or similar tool for containerization and Kubernetes or similar tool for orchestration. Knowledge / experience on deployment & config. management tools like ansible, puppet or similar tool.Knowledge of monitoring tools such as AppDynamics, Splunk Basic understanding of security practices integrated into DevOps workflows. Relevant certifications like AWS Certified DevOps Engineer or Docker Certified Associate are beneficial. Ability to handle multiple tasks, prioritize, and work under pressure Ability to learn and apply new skills and processes quickly.

Posted 2 weeks ago

Apply

6.0 - 8.0 years

40 - 50 Lacs

Mumbai, Pune

Hybrid

Naukri logo

Congratulations, you have taken the first step towards bagging a career-defining role. Join the team of superheroes that safeguard data wherever it goes. What should you know about us? Seclore protects and controls digital assets to help enterprises prevent data theft and achieve compliance. Permissions and access to digital assets can be granularly assigned and revoked, or dynamically set at the enterprise-level, including when shared with external parties. Asset discovery and automated policy enforcement allow enterprises to adapt to changing security threats and regulatory requirements in real-time and at scale. Know more about us at www.seclore.com You would love our tribe: If you are a risk-taker, innovator, and fearless problem solver who loves solving challenges of data security, then this is the place for you! Role: Lead Product Engineer - Developer Productivity Experience: 6 - 8 Years Location: Mumbai/Pune A sneak peek into the role: We are seeking a highly motivated and experienced Lead, Developer Productivity & Platform Engineering to spearhead our efforts in building, scaling, and continuously improving our internal developer platform. In this critical role, you will be responsible for empowering our development teams with the tools, infrastructure, and processes necessary to achieve exceptional productivity, accelerate software delivery, and enhance their overall experience. You will driving the vision, strategy, and execution of our IDP initiatives, with a strong focus on measuring and improving developer effectiveness. Here's what you will get to explore: Leadership: This role blends the responsibilities of an individual contributor with the need to lead a team as the practice grows. While the primary focus is on individual contributions and expertise, the role also requires guiding, mentoring, and coordinating the work of others. Foster a collaborative, innovative, and results-oriented team culture. Define clear roles, responsibilities, and performance expectations for team members. Platform Vision, Strategy & Roadmap: Define and articulate a clear vision, strategy, and roadmap for our internal developer platform (IDP), aligning with overall engineering and business objectives. Identify and prioritize key features and improvements for the IDP based on developer needs and productivity goals. Stay abreast of industry trends and emerging technologies in platform engineering, developer experience, and IDPs (e.g., Backstage). Collaboration & Stakeholder Management: Work closely with application development teams, product managers, security teams, operations, and other stakeholders to understand their pain points, needs, and requirements for the IDP. Effectively communicate the value and progress of the IDP to both technical and non-technical audiences. IDP Design, Development & Maintenance: Lead the design, development, and maintenance of core components of our internal developer platform, emphasizing self-service capabilities, automation, standardization, and a seamless developer experience. Drive the adoption of Infrastructure as Code (IaC), Continuous Integration/Continuous Delivery (CI/CD), and robust observability practices within the platform. Ensure the IDP is scalable, reliable, secure, and cost-effective. Focus on Developer Productivity & Measurement: Define and track key metrics to measure the impact of the IDP on developer productivity (e.g., deployment frequency, lead time for changes, time to recovery, developer satisfaction). Implement mechanisms for collecting and analyzing data related to developer workflows and platform usage. Identify and implement solutions to streamline developer workflows, reduce toil, and accelerate application delivery based on data and feedback. Potentially lead initiatives to integrate and leverage tools like Backstage to enhance developer experience and provide a centralized platform. Tooling & Integration: Evaluate and integrate relevant tools and technologies into the IDP ecosystem, including CI/CD systems, monitoring tools, logging solutions, security scanners, and potentially IDP frameworks like Backstage. Ensure seamless integration between different platform components and existing development tools. We can see the next Entrepreneur At Seclore if you: 6+ years of relevant experience in software engineering, platform engineering, or DevOps roles, with increasing levels of responsibility. Proven experience leading and managing engineering teams, including hiring, mentoring, and performance management. Strong understanding of the software development lifecycle and common developer workflows. Deep technical expertise in cloud platforms (e.g., AWS, Azure, GCP) and cloud-native technologies (e.g., Kubernetes, Docker, serverless). Extensive experience with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation). Significant experience designing and implementing CI/CD pipelines using tools like Jenkins, GitLab CI, GitHub Actions, CircleCI, Argo CD, or Flux CD. Solid understanding of observability principles and hands-on experience with monitoring tools (e.g., Prometheus, Grafana, Datadog), logging solutions (e.g., ELK stack, Splunk), and distributed tracing (e.g., Jaeger, Zipkin). Strong understanding of security best practices for cloud environments and containerized applications, and experience with security scanning tools and secrets management. Experience in managing and configuring Code Quality tools like SonarQube Experience in managing and configuring Git tools like Gitlab Proficiency in at least one Programming language (e.g., Python, Go) for automation. Understanding of API design principles (REST, GraphQL) and experience with building and consuming APIs. Experience with data collection and analysis to identify trends and measure the impact of platform initiatives. Excellent communication, collaboration, and interpersonal skills, with the ability to influence and build consensus across teams. Strong problem-solving and analytical abilities. Experience working in an Agile development environment. Prior experience building and maintaining an Internal Developer Platform (IDP). Hands-on experience with IDP frameworks like Backstage, including setup, configuration, plugin development, and integration with other tools. Familiarity with developer productivity frameworks and methodologies. Experience with other programming languages commonly used by development teams (e.g., Java, Node.js, C++). Experience with service mesh technologies. Knowledge of cost management and optimization in the cloud. Experience in defining and tracking developer productivity metrics. Experience with data visualization tools (e.g., Grafana, Tableau). Why do we call Seclorites Entrepreneurs not Employees? We value and support those who take the initiative and calculate risks. We have an attitude of a problem solver and an aptitude that is tech agnostic. You get to work with the smartest minds in the business. We are thriving not living. At Seclore, it is not just about work but about creating outstanding employee experiences. Our supportive and open culture enables our team to thrive. Excited to be the next Entrepreneur, apply today! Don't have some of the above points in your resume at the moment? Don't worry. We will help you build it. Let's build the future of data security at Seclore together.

Posted 2 weeks ago

Apply

9.0 - 12.0 years

20 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Naukri logo

We are looking for "Board Architect" with Minimum 9 years experience Contact- Atchaya (95001 64554) Required Candidate profile 7-8 yrs of BOARD experience with Architecture design, System Admin / Performance Mgmt. of 3-4 yrs. performance monitoring tools (e.g., AppDynamics, New Relic, Prometheus, Grafana).

Posted 2 weeks ago

Apply

4.0 - 7.0 years

11 - 16 Lacs

Pune

Hybrid

Naukri logo

So, what’s the role all about? As a Sr. Cloud Services Automation Engineer, you will be responsible for designing, developing, and maintaining robust end-to-end automation solutions that support our customer onboarding processes from an on-prem software solution to Azure SAAS platform and streamline cloud operations. You will work closely with Professional Services, Cloud Operations, and Engineering teams to implement tools and frameworks that ensure seamless deployment, monitoring, and self-healing of applications running in Azure. How will you make an impact? Design and develop automated workflows that orchestrate complex processes across multiple systems, databases, endpoints, and storage solutions in on-prem and public cloud. Design, develop, and maintain internal tools/utilities using C#, PowerShell, Python, Bash to automate and optimize cloud onboarding workflows. Create integrations with REST APIs and other services to ingest and process external/internal data. Query and analyze data from various sources such as, SQL databases, Elastic Search indices and Log files (structured and unstructured) Develop utilities to visualize, summarize, or otherwise make data actionable for Professional Services and QA engineers. Work closely with test, ingestion, and configuration teams to understand bottlenecks and build self-healing mechanisms for high availability and performance. Build automated data pipelines with data consistency and reconciliation checks using tools like PowerBI/Grafana for collecting metrics from multiple endpoints and generating centralized and actionable dashboards. Automate resource provisioning across Azure services including AKS, Web Apps, and storage solutions Experience in building Infrastructure-as-code (IaC) solutions using tools like Terraform, Bicep, or ARM templates Develop end-to-end workflow automation in customer onboarding journey that spans from Day 1 to Day 2 with minimal manual intervention Have you got what it takes? Bachelor’s degree in computer science, Engineering, or related field (or equivalent experience). Proficiency in scripting and programming languages (e.g., C#, .NET, PowerShell, Python, Bash). Experience working with and integrating REST APIs Experience with IaC and configuration management tools (e.g., Terraform, Ansible) Familiarity with monitoring and logging solutions (e.g., Azure Monitor, Log Analytics, Prometheus, Grafana). Familiarity with modern version control systems (e.g., GitHub). Excellent problem-solving skills and attention to detail. Ability to work with development and operations teams, to achieve desired results, on common projects Strategic thinker and capable of learning new technologies quickly Good communication with peers, subordinates and managers You will have an advantage if you also have: Experience with AKS infrastructure administration. Experience orchestrating automation with Azure Automation tools like Logic Apps. Experience working in a secure, compliance driven environment (e.g. CJIS/PCI/SOX/ISO) Certifications in vendor or industry specific technologies. What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr! Enjoy NICE-FLEX! At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 7454 Reporting into: Director of Cloud Services Role Type: Individual Contributor

Posted 2 weeks ago

Apply

13.0 - 18.0 years

35 - 55 Lacs

Bengaluru

Hybrid

Naukri logo

SRE Manager About Ushur I Ushur XOS l Ushur GenA I Location: Bangalore Work Mode: Hybrid Experince: 12 to 18 Years The Role Our fast-growing team is seeking a Manager of SRE to join us as we pioneer Customer Experience AutomationTM as an Industry category. As the Manager of SRE you will be responsible for two important charters Operate and manage Ushurs production cloud Build a white-glove customer support and incident management function The ideal candidate for this role will be passionate about building a healthy high-performing team, and bring strong technical leadership, a customer-centric focus, and results-oriented action. You will begin as a player/coach while building and continuously improving execution, processes, tools/technology and analytics. Responsibilities Build and Manage a world-class SRE team. Design a 24x7 follow-the-sun organization including seamless handover across regions. Mentor and grow team focused on delivering white glove support and incident management service. Drive data-driven SRE strategy by defining and prioritizing SRE Objectives and Key Results (OKRs) aligned with company mission. This includes setting measurable targets for key service level agreements Manager Enterprise Support function to deliver exceptional white glove experiences at scale in close partnership with our Customer Success, Solution Consulting and Engineering teams. Responsible for ensuring that the Ushur platform runs reliably in production. Partner with the DevOps, Security and Engineering teams to automate deployment, monitoring and observability of the production cloud. Bring deep technical expertise in Ushur Customer Experience Automation. Provide customers with ongoing technical support and incident management for complex issues and support escalations. Optimize and automate support processes including improving the reliability of on-call processes, managing incidents, updating runbooks and documentation, reviewing RCAs and recommending solutions to prevent the recurrence and severity of incidents. Cross-functionally to drive positive customer outcomes. Engage with Product, Sales, Customer Success, Solution Consulting, Security, and Engineering, as necessary to make customers successful on our platform Qualifications 5+ years of experience of SRE/CloudOps Manager/Lead role in Enterprise SaaS Track record of developing and mentoring great talent, building and motivating high-achieving teams. Ability to lead diverse teams across multiple time zones. Business Acumen - Ability to quickly grasp and adapt to a variety of customer verticals, geographies, and business structures. Excellent verbal, written, and presentation skills with the ability to absorb complex technical concepts and communicate them to a non-technical audience Highly organized, collaborative and detail-oriented Deep experience with AWS cloud services, REST APIs, Linux Experience with DevOps processes and Build deployment, and orchestration technologies Passion for technology and for being a part of a fast-growing SaaS startup where we move quickly and wear many hats Flexible approach, able to operate effectively with uncertainty and change Driven, self-motivated, enthusiastic and with a can do attitude Benefits Great Company Culture. We pride ourselves on having a values-based culture that is welcoming, intentional, and respectful. Bring your whole self to work . We are focused on building a diverse culture, with innovative ideas where you and your ideas are valued. We are a start-up and know that every person has a significant impact! Rest and Relaxation . 20 days of flexible leaves per year, Monthly Wellness Day (aka a day off to care for yourself) and more! Health Benefits. Preventive health checkups, Medical Insurance covering the dependents, wellness sessions, and health talks at the office Keep learning. One of our core values is Growth Mindset - we believe in lifelong learning. Certification courses are reimbursed. Ushur Community offers wide resources for our employees to learn and grow. Flexible Work. In-office or hybrid working model, depending on position and location. We seek to create an environment for all our employees where they can thrive in both their profession and personal life. Why join us? We are passionate about Ushur, our product, and helping our employees grow and develop in their career in a caring, collaborative environment. We offer a very competitive compensation plan & stock options for the ideal candidates.

Posted 2 weeks ago

Apply

3.0 - 6.0 years

10 - 14 Lacs

Mumbai

Work from Office

Naukri logo

About This Role About this role The Aladdin Studio team is focused on developing a world-class digital experience which will help developers of all types build faster and more effectively on Aladdin We are evolving the Studio Developer platform as an integrated digital application where you can discover data, build your own financial apps, and access industry-leading content, documentation and insight with all of this delivered via a design-forward and client-centric experience, As a member of the Studio Developer Operations team, you will interact, engage and solve problems for some of the most technically sophisticated users of Aladdin Our team is also responsible for delivering the monitoring, logging, alerting and observability framework of Studio Developer to ensure our product is scalable and resilient as we enter a period of significant growth, Role Description 3-5 years of hand-on experience working as part of Platform Operations, Site Reliability Engineering,DevOpsor related engineering teams, Building your skills as a domain expert on the functionality and capabilities of the platform, Triaging and timely resolution of client inquiries, Enable user best practice execution on the platform including training and adoption of new platform features, Understanding and acting on platform telemetry alerts including invocation of our Incident Management response plays, Look for opportunities to automate our workflows to improve our teams effectiveness and efficiency, Reporting and metrics generation on platform reliability as well as user inquiry trends, Contribute to building out our observability framework to enhance our platform, Desirable Skills Experience building, managing and supporting large-scale platforms, Understanding of the K8s Operator Pattern -comfort and courage to wade into (predominantly golang based) operator implementation code bases Hands-on experience deploying log management andobservability platform tooling: SPLUNK / Prometheus / Grafana, AlertManager, Strong attention to details and focus on high quality delivery, Comfortable reading and writing Python code, Comfortable working with clients and partners at all levels of the business, Our Benefits To help you stay energized, engaged and inspired, we offer a wide range of benefits including a strong retirement plan, tuition reimbursement, comprehensive healthcare, support for working parents and Flexible Time Off (FTO) so you can relax, recharge and be there for the people you care about, Our hybrid work model BlackRocks hybrid work model is designed to enable a culture of collaboration and apprenticeship that enriches the experience of our employees, while supporting flexibility for all Employees are currently required to work at least 4 days in the office per week, with the flexibility to work from home 1 day a week Some business groups may require more time in the office due to their roles and responsibilities We remain focused on increasing the impactful moments that arise when we work together in person aligned with our commitment to performance and innovation As a new joiner, you can count on this hybrid model to accelerate your learning and onboarding experience here at BlackRock, About BlackRock At BlackRock, we are all connected by one mission: to help more and more people experience financial well-being Our clients, and the people they serve, are saving for retirement, paying for their childrens educations, buying homes and starting businesses Their investments also help to strengthen the global economy: support businesses small and large; finance infrastructure projects that connect and power cities; and facilitate innovations that drive progress, This mission would not be possible without our smartest investment the one we make in our employees

Posted 2 weeks ago

Apply

5.0 - 7.0 years

7 - 11 Lacs

Jaipur, Bengaluru

Work from Office

Naukri logo

In Time Tec is an award-winning IT & software company. In Time Tec offers progressive software development services, enabling its clients to keep their brightest and most valuable talent focused on innovation. In Time Tec has a leadership team averaging 15 years in software/firmware R&D, and 20 years building onshore/offshore R&D teams. We are looking for rare talent to join us. People having a positive mindset and great organizational skills will be drawn to the position. Your capacity to take initiative and solve problems as they emerge, flexibility, and honesty, will be key factors for your success at In Time Tec. We’re looking for an Interactive Backend Engineer – Python & DevOps who will be responsible for managing the release pipeline. This person will not just be involved in the scripting but also in the development and will be directly supporting the development and content teams that are creating and publishing content on most trafficked websites. The ideal candidate is someone who has worked in a build/release role previously, has strong communication skills, and who knows how to handle the unexpected scenarios. Roles and Responsibilities Backend Engineer – Python & DevOps Skills: Strong programming experience in Python (not just scripting — real development). Experience with CI/CD tools like Jenkins. Proficient in Git and source control workflows. Experience with Docker , Kubernetes , and Linux environments . Familiarity with scripting languages like Bash , optionally Groovy or Go . Knowledge of web application servers and deployment processes. Good understanding of DevOps principles , cloud environments, and automation. Nice to Have: Experience with monitoring/logging tools (e.g., Prometheus, Grafana, ELK stack). Exposure to configuration management tools like Ansible . Experience in performance tuning and scaling backend systems.

Posted 2 weeks ago

Apply

5.0 - 9.0 years

9 - 13 Lacs

Bengaluru

Work from Office

Naukri logo

Job Title: Software Engineer (Contractual) Location: Bengaluru (Work From Office) Experience: 5 – 6 Years Salary: 9 LPA – 13 LPA Employment Type: Contractual (1 Year + Extendable) Client: MicroGenesis On Payroll of: Nyxtech Hiring Contact: Yash Sharma (LinkedIn: linkedin.com/in/yashsharma1608) Notice Period: Immediate to 15 Days Job Description: We are looking for a skilled Software Engineer with expertise in GitLab administration, AWS infrastructure management with Terraform, containerization technologies, and monitoring tools to join our client MicroGenesis on a contractual basis. The role requires hands-on experience with cloud infrastructure, DevOps tools, and Linux environments. Responsibilities: Manage and administer GitLab for CI/CD pipelines and repository management. Design and implement AWS infrastructure using Terraform. Manage Linux-based environments and troubleshoot system issues. Containerize applications using Docker and orchestrate using Kubernetes. Monitor system health and application performance with Prometheus, Grafana, and CloudWatch. Collaborate with development and operations teams to streamline deployment and monitoring processes. Required Skills & Experience: 5 to 6 years of relevant experience in software engineering or DevOps roles. Strong experience with GitLab administration. Proficient with AWS and infrastructure-as-code using Terraform. Solid Linux knowledge and troubleshooting skills. Hands-on experience with Docker and Kubernetes. Familiarity with monitoring tools such as Prometheus, Grafana, and CloudWatch. Ability to work onsite in Bengaluru. Immediate to 15 days notice period preferred. Contract Details: Initial contract for 1 year, extendable based on performance and business needs. Candidate will be on payroll of Nyxtech, working with client MicroGenesis.

Posted 2 weeks ago

Apply

5.0 - 6.0 years

15 - 16 Lacs

Chennai

Work from Office

Naukri logo

Job Description: We are looking for a highly skilled DevOps Engineer with strong experience in Red Hat OpenShift Container Platform (v4.x) and related DevOps tools like Argo CD , Jenkins , and Red Hat Data Grid . The ideal candidate will be responsible for automation, managing containerized environments, and ensuring robust CI/CD pipelines across hybrid cloud infrastructure supporting our fintech solutions. Key Responsibilities: OpenShift Platform Engineering: Deploy, manage, and maintain apps on OpenShift v4.x. Manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Handle Red Hat Data Grid deployments. Perform OCP upgrades, patching, and troubleshooting. CI/CD & Automation: Implement CI/CD pipelines using Jenkins, Argo CD, GitHub Actions. Ensure seamless code integration and automated deployment. Infrastructure as Code (IaC): Automate infrastructure using Terraform, Ansible, CloudFormation. Manage infrastructure on AWS, Azure, or GCP. Monitoring & Optimization: Set up observability stacks (Prometheus, Grafana, ELK, Splunk). Troubleshoot and optimize system performance. Security & Collaboration: Apply DevSecOps best practices and ensure compliance. Collaborate with development and DevOps teams for solution implementation. Desired Candidate Profile: Technical Skills: Red Hat OpenShift (v4.x) administration & operations. CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Kubernetes, Docker, Helm, GitOps. Red Hat Data Grid or other in-memory data grids. IaC tools: Terraform, Ansible, CloudFormation. Monitoring tools: Prometheus, Grafana, ELK, Splunk. Scripting: Bash, Python, or Shell. Soft Skills: Excellent analytical and problem-solving skills. Strong communication and collaboration abilities. Ability to work independently and with customer DevOps teams. Education: BE / B.Tech / MCA or equivalent in Computer Science or related fields. Work Location: Chennai

Posted 2 weeks ago

Apply

4.0 - 7.0 years

9 - 13 Lacs

Pune

Hybrid

Naukri logo

So, what’s the role all about? Seeking a skilled and experienced DevOps Engineer in designing, producing, and testing high-quality software that meets specified functional and non-functional requirements within the time and resource constraints given. How will you make an impact? Design, implement, and maintain CI/CD pipelines using Jenkins to support automated builds, testing, and deployments. Manage and optimize AWS infrastructure for scalability, reliability, and cost-effectiveness. To streamline operational workflows and develop automation scripts and tools using shell scripting and other programming languages. Collaborate with cross-functional teams (Development, QA, Operations) to ensure seamless software delivery and deployment. Monitor and troubleshoot infrastructure, build failures, and deployment issues to ensure high availability and performance. Implement and maintain robust configuration management practices and infrastructure-as-code principles. Document processes, systems, and configurations to ensure knowledge sharing and maintain operational consistency. Performing ongoing maintenance and upgrades (Production & non-production) Occasional weekend or after-hours work as needed Have you got what it takes? Experience: 4-7 years in DevOps or a similar role. Cloud Expertise: Proficient in AWS services such as EC2, S3, RDS, Lambda, IAM, CloudFormation, or similar. CI/CD Tools: Hands-on experience with Jenkins pipelines (declarative and scripted). Scripting Skills: Proficiency in either shell scripting or powershell Programming Knowledge: Familiarity with at least one programming language (e.g., Python, Java, or Go). IMP: Scripting/Programming is integral to this role and will be a key focus in the interview process. Version Control: Experience with Git and Git-based workflows. Monitoring Tools: Familiarity with tools like CloudWatch, Prometheus, or similar. Problem-solving: Strong analytical and troubleshooting skills in a fast-paced environment. CDK Knowledge in AWS DevOps. You will have an advantage if you also have: Prior experience in Development or Automation is a significant advantage. Windows system administration is a significant advantage. Experience with monitoring and log analysis tools is an advantage. Jenkins pipeline knowledge What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr! Enjoy NICE-FLEX! At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 6119 Reporting into: Tech Manager Role Type: Individual Contributor

Posted 2 weeks ago

Apply

2.0 - 3.0 years

4 - 5 Lacs

Rajkot

Work from Office

Naukri logo

Technical Requirements: Excellent understanding of Linux commands. Thorough knowledge of CI/CD pipelines, automation, and debugging, particularly with Jenkins. Intermediate to advanced understanding of Docker and container orchestration platforms. Hands-on experience with web servers (Apache, Nginx), database servers (MongoDB, MySQL, PostgreSQL), and application servers (PHP, Node.js). Knowledge of proxies and reverse proxies is required. Good understanding and hands-on experience with site reliability tools such as Prometheus, Grafana, New Relic, Datadog, and Splunk. (Hands-on experience with at least one tool is highly desirable.) Ability to identify and fix security vulnerabilities at the OS, database, and application levels. Knowledge of cloud platforms, specifically AWS and DigitalOcean, and their commonly used services. Other Requirements: Good communication skills. Out-of-the-box problem-solving capabilities, especially in the context of technology automation and application architecture reviews. Hands-on experience with GKE, AKS, EKS, or ECS is a plus. Excellent understanding of how to craft effective AI prompts to solve specific issues.

Posted 2 weeks ago

Apply

5.0 - 7.0 years

15 - 27 Lacs

Bangalore Rural, Bengaluru

Work from Office

Naukri logo

DevOps, Site Reliability Engineering,loud platforms,GCP,Infrastructure as Code tools (Terraform, Ansible, CloudFormation), Prometheus, Grafana, ELK stack,Python, Bash, Go, Istio, Linkerd

Posted 2 weeks ago

Apply

9.0 - 14.0 years

35 - 40 Lacs

Pune

Work from Office

Naukri logo

Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. As an Assistant Vice President, your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. You will also be working as application lead and will be responsible for technical & operational processes for all application you support. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support. Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Manage regional L2 team and vendor teams supporting the application. Ensure the team is up to speed and picks up the support duties. Build up technical subject matter expertise on the applications being supported including business flows, application architecture, and hardware configuration. Define and track KPIs, SLAs and operational metrics to measure and improve application stability and performance. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time) using an array of monitoring tools. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Approach support with a proactive attitude, desire to seek root cause, in-depth analysis, and strive to reduce inefficiencies and manual efforts. Mentor and guide junior team members, fostering technical upskill and knowledge sharing. Provide strategic input into disaster recovery planning, failover strategies and business continuity procedures Collaborate and deliver on initiatives and install these initiatives to drive stability in the environment. Perform reviews of all open production items with the development team and push for updates and resolutions to outstanding tasks and reoccurring issues. Drive service resilience by implementing SRE(site reliability engineering) principles, ensuring proactive monitoring, automation and operational efficiency. Ensure regulatory and compliance adherence, managing audits,access reviews, and security controls in line with organizational policies. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Weekend on-call coverage needs to be provided on rotational/need basis. Your skills and experience 9-15 years of experience in providing hands on IT application support. Experience in managing vendor teams providing 24x7 support. Preferred : Team lead role experience, Experience in an investment bank, financial institution. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred : ITIL v3 foundation certification or higher. Knowledgeable in cloud products like Google Cloud Platform (GCP) and hybrid applications. Strong understanding of ITIL /SRE/ DEVOPS best practices for supporting a production environment. Understanding of KPIs, SLO, SLA and SLI Monitoring Tools: Knowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring, Airflow,Splunk. Working Knowledge of creation of Dashboards and reports for senior management Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Proven experience in leading L2 support teams, including managing vendor teams and offshore resources. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. Ability to manage high-pressure issues, coordinating across teams to drive swift resolution. Strong negotiation skills with interface teams to drive process improvements and efficiency gains.

Posted 2 weeks ago

Apply

5.0 - 8.0 years

7 - 10 Lacs

Chennai

Work from Office

Naukri logo

What youll be doing... As a devops engineer, you will design, implement, and manage Kubernetes clusters for our telecom/networking applications. Developing and maintaining CI/CD pipelines for automated build, testing, and deployment. Monitoring and optimizing the performance and scalability of our Kubernetes infrastructure. Implementing and maintaining monitoring and alerting systems to proactively identify and resolve issues. Leading incident response and troubleshooting efforts, including root cause analysis. Automating operational tasks and processes to improve efficiency. Collaborating with development teams to integrate and deploy applications to Kubernetes. Contributing to the development and maintenance of our platform's security posture. Participating in on-call rotations to provide support for production systems. Leveraging network/telecom domain knowledge to effectively triage and resolve network-related issues. Contributing to development efforts by writing code and implementing new features (added advantage). Staying up-to-date with the latest Kubernetes and DevOps technologies and best practices. What were looking for: We are seeking a highly motivated and experienced Engineer with a strong background in Kubernetes and DevOps practices to join our team. This role will focus on building, maintaining, and scaling our network/telecom infrastructure and services in a kubernetes/Openshift based environment. You will play a key role in ensuring the reliability, performance, and security of our platform, working closely with development, operations, and other engineering teams. Experience with triaging and troubleshooting complex issues is essential, as is a willingness to contribute to development efforts. You'll need to have: Bachelors degree or four or more years of work experience. Four or more years of relevant work experience. Four or more years of experience in DevOps engineering or a related role. Proven experience with Kubernetes and containerization technologies (e.g., Docker). Experience with CI/CD tools (e.g., Jenkins, GitLab ). Strong understanding of networking concepts and protocols (e.g., TCP/IP, BGP, MPLS). Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). Experience with cloud computing platforms (e.g., AWS, Azure, GCP) is an added advantage. Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills. Experience in the telecom/networking domain is essential. Experience with scripting languages (e.g., Python, Bash) is highly desirable. Experience with development and coding is a significant advantage. Even better if you have one or more of the following: Experience with a high-performance, high-availability environment. Experience with Network technologies like SDN/NFV Strong analytical, debugging skills. Good communication and presentation skills. Relevant certifications.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

10 - 20 Lacs

Noida

Work from Office

Naukri logo

JOB DESCRIPTION Experience Level desired 5+ yrs Compensation: Salary commensurate w/ experience Reports to: Team Lead RESPONSIBILITIES Application Performance Monitoring - Using Dynatrace APM tools to optimize application performance and identify performance bottlenecks in web applications and provide solutions. Dynatrace OneAgent Installation and Troubleshoot on all types of platform like on cloud (Azure Infra, AKS ,App Services)/on-premises Dynatrace integration with 3rd Party Tools. Demonstrates thorough knowledge and awareness of application performance issues in a complex multi-tiered environment Knowledge of Customer Experience Management, Application Performance monitoring and log analytics tools like Splunk, Dynatrace Synthetic, Dynatrace Appmon, CA APM, Prometheus etc., is highly desired. On-board new application into Dynatrace, profile configuration, agent setup, instrumentation. Ability to do requirement gathering and target environment analysis from an APM perspective Hands-on implementation experience in Dynatrace On-Premise solutions Experience on Configuration and customization of Dynatrace solution Excellent communication skills (both verbal and written) Knowledge of Azure is preferred Hands on APM and other tools like- DataDog, Glassbox, Splunk, Grafana, Prometheus, New-Relic, Postman, Azure Appinsights, Azure Log, Jenkins, Docker. Power BI (good to have required for reporting and extracting data from tools) QUALIFICATIONS B.Tech or MCA preferred Atleast 5 yrs work exp. Good Communication skills

Posted 2 weeks ago

Apply

5.0 - 7.0 years

5 - 9 Lacs

Mumbai, Bengaluru, Delhi / NCR

Work from Office

Naukri logo

Key Responsibilities : Chaos Engineering : - Design and implement chaos engineering experiments to identify weaknesses in systems and applications. - Develop and execute strategies to improve system resilience and reliability. - Analyze experiment results, provide actionable insights, and drive remediation efforts. - Collaborate with development, operations, and infrastructure teams to integrate chaos engineering practices. Operational Acceptance : - Develop and maintain comprehensive operational acceptance criteria for new and existing systems. - Conduct thorough operational acceptance testing, ensuring systems meet all predefined criteria before go-live. - Work closely with project managers, developers, and QA teams to align operational acceptance processes with project timelines and objectives. - Document and communicate operational readiness findings, providing recommendations for improvement. System Resilience and Reliability : - Implement and manage strategies for continuous improvement of system resilience and reliability. - Monitor and assess system performance, identifying potential risks and areas for enhancement. - Lead initiatives to improve disaster recovery and business continuity plans. - Stay updated with the latest industry trends and best practices in chaos engineering and operational acceptance. Collaboration and Training : - Educate and mentor team members on chaos engineering and operational acceptance methodologies. - Foster a culture of resilience and reliability within the organization. - Engage with external communities, attending conferences and participating in knowledge-sharing events. Requirements : - Extensive experience in chaos engineering, operational acceptance testing, and system resilience. - Strong understanding of cloud platforms (AWS, Azure, GCP) and their resilience features. - Proficiency in scripting and automation tools (Python, Bash, Terraform, etc. - Experience with monitoring and observability tools (Prometheus, Grafana, Splunk, etc. - Experience with Chaos Engineering Tools such as Gremlin, Chaos Monkey etc. - Excellent analytical and problem-solving skills. - Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams. - Certifications in relevant fields (e.g , AWS Certified Solutions Architect, Azure DevOps Engineer) are a plus. Location: Delhi NCR,Bangalore,Chennai,Pune,Kolkata,Ahmedabad,Mumbai,Hyderabad

Posted 2 weeks ago

Apply

3.0 - 8.0 years

5 - 10 Lacs

Pune

Work from Office

Naukri logo

BMCs SaaS Ops team is looking for a DevOps Engineer to join us and design, develop, and implement complex applications, using the latest technologies. Here is how, through this exciting role, YOU will contribute to BMC's and your own success: Participate in all aspects of SaaS product development, from requirements analysis to product release and sustaining. Drive the adoption of the DevOps process and tools across the organization. Learn and implement cutting-edge technologies and tools to build best of class enterprise SaaS solutions. Deliver high-quality enterprise SaaS offerings on schedule Develop Continuous Delivery Pipeline Required Skills: 3+ years of working experience in a software engineering function Hands on experience with CI\CD pipelines and maintenance of containerized deployments Fundamental knowledge of one of automation scripting language Python, Groovy, Ansible, or Shell scripting Hands on experience in creating and maintaining Jenkins pipelines Hands on experience working with Web service protocols (Rest, JSON) Hands on experience working with DevOps and Automation tools like Git, Docker, Helm, Terraform, Jira, Harbor Registry Proficient working on Windows and Linux Operation System platforms. Good exposure and fundamental knowledge of Relational DBs (PostgreSQL, MS SQL) Good exposure and fundamental knowledge of container deployments, persistent storage, PODs, ingress, routes and Kubernetes objects. Good exposure and fundamental knowledge of tools like Elastic Search, Kibana, Grafana, Prometheus Good exposure and fundamental knowledge of Public, Private and hybrid cloud deployments Good exposure and fundamental knowledge of Site Reliability Engineering (SRE) principles and its implementation for SaaS services. Experience working in an Agile methodology with cross functional teams (R&D, DevOps, Operations, Support etc.) Able to design & document the Standard Operating Procedures (SOPs), design document and architecture artifacts Good troubleshooting skills and knowledge of BMC Helix products including ITSM, Digital Workplace, Helix Platform will be an add-on. Ability to work with time bound deadlines Hard Working & dedicated person with effective communication skills Bachelors degree in IT or equivalent professional experience This position is part of BMC SaaS DevOps team. This can include weekend work during scheduled production activities and after-hours work as needed.

Posted 2 weeks ago

Apply

4.0 - 9.0 years

17 - 22 Lacs

Bengaluru

Work from Office

Naukri logo

Job Summary: We are seeking a highly skilled Site Reliability Engineer (SRE) with experience to join our team in Bangalore. The ideal candidate will excel in implementing SRE principles to foster a culture of reliability, automation, and monitoring across our software engineering projects. This role is pivotal in ensuring the effective design, development, testing, and support of applications and systems, particularly within cloud environments. Software Requirements: Required Proficiency: Programming LanguagesTypeScript, Node.js Cloud EnvironmentsAWS (ECS Fargate, Vault, Lambda services, Artifactory) CI/CD ToolsGitHub Actions, JFrog Artifactory, Sysdig, Octopus, Terraform Observability ToolsObStack, Prometheus, Grafana, PagerDuty, Observe Infrastructure as Code (IaC) ToolsCloudFormation, Terraform Preferred Proficiency: Familiarity with additional programming languages or frameworks Experience with cloud platforms other than AWS Overall Responsibilities: Partner with senior stakeholders to lead a culture focused on data-driven reliability, monitoring, and automation in alignment with SRE principles. Design, develop, test, and support applications and systems, emphasizing managing and scaling distributed systems across cloud environments. Create and develop tools essential for the operational management and security of software applications and systems. Identify technology limitations and deficiencies in existing systems and implement scalable improvements. Drive automation efforts and enhance application monitoring capabilities. Review code developed by other engineers to ensure adherence to best practices. Thrive in incident response environments, conducting post-mortem analyses and designing secure solutions. Measure and optimize system performance, addressing customer needs and innovating for continuous improvement. Technical Skills (By Category): Programming Languages: Required: TypeScript, Node.js Cloud Technologies: Required: AWS (ECS Fargate, Lambda, Vault, Artifactory) Development Tools and Methodologies: Required: GitHub Actions, JFrog Artifactory, Sysdig, Octopus, Terraform Observability Tools: Required: ObStack, Prometheus, Grafana, PagerDuty, Observe Infrastructure as Code (IaC): Required: CloudFormation, Terraform Experience Requirements: 7 to 10 years of experience in software engineering and SRE practices. Experience in applying SRE practices in large organizations. Familiarity with modern software development practices and DevSecOps environments. Day-to-Day Activities: Collaborate with stakeholders to understand business needs and implement SRE practices. Lead cross-functional teams in enhancing system reliability and performance. Develop and maintain operational management tools for applications. Conduct regular code reviews and ensure adherence to best practices. Participate in incident response and post-mortem analysis to improve system resilience. Qualifications: Required: Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field. Commitment to continuous professional development through industry certifications and training. Professional Competencies: Strong critical thinking and problem-solving skills. Excellent leadership and teamwork abilities. Effective communication and stakeholder management skills. Adaptability and a learning-oriented mindset. Innovative thinking to drive continuous improvement. Strong time and priority management skills.

Posted 2 weeks ago

Apply

1.0 - 5.0 years

8 - 15 Lacs

Bengaluru

Work from Office

Naukri logo

Junior DevOps Engineer / DevOps Engineer Location: Bengaluru South, Karnataka, India Experience: 1.53 Years Compensation: 815 LPA Employment Type: Full-Time | Work From Office Only ________________________________________ Are you an aspiring DevOps professional ready to work on a transformative platform? Join a purpose-led team building India’s most disruptive ecosystem at the intersection of technology, property, and sustainability. This role is ideal for engineers who are eager to learn, automate, and contribute to building reliable, scalable, and secure infrastructure. Key Responsibilities Assist in designing, implementing, and managing CI/CD pipelines using tools like Jenkins or GitLab CI to automate build, test, and deployment processes. Support the deployment and management of cloud infrastructure, primarily on AWS, with exposure to Azure or GCP. Contribute to infrastructure as code practices using Terraform, CloudFormation, or Ansible. Participate in maintaining and operating containerized applications using Docker and Kubernetes. Implement and manage monitoring and logging solutions using Grafana, Loki, Prometheus, or ELK stack. Collaborate with engineering and QA teams to streamline release pipelines, ensuring high availability and performance. Develop basic automation scripts in Python or Bash to optimize and streamline operational tasks. Gain exposure to serverless and event-driven architectures under guidance from senior engineers. Troubleshoot infrastructure issues and contribute to system security and performance optimization. Requirements 1.5 to 3 years of experience in DevOps, SRE, or related infrastructure roles. Solid understanding of cloud environments (AWS preferred; Azure/GCP a plus). Basic to intermediate scripting knowledge in Python or Bash. Familiarity with CI/CD concepts and tools such as Jenkins, GitLab CI, etc. Working knowledge of Docker and introductory experience with Kubernetes. Exposure to monitoring and logging stacks (Grafana, Loki, Prometheus, ELK). Understanding of infrastructure as code using tools like Terraform or Ansible. Familiarity with networking, DNS, firewalls, and system security practices. Strong problem-solving skills and a learning mindset. Preferred Qualifications Certifications in AWS, Azure, or GCP. Exposure to serverless architectures and event-driven systems. Experience with additional monitoring tools or scripting languages. Familiarity with geospatial systems, virtual mapping, or sustainability-oriented platforms. Passion for eco-conscious technology and impact-driven development. Why You Should Join Contribute to a next-gen PropTech platform promoting sustainable and inclusive land ownership. Work closely with senior engineers committed to mentorship and ecosystem building. Join a team where your ideas are valued, your skills are sharpened, and your work has real-world impact. Be part of a vibrant, office-first culture that encourages innovation, collaboration, and growth.

Posted 3 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies