1162 Prometheus Jobs - Page 18

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

punjab

On-site

The Senior Software Developer role in Perth requires a candidate with good hands-on experience in developing React/Angular based applications. The ideal candidate should possess a strong understanding of AWS Cloud services and be capable of setting up, maintaining, and enhancing the cloud infrastructure for web applications. It is essential for the candidate to have expertise in core AWS services, along with the ability to implement security and scalability best practices. Furthermore, the candidate will be responsible for establishing the CI/CD pipeline using the AWS CI/CD stack and should have practical experience in BDD/TDD methodologies. Familiarity with serverless approaches utilizing AWS Lambda, as well as proficiency in writing infrastructure as code using tools like CloudFormation, is required. Additionally, experience with Docker and Kubernetes would be advantageous for this role. A solid understanding of security best practices, including the utilization of IAM Roles and KMS, is essential. The candidate should also have exposure to monitoring solutions such as CloudWatch, Prometheus, and the ELK stack. Moreover, the candidate should possess good knowledge of DevOps practices to effectively contribute to the development and deployment processes. If you have any queries regarding this role, please feel free to reach out.

Posted 2 weeks ago

1.0 - 5.0 years

0 Lacs

kochi, kerala

On-site

As a Java Backend Developer in our team specializing in the IoT domain, your role will involve designing, developing, and deploying scalable microservices utilizing Spring Boot, SQL databases, and AWS services. You will play a pivotal role in guiding the backend development team, implementing DevOps best practices, and optimizing cloud infrastructure to ensure high-performance and secure services. Your key responsibilities will include architecting and implementing high-performance backend services using Java (Spring Boot), developing RESTful APIs and event-driven microservices with a focus on scalability and reliability, designing and optimizing SQL databases (PostgreSQL, MySQL), and deploying applications on AWS utilizing services like ECS, Lambda, RDS, S3, and API Gateway. In addition, you will be tasked with implementing CI/CD pipelines using tools such as GitHub Actions, Jenkins, or similar, monitoring and optimizing backend performance, ensuring best practices for security, authentication, and authorization using OAuth, JWT, and IAM roles, and collaborating with the team to maintain high standards of efficiency and quality. The ideal candidate will possess expertise in Java (Spring Boot, Spring Cloud, Spring Security), microservices architecture, API development, SQL (PostgreSQL, MySQL), ORM (JPA, Hibernate), DevOps tools (Docker, Kubernetes, Terraform, CI/CD, GitHub Actions, Jenkins), AWS cloud services (EC2, Lambda, ECS, RDS, S3, IAM, API Gateway, CloudWatch), messaging systems (Kafka, RabbitMQ, SQS, MQTT), testing frameworks (JUnit, Mockito, Integration Testing), and logging & monitoring tools (ELK Stack, Prometheus, Grafana). Preferred skills that would be beneficial for this role include experience in the IoT domain, previous work experience in startups, familiarity with event-driven architecture using Apache Kafka, knowledge of Infrastructure as Code (IaC) with Terraform, and exposure to serverless architectures. 
In return, we offer a competitive salary with performance-based incentives, the opportunity to lead and mentor a high-performing tech team, hands-on experience with cutting-edge cloud and microservices technologies, and a collaborative, fast-paced work environment where your skills and expertise will be valued and further developed. If you have experience in any IoT domain and are enthusiastic about contributing to a dynamic team focused on innovation and excellence, we invite you to apply for this full-time, on-site/hybrid Java Backend Developer position in Kochi.

Posted 2 weeks ago

4.0 - 8.0 years

0 Lacs

ahmedabad, gujarat

On-site

As an Experienced Systems Administrator, you will have a strong foundation in Linux, infrastructure management, and incident response. You will be skilled in monitoring, troubleshooting, and maintaining reliable systems across virtualized and cloud-based environments. Your main responsibilities will include collaborating with the operations team to manage escalations and oversee incident management. You will also be expected to implement strategies and solutions to enhance daily operations, focusing on system stability, security, and scalability. You will drive real-time monitoring of system performance and capacity, addressing alerts promptly to optimize systems. Leading troubleshooting efforts, you will coordinate responses to network and system issues. Your role will involve conducting and overseeing server, application, and network equipment setup and maintenance. Additionally, you will ensure effective outage notification and escalation for prompt resolution. Furthermore, mentoring and training team members on technical skills and troubleshooting methods will be a key part of your responsibilities. You will also be responsible for maintaining up-to-date documentation of processes and procedures in the WIKI.

Key Skills:
- Minimum 4 years of experience in Linux system administration.
- Proficiency in datacenter technologies and cloud platforms such as AWS/GCP.
- Experience in application deployment using tools like Git and StackStorm.
- Strong troubleshooting skills across networks and systems, including familiarity with network protocols (TCP/IP, UDP, ICMP) and tools like TCPdump.
- Advanced diagnostic skills in network performance and system capacity monitoring.
- Proficiency in Linux command-line operations.
- Analytical skills with the ability to interpret and act on data effectively.
- Ability to prioritize and escalate issues efficiently.
- Adaptability to shift work and capacity for multitasking in high-pressure scenarios.
- Excellent leadership, communication, and interpersonal skills.
- Bachelor's degree in Computer Science, Engineering (BE/B.Tech), MCA, or M.Sc.

Desired Skills:
- Basic experience with Configuration Management tools like Ansible, SaltStack, or StackStorm.
- Basic experience with CI/CD tools like Jenkins.
- Experience with monitoring tools such as Nagios, Sensu, Zabbix.
- Basic experience with Log Analytics tools like Splunk, Elasticsearch, Sumo Logic, Prometheus, or Grafana.
- Knowledge of Virtualization technologies like VMware, KVM.
- Strong fundamentals in Linux, troubleshooting, and networking.
- Knowledge of Containerization technologies like Kubernetes, Rancher.
- Experience with Cloud Providers such as AWS or GCP.
- Advanced knowledge of Networking concepts including BGP, F5 Load Balancer, and switching protocols.
- Relevant certifications like RHCSA, CCNA, or equivalent.

(hirist.tech)

Posted 2 weeks ago

10.0 - 14.0 years

0 Lacs

karnataka

On-site

As a member of the Data Center Network Services team at Cisco IT, you will be responsible for supporting network services for Cisco Engineering and business functions globally. Your primary mission will be to build a future-ready network that is adaptable and agile using Cisco's networking solutions. The networks you will be working on are deployed, monitored, and managed using a DevOps approach to facilitate rapid application changes. By investing in cutting-edge technologies, we ensure the delivery of services in a fast and reliable manner. The team culture fosters collaboration, creativity, and fun, encouraging team members to think innovatively and explore new ideas. In this environment, you will play a crucial role in designing, developing, testing, and deploying Data Center network capabilities. Your work will involve engaging with fellow engineers from different disciplines and internal clients to create innovative and high-quality solutions that enhance our clients' experience.

**Minimum Requirements:**
- Bachelor of Engineering or Technology with a minimum of 10 years of experience in designing, deploying, operating, and managing scalable DC network infrastructure using Nexus OS
- Proficiency in technologies such as Routing, Switching, Nexus, VPC, VDC, VLAN, VXLAN, and BGP
- Experience in incident, problem, and organizational change management
- Familiarity with DevOps principles and comfortable with Agile practices

**Preferred Qualifications:**
- CCNP or CCIE/DE certification
- Experience with SONiC NOS including basic configuration, network problem-solving, QoS monitoring and fix (especially for RoCEv2), BGP routing
- Desirable experience with L3 Fabrics, Nvidia and Linux networking, Python, Prometheus, Splunk, Grafana, and Cisco Firepower firewalls (FTD/FMC)

**Nice to have Qualifications:**
- Experience with Nexus Dashboard Fabric Controller for network building and troubleshooting
- Experience with VXLAN-based networks and problem-solving

In conclusion, at Cisco, we are at the forefront of revolutionizing how data and infrastructure connect and protect organizations in the AI era and beyond. With a history of 40 years of innovation, we create solutions that enable humans and technology to work together seamlessly across physical and digital realms. Our solutions empower customers with unrivaled security, visibility, and insights across their entire digital footprint. By leveraging our technology and global network, we continuously experiment and innovate to build impactful solutions. As part of the Cisco team, you will have limitless opportunities to grow and contribute on a global scale, collaborating with appreciation to achieve significant milestones. Cisco's impact is omnipresent, and it all starts with you.

Posted 2 weeks ago

10.0 - 15.0 years

0 Lacs

chennai, tamil nadu

On-site

As a Staff Software Engineer (Java) at Walmart Global Tech in Chennai, you will play a crucial role in guiding the team in making architectural decisions and implementing best practices for building scalable applications. Your responsibilities will include driving design, development, and documentation, as well as building, testing, and deploying cutting-edge solutions that impact associates of Walmart worldwide. You will collaborate with Walmart engineering teams globally, engage with Product Management and Business to drive the agenda, and work closely with Architects and cross-functional teams to deliver solutions meeting Quality, Cost, and Delivery standards. To excel in this role, you should have a Bachelor's/Master's degree in Computer Science or a related field with a minimum of 10 years of experience in software design, development, and automated deployments. Your expertise should include delivering highly scalable Java applications, strong system design skills, knowledge of CS Fundamentals, Microservices, Data Structures, Algorithms, and proficiency in writing modular and testable code. Experience with Java, Spring Boot, Kafka, and Spark, as well as working in cloud-based solutions, is essential. You should also have a good understanding of microservices architecture, distributed concepts, design principles, and cloud native development. Additionally, your skills should encompass working with relational and NoSQL databases, caching technologies, event-based systems like Kafka, and monitoring tools like Prometheus and Splunk. Experience with containerization tools such as Docker, Helm, and Kubernetes, as well as knowledge of public cloud platforms like Azure and GCP, will be advantageous in this role. At Walmart Global Tech, you will work in an innovative environment where your contributions can impact millions of people. You will have the opportunity to grow your career, gain new skills, and collaborate with experts in the field.
The company offers a flexible, hybrid work model, competitive compensation, and a range of benefits including maternity and parental leave, health benefits, and more. Walmart is committed to creating a culture of belonging where every associate feels valued and respected, fostering inclusivity and diversity across its global team. Join Walmart Global Tech to be part of a team that is shaping the future of retail, innovating at scale, and making a positive impact on the world.

Posted 2 weeks ago

8.0 - 12.0 years

0 Lacs

pune, maharashtra

On-site

You are a seasoned Team Lead responsible for leading a team of developers in the design, development, and deployment of scalable enterprise applications. Your expertise in Java technologies, microservices architecture, and database management, along with proficiency in Python and CI/CD practices, will be essential for this role. Your key responsibilities include providing technical leadership to guide the team in building and maintaining Java-based applications using Spring Boot and microservices architecture. You will be responsible for architecting and implementing RESTful APIs to ensure scalability and performance. Additionally, you will develop and optimize PL/SQL stored procedures for efficient data handling, establish and manage CI/CD pipelines using tools like Jenkins, GitLab CI, or Bamboo, and utilize Python for scripting and automation tasks as required. Conducting code reviews, enforcing best practices, and maintaining high code quality will also be part of your role. Collaboration with cross-functional teams to gather requirements and deliver solutions that align with business objectives is crucial. Your qualifications include 8+ years of experience, familiarity with Cloud Platforms like AWS or Azure, knowledge of containerization tools such as Docker and Kubernetes, proficiency in monitoring tools like Splunk or Prometheus, and experience working in Agile/Scrum environments. You hold a Bachelor's degree or equivalent experience and are committed to continuous learning and development. This job description provides an overview of your responsibilities, and you may be assigned other job-related duties as required.

Posted 2 weeks ago

6.0 - 11.0 years

13 - 17 Lacs

Pune

Work from Office

Hello Visionary! We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant team.

Siemens founded the new business unit Siemens Foundational Technologies (formerly known as Siemens IoT Services) on April 1, 2019, with its headquarters in Munich, Germany. It has been crafted to unlock the digital future of its clients by offering end-to-end support on their outstanding digitalization journey. Siemens Foundational Technologies is a strategic advisor and a trusted implementation partner in digital transformation and industrial IoT with a global network of more than 8000 employees in 10 countries and 21 offices. Highly skilled and experienced specialists offer services which range from consulting to craft & prototyping to solution & implementation and operation, all from a single source.

We are looking for a Senior DevOps Engineer - CI/CD and Component Clearing Automation.

You'll make a difference by: We are looking for a Senior DevOps Engineer with strong communication and negotiation skills to drive CI/CD automation, containerization, and software component clearing processes. This role will include onboarding projects for automated component clearing using GitLab CI/CD, managing Docker containerization, and ensuring Software Composition Analysis (SCA) for compliance and generating Software Bills of Materials (SBOMs). Additionally, the DevOps Engineer will act as a key interface between the design and development team and end-users, handling multiple requests via a service desk management tool, and providing expert support. This position requires a proactive communicator who can balance user requests with development priorities.
Responsibilities:
- Build, maintain, and optimize GitLab CI/CD pipelines for automated component clearing, integrating Software Composition Analysis (SCA) and license compliance checks.
- Onboard projects for automated component clearing, ensuring SBOM generation and regulatory compliance.
- Collaborate with stakeholders to gather requirements, set priorities, and ensure alignment between user requests and development roadmaps.
- Manage Docker containerization for consistent deployment across Dev, QA, and Production environments.
- Implement secure secret management practices within CI/CD workflows.
- Conduct SCA processes to identify vulnerabilities and ensure compliance using tools like BlackDuck Code Center and Veracode.
- Handle multiple requests from end-users through a service desk management tool, acting as the primary interface between the development team and end-users.
- Provide support and consultation to development and QA teams, continuously enhancing automation, release, deployment, and upgrade processes.
- Monitor and optimize CI/CD pipeline performance, troubleshooting issues to ensure efficiency and uptime.

Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- 6+ years of experience in DevOps or Site Reliability Engineering, with a focus on CI/CD, Docker, and Software Composition Analysis (SCA) tools.
- Strong expertise in GitLab CI/CD, Docker, and hands-on experience with SCA tools (e.g., BlackDuck Code Center, Veracode).
- Excellent communication and negotiation skills, with the ability to manage user requests via a service desk tool and effectively bridge communication between development teams and end-users.
- Proven knowledge of secret management within CI/CD pipelines and containerized environments.
- Experience with Grafana and Prometheus for monitoring and metric-based decision-making.

Desired Skills:
- 5-8 years of experience is required.
- Great communication skills.
- Analytical and problem-solving skills.

Join us and be yourself! Make your mark in our exciting world at Siemens. This role is based in Pune and is an individual contributor role. You might be required to visit other locations within India and outside. In return, you'll get the chance to work with teams impacting - and the shape of things to come.

Posted 2 weeks ago

7.0 - 12.0 years

11 - 15 Lacs

Bengaluru

Work from Office

As a Fortune 50 company with more than 400,000 team members worldwide, Target is an iconic brand and one of America's leading retailers. At Target, we have a timeless purpose and a proven strategy, and that hasn't happened by accident. Some of the best minds from diverse backgrounds come together at Target to redefine retail in an inclusive learning environment that values people and delivers world-class outcomes. That winning formula is especially apparent in Bengaluru, where Target in India operates as a fully integrated part of Target's global team and has more than 4,000 team members supporting the company's global strategy and operations.

Target Tech Overview
Every time a guest enters a Target store or browses Target.com, they experience the impact of Target's investments in technology and innovation. We're the technologists behind one of the most loved retail brands, delivering joy to millions of our guests, team members, and communities. Our global in-house technology team of more than 5,000 engineers, data scientists, architects, coaches and product managers strives to make Target the most convenient, safe and joyful place to shop. We use agile practices and leverage open-source software to adapt and build best-in-class technology for our team members and guests, and we do so with a focus on diversity and inclusion, experimentation and continuous learning.

Pyramid Overview
Our Product Engineering teams fuel Target's business with cutting-edge technology to deliver incredible experiences and value for guests and team members. Using a responsive architecture platform, we build and deploy industry-leading technology enabling Target to operate efficiently, securely, and reliably from the inside out. We work across Target, developing comprehensive product strategies, leveraging enterprise and guest feedback to set the standard for best in retail.
About You
As a lead engineer, you would be responsible for the following:
- Designing scalable architecture with the best choice of tech; responsible for all the services/functionalities that the team develops while ensuring quality of the team's code and/or infrastructure standards.
- Hands-on development, often taking on the more complicated tasks. Ensures the solution is production ready, deployable, scalable and resilient.
- Planning and delivering work in the team in addition to their own work.
- Promotes a learning culture through mentoring and coaching.
- Ensures product observability is in place for reliability. Fosters a culture of observability across teams and helps use operational data to improve stability and performance of their domains.
- Drives monitoring work on their team based on the organization's monitoring philosophy. Is aware of the operational data for their team's domain and uses it as a basis for driving changes to the team's services to achieve stability and performance improvements.
- Responsible for ensuring the security of the product and fostering a security-first mindset across teams. Highly skilled with applying and implementing security concepts such as identifying vulnerabilities in software, creating logic to detect malicious behavior, and analyzing network or host artifacts.
- Able to articulate a technical strategy, value of technology, and impact to the business.
- Provides guidance and cultivates solutions for the most complex problems across teams.
- Encourages the team to adopt a growth mindset. Educates the team about how competitors and technology companies evolve their technologies.
- Guides the team in anticipation of future use cases and helps them make design decisions that minimise the cost of future changes.
- Evaluates options, defines pros and cons by working with the team, and identifies the best option.
Position Overview
To succeed in this role, you'll bring 7+ years of experience in software design and development, with 5+ years focused on building scalable backend applications. You'll be a proven leader in guiding technical teams and driving products at scale to successful completion. Your expertise will lie in JVM-based technologies, specifically Java and Kotlin, along with a deep understanding of building robust, scalable systems.

Must Have Skills:
- Java / Kotlin (Advanced proficiency in Java/Kotlin development)
- Microservices Architecture (Designing, developing, and managing scalable microservices)
- Spring Boot or Micronaut (Experience with JVM-based frameworks, including reactive programming)
- Messaging Systems (Kafka, RabbitMQ)
- Databases (Experience with NoSQL databases like Cassandra, MongoDB, and SQL-based databases like PostgreSQL)
- CI/CD (Building and managing pipelines with Jenkins, GitLab, or similar tools)
- Unit and Integration Testing (Spock, JUnit, TestContainers, Selenium)
- Cloud Services (AWS, GCP, Azure)
- Containerization and Orchestration (Docker, Kubernetes)
- Monitoring & Observability (Grafana, ELK Stack, Prometheus)
- Event-Driven Architecture (Knowledge of event-driven patterns in distributed systems)

Good to Have Skills:
- Functional Programming (Familiarity with functional programming paradigms in Kotlin)
- GraphQL (Experience designing and integrating GraphQL APIs)
- Legacy System Modernization (Experience refactoring and modernizing older systems)
- Security Best Practices (OWASP, vulnerability scanning, secure coding principles)
- Agile Methodologies (Familiar with Scrum, Kanban, or other agile processes)

Know More Here
- Life at Target: https://india.target.com/
- Benefits: https://india.target.com/life-at-target/workplace/benefits
- Follow us on social media: https://www.linkedin.com/company/target/
- Target Tech: https://tech.target.com/

Posted 2 weeks ago

2.0 - 6.0 years

5 - 9 Lacs

Hyderabad

Work from Office

Cloudant is looking for a talented Infrastructure Engineer to help manage, evolve and operate our global service infrastructure. The infrastructure team’s role is to keep our bare metal and Kubernetes infrastructure secure, healthy and performant. We play a key role across the product by providing a solid foundation to deliver Cloudant’s serverless database as a service. As an engineer in the infrastructure team, you’ll be able to develop a deep expertise in the technologies that keep a large-scale cloud database online and available. You’ll help build automation to reduce the manual effort in managing our machines, contribute to the day-to-day maintenance of the systems, and ensure our infrastructure provides the right support for Cloudant’s key customer features and security standards. We prioritise engineer growth and have a lot of in-house experience to learn from. We code primarily in Python and Ruby. Our infrastructure is a mixture of bare metal machines running Debian and Kubernetes, running on IBM’s Cloud. This is managed using Chef and Terraform, along with a lot of homegrown automation to tie it all together. Over time, you will become a subject matter expert in our infrastructure and help out debugging and fixing service issues. This role involves on-call responsibilities.

Required education
Bachelor's Degree

Required technical and professional expertise
- Some experience with managing Linux machines using SSH or configuration management / Infrastructure as Code tooling (eg
- Skills writing code in a modern backend language (eg, Python, Go, Ruby).
- A focus on creating reliable code using techniques like unit testing and staged rollout.
- Comfortable working using pull requests and continuous integration.
- Experience with observability tooling (eg, Graphite, Prometheus, Grafana).
- Strong written skills in English and an ability to work in a distributed team.

Preferred technical and professional experience
- Experience maintaining systems within a compliance environment (eg, financial services, tools such as Auditree).
- Previous experience as an SRE for a large-scale service, especially maintaining database and observability systems.
- Significant experience with Linux, including networking and storage debugging.
- Comfortable working with open-source tools, contributing fixes where needed.

Posted 2 weeks ago

3.0 - 7.0 years

9 - 13 Lacs

Pune

Work from Office

As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, configure, and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting to deploying the latest software updates & fixes.

Your primary responsibilities include:
- 24x7 Observability: Be part of a worldwide team that monitors the health of production systems and services around the clock, ensuring continuous reliability and optimal customer experience.
- Cross-Functional Troubleshooting: Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues. Troubleshoot and resolve production issues effectively.
- Deployment and Configuration: Leverage Continuous Delivery (CI/CD) tools to deploy services and configuration changes at enterprise scale.
- Security and Compliance Implementation: Implementing security measures that meet or exceed industry standards for regulations such as GDPR, SOC2, ISO 27001, PCI, HIPAA, and FBA.
- Maintenance and Support: Tasks related to applying Couchbase security patches and upgrades, supporting Cassandra and Mongo for pager duty rotation, and collaborating with Couchbase Product support for issue resolution.

Required education
Bachelor's Degree

Required technical and professional expertise
- System Monitoring and Troubleshooting: Strong skills in monitoring/observability, issue response, and troubleshooting for optimal system performance.
- Automation Proficiency: Proficiency in automation for production environment changes, streamlining processes for efficiency, and reducing toil.
- Linux Proficiency: Strong knowledge of Linux operating systems.
- Operation and Support Experience: Demonstrated experience in handling day-to-day operations, alert management, incident support, migration tasks, and break-fix support.
- Experience with Infrastructure as Code (Terraform/OpenTofu).
- Experience with ELK/EFK stack (Elasticsearch, Logstash/Fluentd, and Kibana).

Preferred technical and professional experience
- Kubernetes/OpenShift: Strongly preferred experience in working with production Kubernetes/OpenShift environments.
- Automation/Scripting: In-depth experience with Ansible, Python, Terraform, and CI/CD tools such as Jenkins, IBM Continuous Delivery, ArgoCD.
- Monitoring/Observability: Hands-on experience crafting alerts and dashboards using tools such as Instana, Grafana/Prometheus.
- Experience working in an agile team, e.g., Kanban.

Posted 2 weeks ago

3.0 - 8.0 years

7 - 12 Lacs

Hyderabad

Work from Office

- Responsible for IT Infrastructure cross-platform technology areas, demonstrating design and build expertise.
- Responsible for developing, architecting, and building AWS Cloud services with best practices, blueprints, patterns, high-availability and multi-region disaster recovery.
- Strong communication and collaboration skills.

Required education
Bachelor's Degree

Preferred education
Master's Degree

Required technical and professional expertise
- BE / B Tech in any stream, M.Sc. (Computer Science/IT) / M.C.A, with minimum 3-5 plus years of experience.
- Must have 3+ years of relevant experience in Python/Java, AWS, Terraform (IaC).
- Experience in Kubernetes, Docker, Shell scripting.
- Experienced in the Python scripting language (not someone who can only write small scripts).

Preferred technical and professional experience
- Experience using DevOps tools in a cloud environment, such as Ansible, Artifactory, Docker, GitHub, Jenkins, Kubernetes, Maven, and SonarQube.
- Experience installing and configuring different application servers such as JBoss, Tomcat, and WebLogic.
- Experience using monitoring solutions like CloudWatch, ELK Stack, and Prometheus.

Posted 2 weeks ago

2.0 - 5.0 years

15 - 20 Lacs

Pune

Hybrid

- Monitor production systems & services using observability tools (logs, metrics, traces, dashboards)
- Respond to incidents
- Design, implement & maintain observability solutions (eg Prometheus, Grafana, ELK)
- Technical Operations & Continuous Improvement

Required Candidate profile
Must have:
- Experience in Azure services with AWS
- Hands-on with IaC tools such as Terraform
- Scripting skills in Python/Bash/PowerShell
- Familiarity with GitLab CI/CD tools

Notice Period - 1 month or less

Posted 2 weeks ago

15.0 - 20.0 years

5 - 9 Lacs

Chennai

Work from Office

Project Role: Application Developer
Project Role Description: Design

Posted 2 weeks ago

7.0 - 12.0 years

9 - 14 Lacs

Pune

Work from Office

Role Purpose
The purpose of this role is to provide solutions and bridge the gap between technology and business know-how to deliver any client solution.

7+ years of experience in Kubernetes, Helm charts, and API tools.

- Cloud and Kubernetes: 3+ years of hands-on experience with DevOps practices, especially in AWS EKS and the Kubernetes ecosystem. Familiarity with container orchestration, cluster scaling, and networking tools like Calico and Karpenter. Proficiency in creating and managing Helm charts for Kubernetes-based applications.
- API Management: Experience with API gateways and platforms like Tyk, Kong, or similar API management tools. Strong understanding of API security, authentication, and performance optimization.
- CI/CD Tools and Automation: Expertise in building CI/CD pipelines using AWS CodeCommit, GitHub, GitLab, or similar platforms. Strong knowledge of scripting and automation.
- Monitoring and Observability: Experience with monitoring and observability tools like Prometheus, Grafana, and OpenTelemetry for Kubernetes clusters. Hands-on experience with logging tools, specifically Elastic (ELK stack) for log management and analysis.
- Soft Skills: Strong problem-solving skills with attention to detail. Ability to work in a collaborative, fast-paced environment. Excellent communication skills and the ability to work cross-functionally with development and operations teams.

Skill upgradation and competency building:
- Clear Wipro exams and internal certifications from time to time to upgrade skills.
- Attend trainings and seminars to sharpen knowledge in the functional/technical domain.
- Write papers, articles, and case studies and publish them on the intranet.

Mandatory Skills: Kubernetes.
Experience: 5-8 Years.

Posted 2 weeks ago

Apply

8.0 - 12.0 years

25 - 35 Lacs

Bengaluru

Remote

Job Title: Sr. DevOps SRE. Location State: Karnataka. Location City: Bangalore (Hybrid/Remote). Experience Required: 8 to 12 years. CTC Range: 25 to 38 LPA. Shift: Day shift. Work Mode: Hybrid/Remote. Position Type: Contract (with possible extension). Openings: 6. Company Name: VARITE INDIA PRIVATE LIMITED. About The Client: An American multinational digital communications technology conglomerate corporation headquartered in San Jose, California. The Client develops, manufactures, and sells networking hardware, software, telecommunications equipment, and other high-technology services and products. The Client specializes in specific tech markets, such as the Internet of Things (IoT), domain security, videoconferencing, and energy management. It is one of the largest technology companies in the world, ranking 82nd on the Fortune 100 with over $51 billion in revenue and nearly 83,300 employees. About The Job: Hiring for Sr. DevOps SRE. Essential Job Functions: Key Responsibilities: Help build a new platform to support business transformation. Focus on automation within DevOps (tools, processes). Operate in production environments (Amazon cloud or on-prem datacenters). Strong exposure to Kubernetes clusters and observability tools. Top 3 skills needed: Kubernetes, the highest priority (hands-on in production cluster setup and management); observability and monitoring tools (Grafana, Splunk for logging, Prometheus); DevOps tools and practices. Must-Have Skills: Git (code repository), Python (basic to intermediate scripting), Docker, pipelines (CI/CD). Qualifications: Any graduate. How to Apply: Interested candidates are invited to submit their resume using the apply online button on this job post. Equal Opportunity Employer: VARITE is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, veteran status, or disability status. Unlock Rewards: Refer Candidates and Earn. If you're not available or interested in this opportunity, please pass this along to anyone in your network who might be a good fit and interested in our open positions. VARITE offers a Candidate Referral program, where you'll receive a one-time referral bonus based on the following scale if the referred candidate completes a three-month assignment with VARITE. Referral bonus by required experience: 0-2 yrs: INR 5,000; 2-6 yrs: INR 7,500; 6+ yrs: INR 10,000. About VARITE: VARITE is a global staffing and IT consulting company providing technical consulting and team augmentation services to Fortune 500 companies in the USA, UK, Canada, and India. VARITE is currently a primary and direct vendor to the leading corporations in the verticals of Networking, Cloud Infrastructure, Hardware and Software, Digital Marketing and Media Solutions, Clinical Diagnostics, Utilities, Gaming and Entertainment, and Financial Services.

Posted 2 weeks ago

Apply

14.0 - 20.0 years

15 - 20 Lacs

Pune

Hybrid

So, what’s the role all about? We are looking for a highly skilled and motivated Site Reliability Engineering (SRE) Manager to lead a team of SREs in designing, building, and maintaining scalable, reliable, and secure infrastructure and services. You will work closely with engineering, product, and security teams to improve system performance, availability, and developer productivity through automation and best practices. How will you make an impact? Build server-side software using Java. Lead and mentor a team of SREs; support their career growth and ensure strong team performance. Drive initiatives to improve availability, reliability, observability, and performance of applications and infrastructure. Establish SLOs/SLAs and implement monitoring systems, dashboards, and alerting to measure and uphold system health. Develop strategies for incident management, root cause analysis, and postmortem reporting. Build scalable automation solutions for infrastructure provisioning, deployments, and system maintenance. Collaborate with cross-functional teams to design fault-tolerant and cost-effective architectures. Promote a culture of continuous improvement and reliability-first engineering. Participate in capacity planning and infrastructure scaling. Manage on-call rotations and ensure incident response processes are effective and well-documented. Work in a fast-paced, fluid landscape while managing and prioritizing multiple responsibilities. Have you got what it takes? Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field. 10+ years of overall experience in SRE/DevOps roles, with at least 2 years managing technical teams. Proficiency in at least one programming language (e.g., Python, Go, Java, C#) and experience with scripting languages (e.g., Bash, PowerShell).
Deep understanding of cloud computing platforms (e.g., AWS) and of the working and reliability constraints of some of their prominent services (e.g., EC2, ECS, Lambda, DynamoDB). Experience with infrastructure as code tools such as CloudFormation and Terraform. Deep understanding of CI/CD concepts and experience with CI/CD tools such as Jenkins, GitLab CI/CD, or CircleCI. Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture. Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK). Working experience with the Grafana observability suite (Loki, Mimir, Tempo). Experience implementing the OpenTelemetry protocol in a microservice environment. Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems. Experience with incident management and blameless postmortems, including driving incident response efforts during outages and other critical incidents, resolution, and communication in a cross-functional team setup. Good-to-have skills: Hands-on experience working with large Kubernetes clusters; certification will be an added plus. Administration and/or development experience with standard monitoring and automation tools such as Splunk, Datadog, PagerDuty, and Rundeck. Familiarity with configuration management tools like Ansible, Puppet, or Chef. Certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent.

Posted 2 weeks ago

Apply

4.0 - 8.0 years

10 - 18 Lacs

Hyderabad

Work from Office

Responsibilities: * Design, develop, test & maintain backend systems using Java, J2EE, OOD, RESTful APIs & NoSQL databases.

Posted 2 weeks ago

Apply

6.0 - 10.0 years

11 - 12 Lacs

Hyderabad

Work from Office

We are seeking a highly skilled DevOps Engineer to join our dynamic development team. In this role, you will be responsible for designing, developing, and maintaining both frontend and backend components of our applications using DevOps and associated technologies. You will collaborate with cross-functional teams to deliver robust, scalable, and high-performing software solutions that meet our business needs. The ideal candidate will have a strong background in DevOps, experience with modern frontend frameworks, and a passion for full-stack development. Requirements: Bachelor's degree in Computer Science Engineering or a related field. 6 to 10+ years of experience in full-stack development, with a strong focus on DevOps. DevOps with AWS Data Engineer - Roles & Responsibilities: Use AWS services like EC2, VPC, S3, IAM, RDS, and Route 53. Automate infrastructure using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Build and maintain CI/CD pipelines using tools such as AWS CodePipeline, Jenkins, and GitLab CI/CD. Cross-functional collaboration. Automate build, test, and deployment processes for Java applications. Use Ansible, Chef, or AWS Systems Manager for managing configurations across environments. Containerize Java apps using Docker. Deploy and manage containers using Amazon ECS, EKS (Kubernetes), or Fargate. Monitoring and logging using Amazon CloudWatch, Prometheus + Grafana, the ELK Stack (Elasticsearch, Logstash, Kibana), and AWS X-Ray for distributed tracing. Manage access with IAM roles/policies. Use AWS Secrets Manager / Parameter Store for managing credentials. Enforce security best practices, encryption, and audits. Automate backups for databases and services using AWS Backup, RDS Snapshots, and S3 lifecycle rules. Implement Disaster Recovery (DR) strategies. Work closely with development teams to integrate DevOps practices. Document pipelines, architecture, and troubleshooting runbooks. Monitor and optimize AWS resource usage.
Use AWS Cost Explorer, Budgets, and Savings Plans. Must-Have Skills: Experience working on Linux-based infrastructure. Excellent understanding of Ruby, Python, Perl, and Java. Configuring and managing databases such as MySQL and MongoDB. Excellent troubleshooting skills. Selecting and deploying appropriate CI/CD tools. Working knowledge of various tools, open-source technologies, and cloud services. Awareness of critical concepts in DevOps and Agile principles. Managing stakeholders and external interfaces. Setting up tools and required infrastructure. Defining and setting development, testing, release, update, and support processes for DevOps operation. Technical skills to review, verify, and validate the software code developed in the project.

Posted 2 weeks ago

Apply

4.0 - 9.0 years

5 - 9 Lacs

Bengaluru

Work from Office

Experienced as an OpsRamp Developer/Architect. Hands-on experience with Prometheus and OpenTelemetry. Experience with data pipelines and redirecting Prometheus metrics to OpsRamp. Proficiency in scripting and programming languages such as Python, Ansible, and Bash. Familiarity with CI/CD deployment pipelines (Ansible, Git). Strong knowledge of performance monitoring, metrics, capacity planning, and management. Excellent communication skills with the ability to articulate technical details to different audiences. Experience with application onboarding, capturing requirements, understanding data sources, and architecture diagrams. Will work collaboratively with clients and the team, adhering to critical timelines and deliverables. The general scope of the work for this position is as follows: Design, implement, and optimize OpsRamp solutions in a multi-tenant model. Implement and configure OpsRamp components: Gateway, discovery, OpsRamp agents, instrumentation via Prometheus, etc. OpsRamp for infrastructure, network, and application observability. OpsRamp event management. Create and maintain comprehensive documentation for OpsRamp configurations and processes. Ensure seamless integration between OpsRamp and other element monitoring tools and ITSM platforms. Develop and maintain advanced dashboards and visualizations.

Posted 2 weeks ago

Apply

8.0 - 12.0 years

0 Lacs

Karnataka

On-site

As a Site Reliability Engineering (SRE) Technical Leader on the Network Assurance Data Platform (NADP) team at ThousandEyes, you will be responsible for ensuring the reliability, scalability, and security of cloud and big data platforms. Your role will involve representing the NADP SRE team, working in a dynamic environment, and providing technical leadership in defining and executing the team's technical roadmap. Collaborating with cross-functional teams, including software development, product management, customers, and security teams, is essential. Your contributions will directly impact the success of machine learning (ML) and AI initiatives by ensuring a robust and efficient platform infrastructure aligned with operational excellence. In this role, you will design, build, and optimize cloud and data infrastructure to ensure high availability, reliability, and scalability of big-data and ML/AI systems. Collaboration with cross-functional teams will be crucial in creating secure, scalable solutions that support ML/AI workloads and enhance operational efficiency through automation. Troubleshooting complex technical problems, conducting root cause analyses, and contributing to continuous improvement efforts are key responsibilities. You will lead the architectural vision, shape the team's technical strategy and roadmap, and act as a mentor and technical leader to foster a culture of engineering and operational excellence. Engaging with customers and stakeholders to understand use cases and feedback, translating them into actionable insights, and effectively influencing stakeholders at all levels are essential aspects of the role. Utilizing strong programming skills to integrate software and systems engineering, building core data platform capabilities and automation to meet enterprise customer needs, is a crucial requirement. 
Developing strategic roadmaps, processes, plans, and infrastructure to efficiently deploy new software components at an enterprise scale while enforcing engineering best practices is also part of the role. Qualifications for this position include 8-12 years of relevant experience and a bachelor's engineering degree in computer science or its equivalent. Candidates should have the ability to design and implement scalable solutions with a focus on streamlining operations. Strong hands-on experience in Cloud, preferably AWS, is required, along with Infrastructure as Code skills, ideally with Terraform and EKS or Kubernetes. Proficiency in observability tools like Prometheus, Grafana, Thanos, CloudWatch, OpenTelemetry, and the ELK stack is necessary. Writing high-quality code in Python, Go, or equivalent programming languages is essential, as well as a good understanding of Unix/Linux systems, system libraries, file systems, and client-server protocols. Experience in building Cloud, Big data, and/or ML/AI infrastructure, architecting software and infrastructure at scale, and certifications in cloud and security domains are beneficial qualifications for this role. Cisco emphasizes diversity and encourages candidates to apply even if they do not meet every single qualification. Diverse perspectives and skills are valued, and Cisco believes that diverse teams are better equipped to solve problems, innovate, and create a positive impact.

Posted 3 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

Chandigarh

On-site

As a DevOps Engineer, you will be responsible for designing, implementing, and managing CI/CD pipelines to streamline software development and deployment processes. Your role will involve overseeing Jenkins management for continuous integration and automation, as well as deploying and managing cloud infrastructure using AWS services. Additionally, you will configure and optimize brokers such as RabbitMQ, Kafka, or similar messaging systems to ensure efficient communication between microservices. Monitoring, troubleshooting, and enhancing system performance, security, and reliability will also be key aspects of your responsibilities. Collaboration with developers, QA, and IT teams to optimize development workflows is a crucial part of this role. To excel in this position, you are required to have AWS Certification (preferably AWS Certified DevOps Engineer, Solutions Architect, or equivalent) along with strong experience in CI/CD pipeline automation using tools like Jenkins, GitLab CI/CD, or GitHub Actions. Proficiency in Jenkins management, including installation, configuration, and troubleshooting is essential. Knowledge of brokers for messaging and event-driven architectures, hands-on experience with containerization tools like Docker, and proficiency in scripting and automation (using Python, Bash, or similar) are also necessary. Additionally, experience with monitoring and logging tools such as Prometheus, Grafana, ELK stack, or CloudWatch, as well as an understanding of networking, security, and cloud best practices are important qualifications. Preferred skills for this role include experience in mobile and web application development environments and familiarity with Agile and DevOps methodologies. This is a full-time position with benefits including paid sick time, paid time off, and a performance bonus. The work schedule is on the day shift from Monday to Friday, and the work location is in person.

Posted 3 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

Pune, Maharashtra

On-site

We are seeking a skilled DevOps Specialist with over 3 years of experience to join a global automotive team in Kochi, Pune, or Chennai. Your primary responsibilities include managing operations, system monitoring, troubleshooting, and supporting automation workflows to ensure operational stability and excellence for enterprise IT projects within the automotive industry. Your daily tasks will involve proactive incident tracking, log analysis, and resource monitoring to maintain application availability and system performance. You will also be responsible for responding to tickets raised by the DevOps team or end-users, troubleshooting issues, and maintaining detailed incident logs and SLAs. Additionally, you will assist in scheduled changes, releases, and maintenance activities, while documenting processes, runbooks, and knowledge base articles. To excel in this role, you must have proficiency in logfile analysis, Linux administration, and monitoring tools such as AppDynamics, Checkmk, Prometheus, and Grafana. Experience with security tools like Black Duck, SonarQube, and OWASP is required, along with hands-on experience in Docker. Familiarity with CI/CD tools (Jenkins, GitLab), container platforms (Docker, Kubernetes), and cloud services (AWS, Azure) is essential. Your communication, analytical, and organizational skills will be crucial in providing regular updates to stakeholders, preparing root cause analysis reports, and effectively collaborating with the team. Experience in handling confidential data and safety-sensitive systems is necessary, along with the ability to work in a team environment. Optional experience in the automotive or manufacturing industry, particularly production management systems, and familiarity with IT process frameworks like SCRUM and ITIL will be advantageous.

Posted 3 weeks ago

Apply

7.0 - 12.0 years

25 - 32 Lacs

Pune

Work from Office

Hi, greetings from GSN! Pleasure connecting with you! We have been in corporate search services, identifying and bringing in stellar, talented professionals for our reputed IT / non-IT clients in India. We have been successfully delivering results for our clients' varied needs for the last 20 years. Who are we looking for? A skilled IT Operations Consultant specializing in monitoring and observability to design, implement, and optimize monitoring solutions for our customers. A strong background in monitoring, observability, and IT service management is a MUST. (1) Work location: Pune; (2) Job role: Lead Engineer; (3) Experience: 7+ yrs; (4) CTC range: Rs. 25 LPA to Rs. 30 LPA; (5) Work type: WFO. ****** Looking for SHORT JOINERS ****** Job Description: Required Skills: Strong understanding of infrastructure and platform development principles and experience with languages and tools such as Python and Ansible for developing custom scripts. Strong knowledge of monitoring frameworks, logging systems (ELK stack, Fluentd), and tracing tools (Jaeger, Zipkin), along with open-source solutions like Prometheus and Grafana. Extensive experience with monitoring and observability solutions such as OpsRamp, Dynatrace, and New Relic; must have worked with ITSM integration (e.g., integration with ServiceNow, BMC Remedy, etc.). Working experience with RESTful APIs and understanding of API integration with the monitoring tools. Knowledge of ITIL processes and Service Management frameworks. Familiarity with security monitoring and compliance requirements. Familiarity with AIOps and machine learning techniques for anomaly detection and incident prediction. Excellent analytical and problem-solving skills, with the ability to debug and troubleshoot complex automation issues. Roles & Responsibilities: Design end-to-end monitoring and observability solutions to provide comprehensive visibility into infrastructure, applications, and networks.
Implement monitoring tools and frameworks (e.g., Prometheus, Grafana, OpsRamp, Dynatrace, New Relic) to track key performance indicators and system health metrics. Integrate monitoring and observability solutions with IT Service Management tools. Develop and deploy dashboards and reports to proactively identify and address system performance issues. Architect scalable observability solutions to support hybrid and multi-cloud environments. Collaborate with infrastructure, development, and DevOps teams to ensure seamless integration of monitoring systems into CI/CD pipelines. Continuously optimize monitoring configurations and thresholds to minimize noise and improve incident detection accuracy. Utilize AIOps and machine learning capabilities for intelligent incident management and predictive analytics. Work closely with business stakeholders to define monitoring requirements and success metrics. Document monitoring architectures, configurations, and operational procedures. ****** Looking for SHORT JOINERS ****** If interested, don't hesitate to click APPLY for an IMMEDIATE response. Best Wishes, GSN HR | Google review: https://g.co/kgs/UAsF9W

Posted 3 weeks ago

Apply

5.0 - 10.0 years

11 - 12 Lacs

Hyderabad

Work from Office

We are seeking a highly skilled DevOps Engineer to join our dynamic development team. In this role, you will be responsible for designing, developing, and maintaining both frontend and backend components of our applications using DevOps and associated technologies. You will collaborate with cross-functional teams to deliver robust, scalable, and high-performing software solutions that meet our business needs. The ideal candidate will have a strong background in DevOps, experience with modern frontend frameworks, and a passion for full-stack development. Requirements: Bachelor's degree in Computer Science Engineering or a related field. 5 to 10+ years of experience in full-stack development, with a strong focus on DevOps. DevOps with AWS Data Engineer - Roles & Responsibilities: Use AWS services like EC2, VPC, S3, IAM, RDS, and Route 53. Automate infrastructure using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Build and maintain CI/CD pipelines using tools such as AWS CodePipeline, Jenkins, and GitLab CI/CD. Cross-functional collaboration. Automate build, test, and deployment processes for Java applications. Use Ansible, Chef, or AWS Systems Manager for managing configurations across environments. Containerize Java apps using Docker. Deploy and manage containers using Amazon ECS, EKS (Kubernetes), or Fargate. Monitoring and logging using Amazon CloudWatch, Prometheus + Grafana, the ELK Stack (Elasticsearch, Logstash, Kibana), and AWS X-Ray for distributed tracing. Manage access with IAM roles/policies. Use AWS Secrets Manager / Parameter Store for managing credentials. Enforce security best practices, encryption, and audits. Automate backups for databases and services using AWS Backup, RDS Snapshots, and S3 lifecycle rules. Implement Disaster Recovery (DR) strategies. Work closely with development teams to integrate DevOps practices. Document pipelines, architecture, and troubleshooting runbooks. Monitor and optimize AWS resource usage.
Use AWS Cost Explorer, Budgets, and Savings Plans. Must-Have Skills: Experience working on Linux-based infrastructure. Excellent understanding of Ruby, Python, Perl, and Java. Configuring and managing databases such as MySQL and MongoDB. Excellent troubleshooting skills. Selecting and deploying appropriate CI/CD tools. Working knowledge of various tools, open-source technologies, and cloud services. Awareness of critical concepts in DevOps and Agile principles. Managing stakeholders and external interfaces. Setting up tools and required infrastructure. Defining and setting development, testing, release, update, and support processes for DevOps operation. Technical skills to review, verify, and validate the software code developed in the project.

Posted 3 weeks ago

Apply

8.0 - 12.0 years

0 Lacs

Pune, Maharashtra

On-site

As an online travel booking platform, Agoda is committed to connecting travelers with a vast network of accommodations, flights, and more. With cutting-edge technology and a global presence, Agoda strives to enhance the travel experience for customers worldwide. As part of Booking Holdings and headquartered in Asia, Agoda boasts a diverse team of over 7,100 employees from 95+ nationalities across 27 markets. The work environment at Agoda is characterized by diversity, creativity, and collaboration, fostering innovation through a culture of experimentation and ownership. The core purpose of Agoda is to bridge the world through travel, believing that travel enriches lives, facilitates learning, and brings people and cultures closer together. By enabling individuals to explore and experience the world, Agoda aims to promote empathy, understanding, and happiness. As a member of the Observability Platform team at Agoda, you will be involved in building and maintaining the company's time series database and log aggregation system. This critical infrastructure processes a massive volume of data daily, supporting various monitoring tools and dashboards. The team faces challenges in scaling data collection efficiently while minimizing costs. 
In this role, you will have the opportunity to: develop fault-tolerant, scalable solutions in multi-tenant environments; tackle complex problems in distributed and highly concurrent settings; and enhance observability tools for all developers at Agoda. To succeed in this role, you will need: a minimum of 8 years of experience in writing performant code using JVM languages (Java/Scala/Kotlin) or Rust (C++); hands-on experience with observability products like Prometheus, InfluxDB, VictoriaMetrics, Elasticsearch, and Grafana Loki; proficiency in working with messaging queues such as Kafka; a deep understanding of concurrency and multithreading, with an emphasis on code simplicity and performance; and strong communication and collaboration skills. It would be great if you also have: expertise in database internals, indexes, and data formats (Avro, Protobuf); familiarity with observability data types like logs and metrics, and proficiency in using profilers, debuggers, and tracers in a Linux environment; previous experience in building large-scale time series data stores and monitoring solutions; knowledge of open-source components like S3 (Ceph), Elasticsearch, and Grafana; and the ability to work at a low level when required. Agoda is an Equal Opportunity Employer and maintains a policy of considering all applications for future positions. For more information about our privacy policy, please refer to our website. Please note that Agoda does not accept third-party resumes and is not responsible for any fees associated with unsolicited resumes.

Posted 3 weeks ago

Apply

Featured Companies