310 Elk Jobs - Page 3

2.0 - 6.0 years

0 Lacs

Karnataka

On-site

You will be responsible for developing and maintaining high-performance server-side applications in Python following SOLID design principles. You will design, build, and optimize low-latency, scalable applications and integrate user-facing elements with server-side logic via RESTful APIs. Maintaining ETL and data pipelines, implementing secure data handling protocols, and managing authentication and authorization across systems will be crucial aspects of your role. Additionally, you will implement security measures and set up efficient deployment practices using Docker and Kubernetes. Leveraging caching solutions for enhanced performance and scalability will also be part of your responsibilities. To excel in this role, you should have strong experience in Python and proficiency in at least one Python web framework such as FastAPI or Flask. Familiarity with ORM libraries, asynchronous programming, event-driven architecture, and messaging tools like Apache Kafka or RabbitMQ is required. Experience with NoSQL and vector databases, Docker, Kubernetes, and caching tools like Redis will be beneficial. Additionally, you should possess strong unit testing and debugging skills and the ability to use monitoring and logging frameworks effectively. You should have a minimum of 1.5 years of professional experience in backend development roles with Python. Your expertise in setting up efficient deployment practices, handling data securely, and optimizing application performance will be essential for success in this position.

Posted 1 week ago

6.0 - 12.0 years

0 Lacs

Karnataka

On-site

As a DevOps Engineer at Capgemini, you will have the opportunity to shape your career according to your aspirations in a supportive and inspiring environment. You will work with a collaborative global community of colleagues to push the boundaries of what is achievable. By joining us, you will play a key role in assisting the world's top organizations in harnessing the full potential of technology to create a more sustainable and inclusive world. Your responsibilities will include building and managing CI/CD pipelines using tools such as Jenkins, GitLab CI, and Azure DevOps. You will automate infrastructure deployment using Terraform, Ansible, or CloudFormation, and set up monitoring systems with Prometheus, Grafana, and ELK. Managing containers with Docker and orchestrating them through Kubernetes will be a crucial part of your role. Additionally, you will collaborate closely with developers to integrate DevOps practices into the Software Development Life Cycle (SDLC). To excel in this position, you should ideally possess 6 to 12 years of experience in DevOps, CI/CD, and Infrastructure as Code (IaC). Your expertise should extend to Docker, Kubernetes, and cloud platforms such as AWS, Azure, or GCP. Experience with monitoring tools like Prometheus, Grafana, and ELK is essential, along with knowledge of security, compliance, and performance aspects. Being ready for on-call duties and adept at handling production issues are also required skills for this role. At Capgemini, you will enjoy a flexible work environment with hybrid options, along with a competitive salary and benefits package. Your career growth will be supported through opportunities for SAP and cloud certifications. You will thrive in an inclusive and collaborative workplace culture that values teamwork and diversity. Capgemini is a global leader in business and technology transformation, facilitating organizations in their digital and sustainable evolution. 
With a diverse team of over 340,000 members across 50 countries, Capgemini leverages its 55-year legacy to deliver comprehensive services and solutions, ranging from strategy and design to engineering. The company's expertise in AI, generative AI, cloud, and data, combined with industry knowledge and partnerships, enables clients to unlock the true potential of technology to meet their business requirements effectively.

Posted 1 week ago

3.0 - 5.0 years

10 - 20 Lacs

Pune

Work from Office

Roles and Responsibilities
Key Responsibilities:
- Develop and maintain backend systems using Java (Spring Boot)
- Build RESTful APIs and integrate with databases and third-party services
- Write unit and integration tests to ensure code quality
- Participate in code reviews and collaborate with peers and senior engineers
- Follow clean code principles and best practices in microservices design
- Support CI/CD deployment pipelines and container-based workflows
- Continuously learn and stay updated with backend technologies
Required Skills and Qualifications:
- 3+ years of backend development experience in Java (Java 8+) and Spring Boot
- Strong understanding of REST APIs, JPA/Hibernate, and SQL databases (e.g., PostgreSQL, MySQL)
- Knowledge of software engineering principles and design patterns
- Experience with testing frameworks like JUnit and Mockito
- Familiarity with Docker and CI/CD tools
- Good communication and team collaboration skills
Nice to Have:
- Exposure to Kubernetes and cloud platforms (AWS, GCP, etc.)
- Familiarity with messaging systems like Kafka or RabbitMQ
- Awareness of security standards and authentication protocols (OAuth2, JWT)
- Interest in DevOps practices and monitoring tools (Prometheus, ELK, etc.)

Posted 1 week ago

5.0 - 9.0 years

24 - 42 Lacs

Gurugram

Work from Office

* Expert in monitoring solutions, OpenShift/OCP, Docker, Kubernetes, Ansible, Terraform, Helm, ELK
* Expert in CI/CD and automation
* Compliance standards: RBAC, SCCs, network segmentation, CIS hardening
* CLI: oc, kubectl, Helm, Kustomize
* DevOps
Benefits: health insurance, annual bonus, provident fund

Posted 1 week ago

5.0 - 9.0 years

0 Lacs

Maharashtra

On-site

As an Engineering Manager focusing on the OSS Platform & Infrastructure team, you will be responsible for leading and managing a team of engineers to ensure the successful development and maintenance of the organization's platform. Your role will require a deep understanding and practical experience in various technical domains. You should have hands-on expertise in Infrastructure as Code (IaC), Cloud Platforms, Continuous Integration/Continuous Deployment (CI/CD) Pipelines, Containerization & Orchestration, and Site Reliability Engineering (SRE) principles. Your experience should include working in a product-oriented environment with leadership responsibilities in engineering. In addition, you must demonstrate strong proficiency and practical experience with tools such as Ansible, Terraform, CloudFormation, and Pulumi. Knowledge of resource management frameworks like Apache Mesos, Kubernetes, and Yarn is essential. Expertise in Linux operating systems and experience in monitoring, logging, and observability using tools like Prometheus, Grafana, and ELK stack is also required. Furthermore, your programming skills should encompass at least one high-level language such as Python, Java, or Golang. A solid understanding of architectural and systems design, including scalability and resilience patterns, various databases (RDBMS & NoSQL), and familiarity with multi-cloud and hybrid-cloud architectures is crucial for this role. Additionally, highly valued skills for this position include expertise in Network and infrastructure operational product engineering. Knowledge of Network Protocols such as TCP/IP, UDP, HTTP/HTTPS, DNS, BGP, OSPF, VXLAN, IPSec, and having a CCNA or equivalent certification would be advantageous. Experience in Network Security, Network Automation, zero trust concepts, TLS/SSL, VPNs, and protocols like gNMI, gRPC, and RESTCONF is desirable. 
Proficiency in Agile methodologies like Scrum and Kanban, backlog and workflow management, as well as SRE-specific reporting metrics (MTTR, deployment frequency, SLO, etc.), will also be beneficial for excelling in this role.

Posted 1 week ago

5.0 - 8.0 years

15 - 20 Lacs

Bengaluru

Work from Office

We are looking for a skilled Oracle PCF Engineer with hands-on experience in the design, integration, deployment, and support of Policy and Charging Function (PCF) solutions in 4G/5G networks. The ideal candidate should have solid expertise in Oracle Communications PCRF/PCF, strong telecom domain knowledge, and the ability to troubleshoot complex policy-related network issues in real time.
Roles and Responsibilities:
- Design, configure, and deploy Oracle PCF solutions across 4G/5G network environments.
- Perform integration with other core network elements such as CHF, SMF, AMF, and UDR.
- Implement and test policy rules as per service provider requirements.
- Collaborate with solution architects, testers, and system integrators for end-to-end service flow validation.
- Monitor and analyze PCF performance using tools and logs; perform root cause analysis for incidents.
- Provide L2/L3 support, including post-deployment troubleshooting and upgrades.
- Participate in capacity planning, software patching, and lifecycle management of PCF platforms.
- Maintain high availability and redundancy of the PCF nodes.
- Prepare and maintain technical documentation, configurations, and operational procedures.
- Work with cross-functional teams (Core, OSS/BSS, Cloud, Security) to align PCF behavior with network policies.
Primary Skills:
- In-depth knowledge of 3GPP standards for 4G LTE and 5G SA/NSA networks, particularly for Policy and Charging Control (PCC) architecture.
- Experience with Oracle Communications Policy Management (OCPM).
- Strong understanding of network functions like SMF, AMF, CHF, UDR, and interfaces such as N7, N15, Gx, Rx, Sy.
- Proficiency in Linux/Unix systems, shell scripting, and system-level debugging.
- Familiarity with Diameter, HTTP/2, and REST-based protocols.
- Experience in working with Kubernetes-based deployments, VNFs, or CNFs.
- Exposure to CI/CD tools, automation frameworks, and telecom service orchestration.
Good to Have:
- Experience with 5GC cloud-native deployments (e.g., on OCI, AWS, or OpenStack).
- Familiarity with Oracle OCI or other telecom cloud platforms.
- Knowledge of 5G QoS models, slice management, and dynamic policy handling.
- Experience with monitoring tools like Prometheus, Grafana, or ELK stack.
- Prior involvement in DevOps or SRE practices within telecom environments.

Posted 1 week ago

8.0 - 13.0 years

16 - 22 Lacs

Hyderabad

Work from Office

Looking for a Data Engineer with 8+ years of experience to build scalable data pipelines on AWS/Azure, work with big data tools (Spark, Kafka), and support analytics teams. Must have strong coding skills in Python/Java and experience with SQL/NoSQL and cloud platforms.
Required Candidate Profile:
- Strong experience in Java/Scala/Python.
- Worked with big data technologies: Spark, Kafka, Flink, etc.
- Built real-time and batch data pipelines.
- Cloud: AWS, Azure, or GCP.

Posted 1 week ago

6.0 - 10.0 years

0 Lacs

Chennai, Tamil Nadu

On-site

You should have over 10 years of experience working in a large enterprise with diverse teams. Specifically, you should possess at least 6 years of expertise in APM and monitoring technologies and a minimum of 3 years of experience with ELK. Your responsibilities will include designing and implementing efficient log shipping and data ingestion processes, collaborating with development and operations teams to enhance logging capabilities, and configuring components of the Elastic Stack such as Filebeat, Metricbeat, Winlogbeat, Logstash, and Kibana. Additionally, you will be required to create and maintain comprehensive documentation for Elastic Stack configurations, ensure seamless integration between various components, build advanced Kibana dashboards and visualizations, and manage Elasticsearch clusters on-premises. Hands-on experience with scripting and programming tools like Python, Ansible, and Bash, as well as knowledge of security hardening, vulnerability/compliance management, and CI/CD deployment pipelines, is essential for this role. You should also have a strong understanding of performance monitoring, metrics, planning, and management, and the ability to apply systematic and creative problem-solving approaches. Experience in application onboarding, influencing other teams to adopt best practices, effective communication skills, and familiarity with tools like ServiceNow, Confluence, and JIRA are highly desirable. Understanding of SRE and DevOps principles is also crucial. In terms of technical skills, you should be proficient in APM tools like ELK, AppDynamics, and PagerDuty; programming languages such as Java, .NET, and Python; operating systems like Linux and Windows; automation tools including GitLab and Ansible; container orchestration with Kubernetes; and cloud platforms like Microsoft Azure and AWS. If you meet these qualifications and are interested in this opportunity, please share your resume with gopinath.sonaiyan@flyerssoft.com.

Posted 1 week ago

10.0 - 14.0 years

0 Lacs

Karnataka

On-site

As a Senior Full Stack Developer (Java + React) in the Fintech / Insurance domain, you will be responsible for leveraging your 10+ years of experience in full-stack development to deliver high-quality solutions. Your technical strengths should include proficiency in Java 11+, Spring Boot, and REST APIs. Additionally, you must possess strong expertise in React, TypeScript, and modern frontend frameworks. In this role, experience with microservices and micro frontend architecture is crucial, along with cloud deployment experience, preferably on Azure. You should also have knowledge of Kafka, distributed systems, and API gateways. Familiarity with observability tools such as Grafana, ELK, Prometheus, and Splunk is highly desirable. Experience with Strapi CMS and OpenFeature for feature management would be an added advantage. Apart from your technical skills, strong leadership and communication abilities are essential. You should have experience leading Agile development teams and be capable of managing risks, dependencies, and third-party integrations. Confidence in working with cross-functional and remote teams is a key requirement for this role. This is a contractual/temporary position with a contract length of 6 months. The work location is remote, and you must align with Singapore hours. Your role as a Senior Full Stack Developer will involve collaborating with a dynamic team to deliver innovative solutions in the Fintech / Insurance domain.

Posted 1 week ago

5.0 - 9.0 years

0 Lacs

Karnataka

On-site

NTT DATA is looking to hire an Azure Cloud Engineer to join their team in Bangalore, Karnataka, India. As an Azure Cloud Engineer, you will be responsible for working in the Banking Domain as an Azure consultant. You should hold a Bachelor's/Master's degree in Computer Science or Data Science, along with 5 to 8 years of experience in software development and data structures/algorithms. The ideal candidate will have 5 to 7 years of experience with programming languages such as Python or Java and with SQL and NoSQL databases. Additionally, you should have 5 years of experience in developing large-scale platforms, distributed systems or networks, and be familiar with compute technologies and storage architecture. A strong understanding of microservices architecture is essential for this role. Experience in building AKS applications on Azure, as well as a deep understanding of Kubernetes for availability and scalability of applications in Azure Kubernetes Service, is required. You should also have experience in building and deploying applications with Azure using third-party tools like Docker, Kubernetes, and Terraform. The role will involve working with AKS clusters, VNETs, NSGs, Azure storage technologies, Azure container registries, etc. A good understanding of building Redis, Elasticsearch, and MongoDB applications is preferred, along with experience with RabbitMQ. An end-to-end understanding of ELK, Azure Monitor, DataDog, Splunk, and the logging stack is beneficial. Candidates should have experience with development tools and CI/CD pipelines such as GitLab CI/CD, Artifactory, Cloudbees, Jenkins, Helm, Terraform, etc. Understanding of IAM roles on Azure and integration/configuration experience is required, preferably with experience working on Data Robot setup or similar applications on Cloud/Azure. Experience in functional, integration, and security testing, as well as performance validation, is also necessary for this role.
NTT DATA is a trusted global innovator of business and technology services, serving 75% of the Fortune Global 100. As a Global Top Employer, NTT DATA has diverse experts in more than 50 countries and a robust partner ecosystem. Their services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is a leading provider of digital and AI infrastructure globally, committed to helping organizations and society move confidently into the digital future.

Posted 1 week ago

8.0 - 12.0 years

0 Lacs

Karnataka

On-site

As a Site Reliability Engineering (SRE) Technical Leader on the Network Assurance Data Platform (NADP) team at Cisco ThousandEyes, you will be responsible for ensuring the reliability, scalability, and security of the cloud and big data platforms. Your role will involve representing the NADP SRE team, contributing to the technical roadmap, and collaborating with cross-functional teams to design, build, and maintain SaaS systems operating at multi-region scale. Your efforts will be crucial in supporting machine learning (ML) and AI initiatives by ensuring the platform infrastructure is robust, efficient, and aligned with operational excellence. You will be tasked with designing, building, and optimizing cloud and data infrastructure to guarantee high availability, reliability, and scalability of big-data and ML/AI systems. This will involve implementing SRE principles such as monitoring, alerting, error budgets, and fault analysis. Additionally, you will collaborate with various teams to create secure and scalable solutions, troubleshoot technical problems, lead the architectural vision, and shape the technical strategy and roadmap. Your role will also encompass mentoring and guiding teams, fostering a culture of engineering and operational excellence, engaging with customers and stakeholders to understand use cases and feedback, and utilizing your strong programming skills to integrate software and systems engineering. Furthermore, you will develop strategic roadmaps, processes, plans, and infrastructure to efficiently deploy new software components at an enterprise scale while enforcing engineering best practices. To be successful in this role, you should have relevant experience (8-12 yrs) and a bachelor's engineering degree in computer science or its equivalent. 
You should possess the ability to design and implement scalable solutions, hands-on experience in cloud (preferably AWS), Infrastructure as Code skills, experience with observability tools, proficiency in programming languages such as Python or Go, and a good understanding of Unix/Linux systems and client-server protocols. Experience in building cloud, big data, and/or ML/AI infrastructure is essential, along with a sense of ownership and accountability in architecting software and infrastructure at scale. Additional qualifications that would be advantageous include experience with the Hadoop ecosystem, certifications in cloud and security domains, and experience in building/managing a cloud-based data platform. Cisco encourages individuals from diverse backgrounds to apply, as the company values perspectives and skills that emerge from employees with varied experiences. Cisco believes in unlocking potential and creating diverse teams that are better equipped to solve problems, innovate, and make a positive impact.

Posted 1 week ago

8.0 - 12.0 years

0 Lacs

Ahmedabad, Gujarat

On-site

As a Lead DevOps Engineer at GrowExx, you will collaborate with cross-functional teams to define, design, and implement DevOps infrastructure while adhering to best practices of Infrastructure as Code (IaC). Your primary goal will be to ensure a robust and stable CI/CD process that maximizes efficiency and achieves 100% automation. You will be responsible for analyzing system requirements comprehensively to develop effective Test Automation Strategies for applications. Additionally, your role will involve designing infrastructure using cloud platforms such as AWS, GCP, Azure, or others. You will also manage code repositories like GitHub, GitLab, or Bitbucket, and automate software quality gateways using SonarQube. In this position, you will design branching and merging strategies, create CI pipelines using tools like Jenkins, CircleCI, or Bitbucket, and establish automated build and deployment processes with rollback mechanisms. Identifying and mitigating infrastructure security and performance risks will be crucial, along with designing Disaster Recovery and Backup policies and Infrastructure/Application Monitoring processes. Your role will also involve formulating DevOps strategies for projects with a focus on quality, performance, and cost considerations. Conducting cost/benefit analysis for proposed infrastructures, automating software delivery processes for distributed development teams, and promoting software craftsmanship will be key responsibilities. You will be expected to identify new tools and processes, and train teams on their adoption.
Key Skills:
- Hands-on experience with LLM models and evaluation metrics for LLMs.
- Proficiency in managing infrastructure on cloud platforms like AWS, GCP, or Azure.
- Expertise in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi.
- Managing code repositories using GitHub, GitLab, or Bitbucket, and implementing effective branching and merging strategies.
- Designing and maintaining robust CI/CD pipelines with tools like Jenkins, CircleCI, or Bitbucket Pipelines.
- Automating software quality checks using SonarQube.
- Understanding of automated build and deployment processes, including rollback mechanisms.
- Knowledge of infrastructure security best practices and risk mitigation.
- Designing disaster recovery and backup strategies.
- Experience with monitoring tools like Prometheus, Grafana, ELK, Datadog, or New Relic.
- Defining DevOps strategies aligned with project goals.
- Conducting cost-benefit analyses for optimal infrastructure solutions.
- Automating software delivery processes for distributed teams.
- Passion for software craftsmanship and evangelizing DevOps best practices.
- Strong leadership, communication, and training skills.
Education and Experience:
- B.Tech or B.E./BCA/MCA/M.E. degree.
- 8+ years of relevant experience with team-leading experience.
- Experience in Agile methodologies, Scrum and Kanban, project management, planning, risk identification, and mitigation.
Analytical and Personal Skills:
- Strong logical reasoning and analytical skills.
- Effective communication in English (written and verbal).
- Ownership and accountability in work.
- Interest in new technologies and trends.
- Multi-tasking and team management abilities.
- Coaching and mentoring skills.
- Managing multiple stakeholders and resolving conflicts diplomatically.
- Forward-thinking mindset.

Posted 1 week ago

3.0 - 7.0 years

0 Lacs

Chennai, Tamil Nadu

On-site

You should have hands-on experience with Ingress controllers such as Traefik, Nginx, and Envoy. Familiarity with configuration and release management tools like Concourse, Ansible, Chef, or Puppet would be beneficial. Additionally, you should have practical exposure to tools like Prometheus, Grafana, ELK, and Sleuth. Proficiency in using various unit, integration, and end-to-end testing frameworks is also required.

Posted 1 week ago

5.0 - 9.0 years

0 Lacs

Vadodara, Gujarat

On-site

You will be responsible for designing, developing, and implementing hybrid cloud environments. You will deploy and automate infrastructure and platform services in public clouds (AWS, GCP, and Azure) using Terraform and Ansible. Additionally, you will design and manage continuous deployment using Kubernetes and Jenkins. It will be your duty to design, implement, and execute Backup and Recovery and Business Continuity processes. You will also be tasked with implementing industry-standard security processes for best practices and compliance (SOC 2, ISO 27001, FedRAMP, HIPAA, etc.), leveraging public cloud services such as AWS GuardDuty, Web Application Firewall, and CloudTrail. Monitoring environments for security vulnerabilities, taking actions to remediate and/or mitigate risks, and monitoring applications and services within the environments will be part of your routine. You will be expected to join the on-call rotation using Datadog, Elasticsearch, and Opsgenie, taking actions to resolve issues and implementing strategies to prevent future occurrences. Troubleshooting and root cause analysis for service incidents using Jira Service Desk and the alerting and monitoring tools listed above will also fall under your responsibilities. Setting up intelligent application performance alerts in Datadog and Elasticsearch to identify and resolve issues before they impact business services and end users will be crucial. It is essential to continuously learn about technologies outside of your realm of expertise that help drive. You will work collaboratively with software engineering to develop and deploy our systems. To succeed in this role, you should have an understanding of how cloud-based web applications work and an interest in measuring, analyzing, and improving distributed systems. Familiarity with web application development using JavaScript, Java, AngularJS, and PostgreSQL or SQL Server databases is required.
You must possess 5-7 years of experience with public cloud deployments, both hybrid and pure public cloud. Experience with Docker and Kubernetes in production, automation tools like Terraform or Ansible, networking and security technology for cloud services, continuous deployment tools such as Jenkins or CircleCI, and logging and monitoring tools for SaaS such as ELK, Splunk, and Datadog is essential. Strong communication skills, both written and verbal, are necessary. Being well-organized, able to take direction and work independently, possessing teamwork skills, and holding a BS or MS in Computer Science are all important qualifications for this role.

Posted 1 week ago

1.0 - 5.0 years

4 - 8 Lacs

Gurugram

Work from Office

Company Description
About Holiday Tribe: Holiday Tribe is a Great Place To Work Certified, seed-stage VC-funded travel-tech brand based in Gurugram. We specialize in crafting unforgettable leisure travel experiences by integrating advanced technology, leveraging human expertise, and prioritizing customer success. With holidays curated across 30+ destinations worldwide, partnerships with renowned tourism boards, and recognition as the Emerging Holiday Tech Company at the India Travel Awards 2023, Holiday Tribe is transforming the travel industry. Our mission is to redefine how Indians experience holidays, making travel planning faster, smarter, and more personalized, ensuring every trip is truly seamless and unforgettable.
Role Description:
- Design, deploy, and manage scalable infrastructure on AWS and/or GCP for production and staging environments.
- Manage Kubernetes clusters for container orchestration, ensuring high availability and performance of services.
- Build and maintain CI/CD pipelines using Jenkins or GitHub Actions to streamline deployments.
- Implement observability stacks including Prometheus, Grafana, New Relic, and ELK (Elasticsearch, Logstash, Kibana) for proactive monitoring and logging.
- Collaborate with engineering teams to ensure secure, fast, and cost-efficient infrastructure for backend services and AI orchestration.
- Own incident response workflows, set up alerts, and help improve system reliability and root cause analysis practices.
- Automate common operational tasks and contribute to internal tooling to reduce toil.
Qualifications:
- 2+ years of hands-on DevOps experience, preferably in a product-based startup.
- Strong proficiency in cloud platforms (AWS/GCP) including VPCs, EC2/GKE, IAM, etc.
- Hands-on experience managing Kubernetes clusters in production.
- Proficiency in CI/CD systems, especially Jenkins (bonus if GitHub Actions or GitLab CI).
- Deep understanding of observability tools like Prometheus, Grafana, New Relic, and ELK Stack.
- Experience with scripting (Bash, Python) and infrastructure-as-code tools (Terraform, Helm is a plus).
- Good communication skills and ability to work closely with developers and product teams.
Perks:
- Help define backend architecture of a next-gen travel product.
- Work closely with a product and AI team solving real-world travel use cases.
- Freedom to experiment and grow as an engineer.
- Competitive compensation and equity.
Why Join Us:
- Competitive salary and performance-based incentives.
- Opportunities for growth and career development in a rapidly expanding company.
- A dynamic and collaborative work environment with a focus on innovation and customer satisfaction.

Posted 1 week ago

2.0 - 7.0 years

15 - 20 Lacs

Bengaluru

Work from Office

As an AI Ops Expert, you will take full ownership of deliverables, meeting defined quality standards within the defined timeline and budget.
Responsibilities:
- Design, implement, and manage AIOps solutions to automate and optimize AI/ML workflows.
- Collaborate with data scientists, engineers, and other stakeholders to ensure seamless integration of AI/ML models into production.
- Monitor and maintain the health and performance of AI/ML systems.
- Develop and maintain CI/CD pipelines for AI/ML models.
- Implement best practices for model versioning, testing, and deployment.
- Troubleshoot and resolve issues related to AI/ML infrastructure and workflows.
- Stay up to date with the latest AIOps, MLOps, and Kubernetes tools and technologies.
Requirements and skills:
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- 2-7 years of relevant experience.
- Proven experience in AIOps, MLOps, or related fields.
- Strong proficiency in Python and experience with FastAPI.
- Strong hands-on expertise in Kubernetes (or AKS).
- Hands-on experience with MS Azure and its AI/ML services, including Azure ML Flow.
- Proficiency in using DevContainer for development.
- Knowledge of CI/CD tools such as Jenkins, GitHub Actions, or Azure DevOps.
- Experience with containerization and orchestration tools like Docker and Kubernetes.
- Strong problem-solving skills and the ability to work in a fast-paced environment.
- Excellent communication and collaboration skills.
Preferred Skills:
- Experience with machine learning frameworks such as TensorFlow, PyTorch, or scikit-learn.
- Familiarity with data engineering tools like Apache Kafka, Apache Spark, or similar.
- Knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK stack.
- Understanding of data versioning tools like DVC or MLflow.
- Experience with infrastructure as code (IaC) tools like Terraform or Ansible.
- Proficiency in Azure-specific tools and services, such as Azure Machine Learning (Azure ML), Azure DevOps, Azure Kubernetes Service (AKS), Azure Functions, Azure Logic Apps, Azure Data Factory, Azure Monitor, and Application Insights.

Posted 1 week ago

Apply

4.0 - 8.0 years

15 - 20 Lacs

Bengaluru

Work from Office

**JIRA Data Center architecture** (multi-node clustering, shared databases, and file systems). **Linux systems administration**. **Scripting** (Bash, Python, or Groovy). **Observability stack** (ELK, Prometheus, Grafana, etc.). **Cloud providers** (AWS, GCP, or Azure) if hosted on cloud infrastructure. **PostgreSQL / Oracle** as JIRA backend DBs. Familiarity with the **Atlassian tools ecosystem**: Confluence, Bitbucket, Crowd. Technologies / OS: Python, Terraform, Docker, Linux, Windows, Network. CI/CD Tools: JIRA System ADMIN, JIRA Functional ADMIN, Github, Github Actions, Jenkins, Sonarqube, JFrog Artifactory, Sonatype Nexus, JFrog XRAY, Kubernetes. Soft Skills: Communication, Customer service, Autonomy, Problem-solving, Adaptability, Team Spirit, Time Management.

Posted 1 week ago

Apply

4.0 - 9.0 years

9 - 14 Lacs

Bengaluru

Work from Office

Minimum of 4 years of software development experience with demonstrated expertise in standard development best-practice methodologies. SKILLS REQUIRED: Spark, Scala, Python, HDFS, Hive, Scheduler (Oozie, Airflow), Kafka, Spark/Scala, SQL, RDBMS, Docker, Kubernetes, RabbitMQ/Kafka, monitoring tools (Splunk or ELK). Profile required: Integrate test frameworks into the development process. Refactor existing solutions to make them reusable and scalable. Work with operations to get the solutions deployed. Take ownership of production deployment of code. Collaborate with and/or lead cross-functional teams; build and launch applications and data platforms at scale, either for revenue-generating or operational purposes. Come up with coding and design best practices. Thrive in a self-motivated, internal-innovation-driven environment. Adapt quickly to new application knowledge and changes.

Posted 1 week ago

Apply

5.0 - 9.0 years

0 Lacs

ahmedabad, gujarat

On-site

As a Senior DevOps Engineer at TechBlocks, you will be responsible for designing and managing robust, scalable CI/CD pipelines, automating infrastructure with Terraform, and improving deployment efficiency across GCP-hosted environments. With 5-8 years of experience in DevOps engineering roles, your expertise in CI/CD, infrastructure automation, and Kubernetes will be crucial for the success of our projects. In this role, you will own the CI/CD strategy and configuration, implement DevSecOps practices, and drive an automation-first culture within the team. Your key responsibilities will include designing and implementing end-to-end CI/CD pipelines using tools like Jenkins, GitHub Actions, and Argo CD for production-grade deployments. You will also define branching strategies and workflow templates for development teams, automate infrastructure provisioning using Terraform, Helm, and Kubernetes manifests, and manage secrets lifecycle using Vault for secure deployments. Collaborating with engineering leads, you will review deployment readiness, ensure quality gates are met, and integrate DevSecOps tools like Trivy, SonarQube, and JFrog into CI/CD workflows. Monitoring infrastructure health and capacity planning using tools like Prometheus, Grafana, and Datadog, you will implement alerting rules, auto-scaling, self-healing, and resilience strategies in Kubernetes. Additionally, you will drive process documentation, review peer automation scripts, and provide mentoring to junior DevOps engineers. Your role will be pivotal in ensuring the reliability, scalability, and security of our systems while fostering a culture of innovation and continuous learning within the team. TechBlocks is a global digital product engineering company with 16+ years of experience, helping Fortune 500 enterprises and high-growth brands accelerate innovation, modernize technology, and drive digital transformation. 
We believe in the power of technology and the impact it can have when coupled with a talented team. Join us at TechBlocks and be part of a dynamic, fast-moving environment where big ideas turn into real impact, shaping the future of digital transformation.

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

Netradyne harnesses the power of Computer Vision and Edge Computing to revolutionize the modern-day transportation ecosystem. We are a leader in fleet safety solutions, experiencing exponential growth year over year. As our solution gains recognition as a disruptive technology, our team is expanding. We are seeking proactive and competitive team members to support our continued growth. As a Site Reliability Engineer (SRE) at Netradyne within the Cloud Services department in Bangalore, India, you will play a crucial role in ensuring high availability and performance of our software solutions. Our SREs are responsible for empowering customers with cutting-edge features and real-time insights from massive-scale data. We are looking for individuals who can bring fresh perspectives, collaborate effectively across teams, and deliver innovative solutions to enhance user experiences. **Role and Responsibilities:** - Participate in an on-call rotation for incident response and implement proactive measures to prevent incidents. - Develop monitoring alerts and incident response processes to ensure high availability and reliability. - Document actions taken during incidents and create automated solutions for improved incident response. - Collaborate with the engineering team to support ongoing projects with expertise in reliability, performance, and efficiency. - Deliver high-quality managed services to ensure optimal uptime and scalability of infrastructure, applications, and cloud services. - Automate detection and resolution of recurring issues to enhance system stability. - Build tools and automation frameworks to eliminate repetitive tasks and prevent incident occurrence. - Continuously improve engineering and operational processes to enhance efficiency and productivity. - Demonstrate strong programming skills and systems understanding to support service reliability and scalability. 
- Foster a culture of continuous improvement by advocating process changes and best practices. - Engage in continuous learning to expand skills through experimentation or training. **Soft Skills:** - Ability to work asynchronously and independently. - Strong collaboration skills and willingness to work as part of a team. - Excellent problem-solving skills with clear thinking under pressure. - Strong analytical and management skills. - Effective communication and documentation skills. **Qualifications:** - Bachelor's or Graduate degree in Computer Engineering, Computer Science, Engineering, Information Systems Management, or equivalent experience. - Experience with Monitoring/Observability/Log tools such as AWS CloudWatch, Datadog, Prometheus/Grafana, and ELK. - Proficiency with Public Cloud platforms, LINUX/UNIX environments, and programming languages such as Java, Python, or Go. - Familiarity with Agile methodologies, SaaS environments, RDBMS, NoSQL databases, Cloud Architecture, and Frontend/Backend Systems and tools. - Comfortable with scripting and debugging production systems and services. - Strong collaboration skills with a mindset for continuous improvement. - Expertise in scalability and root cause analysis exercises. If your skills and experiences align with our requirements, we will reach out to you directly. Netradyne is an equal-opportunity employer. Applicants only - Recruiting agencies, please refrain from contacting us. For available job openings, please visit Netradyne.com/company/careers. To learn more about avoiding and reporting scams, visit the Federal Trade Commission's job scams website.

Posted 2 weeks ago

Apply

7.0 - 11.0 years

40 - 45 Lacs

Noida, Ahmedabad, Chennai

Work from Office

Dear Candidate, We are looking for a skilled Cloud DevOps Engineer to automate, deploy, and manage cloud infrastructure. If you have expertise in AWS, CI/CD pipelines, and Infrastructure as Code (IaC), we'd love to hear from you! Key Responsibilities: Design and implement cloud-based DevOps solutions. Automate infrastructure provisioning using Terraform, Ansible, or CloudFormation. Manage CI/CD pipelines for application deployment. Monitor system performance, security, and reliability. Optimize cloud resources for cost efficiency and scalability. Troubleshoot and resolve production issues in cloud environments. Required Skills & Qualifications: Hands-on experience with AWS, Azure, or Google Cloud. Expertise in CI/CD tools (Jenkins, GitLab CI, ArgoCD). Strong scripting skills (Python, Bash, PowerShell). Knowledge of Kubernetes, Docker, and container orchestration. Familiarity with logging and monitoring tools (Prometheus, Grafana, ELK). Soft Skills: Strong analytical and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Kandi Srinivasa, Delivery Manager, Integra Technologies

Posted 2 weeks ago

Apply

10.0 - 14.0 years

0 Lacs

pune, maharashtra

On-site

As a global leader in cybersecurity, CrowdStrike is dedicated to protecting the people, processes, and technologies that drive modern organizations. Since 2011, our unwavering mission has been to prevent breaches, and we have revolutionized modern security with the most advanced AI-native platform in the world. Our diverse customer base spans all industries, relying on CrowdStrike to ensure the continuity of their businesses, the safety of their communities, and the progression of their lives. We are a mission-driven company that fosters a culture empowering every CrowdStriker with the autonomy and flexibility to steer their careers. CrowdStrike is constantly seeking passionate individuals to join our team who exhibit boundless enthusiasm, an unwavering focus on innovation, and a deep commitment to our customers, community, and each other. Are you ready to be part of a mission that makes a difference? The future of cybersecurity commences with you. The CrowdStrike Information Technology team is currently seeking a Staff IT Monitoring Engineer/Site Reliability Engineer (SRE) to take charge of designing, implementing, and evolving our enterprise monitoring and observability platforms. In this pivotal role, you will be responsible for architecting scalable monitoring solutions, leading reliability initiatives, and serving as a technical authority on monitoring best practices. Your duties will involve mentoring junior team members, collaborating with cross-functional teams to establish Service Level Objectives (SLOs), and playing a vital role in major incident management. This position necessitates advanced technical prowess, strategic thinking, and the ability to strike a balance between operational excellence and innovation. **What You'll Need** **Required Skills and Qualifications:** - Possess 10+ years of experience with enterprise monitoring platforms and observability tools such as LogicMonitor, Datadog, LogScale, Zscaler Digital Experience (ZDX), and ThousandEyes.
- Demonstrate advanced proficiency in multiple scripting/programming languages including Python, Go, and Bash. - Exhibit expert knowledge of modern monitoring ecosystems such as Prometheus, Grafana, and ELK. - Showcase experience in architecting monitoring solutions at scale across hybrid environments. - Have a strong background in SRE practices encompassing SLO definition, error budgets, and reliability engineering. - Possess advanced knowledge of cloud platforms like AWS, GCP, and their native monitoring capabilities. - Demonstrate expertise in log aggregation, metrics and Key Performance Indicators (KPIs) collection, and distributed tracing implementations. - Experience in designing and implementing automated remediation systems. - Strong understanding of Infrastructure as Code and GitOps principles. - Proven ability to mentor junior engineers and provide technical leadership. **Shift timings:** 12 PM - 9 PM IST. In summary, we are looking for a seasoned professional to lead our monitoring and observability initiatives, drive reliability improvements, and play a pivotal role in incident management. If you possess the required skills and are passionate about making a difference in the cybersecurity landscape, we welcome you to join our team at CrowdStrike.

Posted 2 weeks ago

Apply

7.0 - 11.0 years

0 Lacs

karnataka

On-site

As a backend engineer at Latinum, you will play a crucial role in designing and developing robust and scalable systems to address complex business challenges. You will be part of a dynamic team of high-performance engineers focused on creating efficient and high-throughput solutions. To excel in this role, you must have a minimum of 7+ years of hands-on experience in backend engineering. You should demonstrate proficiency in Core Java backend engineering and Microservices and Cloud architecture. Candidates with expertise in both areas will be considered for senior positions within the team. In the realm of Java & Backend Engineering, you should be well-versed in Java 8+ concepts such as Streams, Lambdas, Functional Interfaces, and Optionals. Additionally, you should have experience with Spring Core, Spring Boot, object-oriented principles, multithreading, and collections. Knowledge of Kafka, JPA, RDBMS/NoSQL, and design patterns is essential for this role. In the Microservices, Cloud & Distributed Systems domain, familiarity with REST APIs, OpenAPI/Swagger, Spring Boot, and Kafka Streams is required. Experience with event-driven patterns, GraphQL, cloud-native applications on AWS, CI/CD pipelines, and observability tools like ELK and Prometheus will be beneficial. Moreover, additional skills in Node.js, React, Angular, Golang, Python, and web platforms like AEM and Sitecore are considered advantageous. Proficiency in TDD, mocking, security testing, and architecture artifacts is a plus. Your key responsibilities will include designing and developing scalable backend systems using Java and Spring Boot, building event-driven microservices and cloud-native APIs, and implementing secure and high-performance solutions. You will collaborate with cross-functional teams to define architecture, conduct code reviews, and ensure production readiness. 
Additionally, troubleshooting, optimizing, and monitoring distributed systems and mentoring junior engineers will be part of your role, especially for senior positions. Join Latinum's team of dedicated engineers and be part of a challenging and rewarding environment where you can contribute to cutting-edge solutions and innovative technologies.

Posted 2 weeks ago

Apply

10.0 - 14.0 years

0 Lacs

haryana

On-site

Are you a skilled and experienced .NET Architect looking for an exciting opportunity to drive innovation and collaborate with Red Hat and client teams? Join our client in Gurgaon for a full-time onsite position from Monday to Friday and play a key role in designing, building, and mentoring teams in the adoption of modern Cloud Native Development practices. As a Senior .NET Architect, your responsibilities will include designing and developing scalable, secure microservices using .NET Core (6/8) and C#, implementing robust CD pipelines with tools like Azure DevOps, GitLab, and Jenkins, leading teams with Test-Driven Development (TDD) principles, and building and running containerized .NET applications with Podman. You will also be tasked with setting up local multi-container environments using Podman Compose/Docker Compose, driving automated API testing strategies for RESTful services, enabling full-stack observability with tools such as Prometheus, OpenTelemetry, ELK, and Splunk, mentoring developers, conducting code reviews, and contributing to architecture decisions and system design. We are seeking candidates with over 10 years of hands-on experience in developing .NET applications, strong expertise in C#, .NET Core, and Microservices Architecture, deep knowledge of containerization using Podman, familiarity with TDD, CI/CD, and API Testing frameworks (e.g., XUnit, Postman, Playwright), experience with SQL Server / PostgreSQL and Entity Framework Core, and strong mentorship and communication skills. If you are passionate about learning, collaborating, and building modern, cloud-native apps, we would like to hear from you.
To apply for this position, please send your updated resume to career@strive4x.net with the subject line "Application for .NET Architect [Your Name]" and include the following details in your email: Total Experience, Relevant Experience in .NET Core, Podman, Microservices, Current CTC (LPA), Expected CTC (LPA), Notice Period, Current Location & Willingness to Relocate (if applicable), and Availability for Interview. Kindly indicate whether you have a PF account (Yes/No). Please note that this is an immediate requirement, and preference will be given to candidates available to join within 15-20 days. Candidates with a notice period exceeding 30 days will not be considered. Only applicants meeting the required experience, skill set, and location criteria should apply. Multiple technical interview rounds will be conducted, and PF account, background verification, and no dual employment are mandatory for this position.

Posted 2 weeks ago

Apply

2.0 - 6.0 years

0 Lacs

maharashtra

On-site

Cowbell is signaling a new era in cyber insurance by harnessing technology and data to provide small and medium-sized enterprises (SMEs) with advanced warning of cyber risk exposures bundled with cyber insurance coverage adaptable to the threats of today and tomorrow. Championing adaptive insurance, Cowbell follows policyholders' cyber risk exposures as they evolve through continuous risk assessment and continuous underwriting. In its unique AI-based approach to risk selection and pricing, Cowbell's underwriting platform, powered by Cowbell Factors, compresses the insurance process from submission to issue to less than 5 minutes. Founded in 2019 and based in the San Francisco Bay Area, Cowbell has rapidly grown, now operating across the U.S., Canada, U.K., and India. This growth was recently bolstered by a successful Series C fundraising round of $60 million from Zurich Insurance. This investment not only underscores the confidence in Cowbell's mission but also accelerates our capacity to revolutionize cyber insurance on a global scale. With the backing of over 25 prominent reinsurance partners, Cowbell is poised to redefine how SMEs navigate the evolving landscape of cyber threats. Your future team: At Cowbell, our Platform Support team is all about making things smoother for our internal folks and API partners. We do this by bringing our A-game in technical know-how and product smarts, always keeping our users' needs front and center. About The Role: As a Level 3 Support Engineer II at Cowbell, you will play a crucial role in enhancing our customers' experience. You will be responsible for identifying and resolving production anomalies, collaborating closely with our global team of Level 2 and Level 3 engineers to tackle complex issues. Your dedication and urgency will be key in providing timely resolutions, ensuring our platform users remain unblocked and operations run smoothly.
What You Will Do: Issue Resolution: - Triage and troubleshoot user-reported issues, identifying the most effective resolution path. - Implement production hotfixes and data changes to unblock users and mitigate issues. - Lead code enhancement initiatives to improve platform resilience and reduce critical incidents. - Prioritize and manage multiple incidents and deadlines in a fast-paced environment. On-Call & Alerting: - Participate in on-call rotations to triage, investigate, prioritize, and resolve critical bugs. - Become proficient in setting metric-based alerts, investigating logs, and identifying fixes or escalating to appropriate service owners when necessary. Knowledge & Collaboration: - Identify patterns and group similar/related issues to determine optimal resolutions. - Collaborate with other teams to build and maintain Level 2 & Level 3 Knowledge Bases, Status pages, incident notes, and other internal and external platform resources. - Maintain strong collaboration, communication, and interaction with all stakeholders (platform users, engineering, QA, and product teams) on support-related topics. What We Need From You: We are seeking a highly motivated and experienced Level 3 Platform Support Engineer to join our team. The ideal candidate will possess a strong technical background, excellent problem-solving abilities, and a commitment to continuous learning and growth. Education & Experience: - Bachelor's degree in Computer Science or a related field, or equivalent practical experience. - Minimum of three years of experience with Spring Boot applications. - Minimum of two years of experience with Java Microservices. Technical Skills: - Demonstrated expertise in software application debugging and troubleshooting. - Familiarity with technical support processes and escalation management. - Proficiency with Microservices, RESTful web services, and Kafka. - Experience with ELK and RDS/Postgres. 
- Hands-on experience with continuous software deployment in containerized microservices on public cloud infrastructure. Soft Skills: - Proactive and eager to learn new technologies and concepts daily. - Exceptional interpersonal skills, including clear and professional written and verbal communication. - Possess a responsible, reliable, confident, committed, empathetic, genuine, and helpful working style. - Self-motivated, self-directed, adaptable, and capable of managing multiple tasks effectively. - Strong ownership mindset, embracing both responsibility and accountability. Bonus/Nice to Have: - Experience working on a Software as a Service (SaaS) product. - Familiarity with continuous integration and automated testing. - Prior experience with tools such as JIRA, JIRA Service Management, Fire Hydrant, Datadog, Honeycomb, Komodor, Postman, and Tableau. - Knowledgeable and comfortable working with Copilot and other LLM tools to assist in troubleshooting and code fixes. What Cowbell brings to the table: - Employee equity plan for all and wealth enablement plan for select customer-facing roles. - Comprehensive wellness program, meditation app subscriptions, lunch and learn, book club, happy hours, and much more. - Professional development and the opportunity to learn the ins and outs of cyber insurance and cybersecurity, as well as continuing to build your professional skills in a team environment. Equal Employment Opportunity: Cowbell is a leading innovator in cyber insurance, dedicated to empowering businesses to always deliver their intended outcomes as the cyber threat landscape evolves. Guided by our core values of TRUE (Transparency, Resiliency, Urgency, and Empowerment), we are on a mission to be the gold standard for businesses to understand, manage, and transfer cyber risk. At Cowbell, we foster a collaborative and dynamic work environment where every employee is empowered to contribute and grow.
We pride ourselves on our commitment to transparency and resilience, ensuring that we not only meet but exceed industry standards. We are proud to be an equal opportunity employer, promoting a diverse and inclusive workplace where all voices are heard and valued. Our employees enjoy competitive compensation, comprehensive benefits, and continuous opportunities for professional development. For more information, please visit https://cowbell.insure/.

Posted 2 weeks ago

Apply
