Jobs
Interviews

4 Victoria Metrics Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

karnataka

On-site

The ideal candidate for the Site Reliability Engineer (SRE) position should have at least 5 years of experience in a similar role. Proficiency in Python, Bash, and strong scripting skills are necessary for this role. The candidate should demonstrate expertise in tasks such as monitoring, rollouts/deployments, and operational responsibilities. Knowledge of Grafana, Prometheus, Victoria Metrics, and CEPH would be advantageous. As the SRE Implementation Engineer, you will be tasked with promoting the adoption of SRE principles throughout the engineering organization. Collaboration with development, operations, and infrastructure teams is essential to integrate reliability-focused practices like monitoring, automation, capacity planning, and incident management into the development lifecycle. Your contributions will be pivotal in scaling and automating critical systems to maintain high availability and performance. At GlobalLogic, we foster a culture of caring where people are the top priority. You will be welcomed into an inclusive environment that values acceptance and belonging, enabling you to establish meaningful connections with collaborative colleagues, supportive managers, and empathetic leaders. Continuous learning and development are core values at GlobalLogic. You will have access to various opportunities for personal and professional growth, including programs, training curricula, and hands-on experiences to enhance your skills and advance your career. Our Career Navigator tool is just one of the many resources available to help you thrive. Working at GlobalLogic means engaging in interesting and impactful projects for clients worldwide. You will have the chance to contribute to innovative solutions that make a difference, leveraging your problem-solving skills to help clients reimagine possibilities and bring new products to market. We believe in maintaining a healthy balance between work and personal life. With diverse career areas, roles, and flexible work arrangements, you can explore ways to achieve an optimal work-life balance. Our goal is to support you in integrating work and personal life seamlessly, ensuring that you have fun along the way. GlobalLogic is a high-trust organization that values integrity above all. Joining our team means being part of a safe, reliable, and ethical global company that prioritizes honesty, transparency, and trust in all interactions with employees and clients. Your decision to be a part of GlobalLogic reflects your trust in our commitment to integrity and ethical practices. GlobalLogic, a Hitachi Group Company, is a leading digital engineering partner for the world's most innovative companies. With a history dating back to 2000, we have been instrumental in shaping the digital landscape by creating cutting-edge products and experiences. Our collaboration with clients continues to drive business transformation and industry redefinition through intelligent platforms, products, and services.,

Posted 2 days ago

Apply

8.0 - 12.0 years

0 Lacs

pune, maharashtra

On-site

As an online travel booking platform, Agoda is committed to connecting travelers with a vast network of accommodations, flights, and more. With cutting-edge technology and a global presence, Agoda strives to enhance the travel experience for customers worldwide. As part of Booking Holdings and headquartered in Asia, Agoda boasts a diverse team of over 7,100 employees from 95+ nationalities across 27 markets. The work environment at Agoda is characterized by diversity, creativity, and collaboration, fostering innovation through a culture of experimentation and ownership. The core purpose of Agoda is to bridge the world through travel, believing that travel enriches lives, facilitates learning, and brings people and cultures closer together. By enabling individuals to explore and experience the world, Agoda aims to promote empathy, understanding, and happiness. As a member of the Observability Platform team at Agoda, you will be involved in building and maintaining the company's time series database and log aggregation system. This critical infrastructure processes a massive volume of data daily, supporting various monitoring tools and dashboards. The team faces challenges in scaling data collection efficiently while minimizing costs. In this role, you will have the opportunity to: - Develop fault-tolerant, scalable solutions in multi-tenant environments - Tackle complex problems in distributed and highly concurrent settings - Enhance observability tools for all developers at Agoda To succeed in this role, you will need: - Minimum of 8 years of experience in writing performant code using JVM languages (Java/Scala/Kotlin) or Rust (C++) - Hands-on experience with observability products like Prometheus, InfluxDB, Victoria Metrics, Elasticsearch, and Grafana Loki - Proficiency in working with messaging queues such as Kafka - Deep understanding of concurrency, multithreading, and emphasis on code simplicity and performance - Strong communication and collaboration skills It would be great if you also have: - Expertise in database internals, indexes, and data formats (AVRO, Protobuf) - Familiarity with observability data types like logs and metrics and proficiency in using profilers, debuggers, and tracers in a Linux environment - Previous experience in building large-scale time series data stores and monitoring solutions - Knowledge of open-source components like S3 (Ceph), Elasticsearch, and Grafana - Ability to work at low-level when required Agoda is an Equal Opportunity Employer and maintains a policy of considering all applications for future positions. For more information about our privacy policy, please refer to our website. Please note that Agoda does not accept third-party resumes and is not responsible for any fees associated with unsolicited resumes.,

Posted 3 weeks ago

Apply

6.0 - 8.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

About MoEngage: MoEngage is an insights-led customer engagement platform trusted by 1,350+ global consumer brands, including McAfee, Flipkart, Domino's, Nestle, Deutsche Telekom, and OYO. MoEngage combines data from multiple sources to help brands gain a 360-degree view of their customers. arms marketers and product owners with insights into customer behavior. Brands can leverageto orchestrate journeys and build 1:1 conversations across the website, mobile, email, social, and messaging channels., the transactional messaging infrastructure, helps unify promotional and transactional communication to a single platform for better insights and lower costs. MoEngage'shelps marketers develop winning copies and creatives, optimize campaigns and channels that boost engagement, and help with faster execution. For over a decade, consumer brands in 60+ countries have been using MoEngage to power digital experiences for over a billion monthly customers. With offices in 15 countries, MoEngage is backed by Goldman Sachs Asset Management, B Capital, Steadview Capital, Multiples Private Equity, Eight Roads, F-Prime Capital, Matrix Partners, Ventureast, and Helion Ventures. MoEngage was named a Contender in The Forrester Wave: Real-Time Interaction Management, Q1 2024 report, and Strong Performer in The Forrester Wave 2023 report. MoEngage was also featured as a Leader in the IDC MarketScape: Worldwide Omni-Channel Marketing Platforms for B2C Enterprises 2023. Our team is the backbone of MoEngage, we manage TBs of data for multiple teams, which we store in more than 50 clusters and handle 500+ EC2 servers in over 5 regions. Our team is responsible for the installation, configuration, upgrade, and migration of databases. We work closely with developers of applications that run against the database to make sure that best practices are followed for good performance and results. We process, in real-time, more than 40 Billion events per month. On an average day, we send more than 3 Billion Intelligent push notifications through our systems and build stats for them, all in real-time. Processing Speed is super critical to everything we do. As a team member, you will be constantly challenged to save those extra milliseconds and nanoseconds from your processing time. We are a small and close-knit team, we believe in learning and growing together. Requirements : 6+ years of hands-on experience on NoSQL/SQL databases with at least 4 years as DBA on MongoDB Experienced in hosting, maintaining, and owning large MongoDB clusters on the cloud. Experience in scripting language and tools like Ansible and Terraform Great in debugging skills, should be able to look at related metrics and narrow down possible causes of the problem. Dive deep/reproduce those issues. Communicate with the application team and bring them to closure. Bonus if experienced in managing ScyllaDB clusters, AWS ElastiCache Bonus if experience in one or more of the Time Series Databases - like InfluxDB, Prometheus, Victoria Metrics Roles and Responsibilities : Engineering Excellence: Constantly thrive to explore optimizations in database configurations, infrastructure, cost, new features, and performance improvements Identify parts of the system that do not scale/are non-reliable, provide immediate measures, and drive long-term resolution of such cases Owning the reliability and availability of MongoDB and ScyllaDB infra in the cloud - Servers (EC2 Instances), storage etc Influence developers to adopt to right standards and practices which lead to ease of operations, higher reliability, and cost efficiency Build Self-healing capabilities for the databases Operational Excellence Enhancing the scalability and performance of existing database architecture - adding and removing shards regularly Performing database maintenance, migration, and upgrading hardware and software. Monitor the overall health parameters of the clusters like CPU Utilisation, Memory utilization, Operation Execution times, Replication lag, and load balancing of data and queries, and identify the stress areas Conducting diagnostic tests, evaluating performance metrics, and ensuring high availability or uptime of database services Documenting processes and complying with best practices in database management Automation Mindset Look for opportunities to reduce toil by using automation. Keen to bring new ideas for automating day-to-day database operations using code. On-call Be a part of the on-call rotation and be the first responder to all database-related issues. During on-call, being able to respond, mitigate, fix, and escalate issues if required. Keep documentation of the on-call issues to avoid recurrence. Why Join Us! At MoEngage, we are passionate about our team and technology - see below to know more about us. We handle more than a billion messages every day. Rest assured, you will be surrounded by really smart and passionate people as we scale much more to build a world-class technology team.

Posted 1 month ago

Apply

4.0 - 6.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

About MoEngage: MoEngage is an insights-led customer engagement platform trusted by 1,350+ global consumer brands, including McAfee, Flipkart, Domino's, Nestle, Deutsche Telekom, and OYO. MoEngage combines data from multiple sources to help brands gain a 360-degree view of their customers. arms marketers and product owners with insights into customer behavior. Brands can leverageto orchestrate journeys and build 1:1 conversations across the website, mobile, email, social, and messaging channels., the transactional messaging infrastructure, helps unify promotional and transactional communication to a single platform for better insights and lower costs. MoEngage'shelps marketers develop winning copies and creatives, optimize campaigns and channels that boost engagement, and help with faster execution. For over a decade, consumer brands in 60+ countries have been using MoEngage to power digital experiences for over a billion monthly customers. With offices in 15 countries, MoEngage is backed by Goldman Sachs Asset Management, B Capital, Steadview Capital, Multiples Private Equity, Eight Roads, F-Prime Capital, Matrix Partners, Ventureast, and Helion Ventures. MoEngage was named a Contender in The Forrester Wave: Real-Time Interaction Management, Q1 2024 report, and Strong Performer in The Forrester Wave 2023 report. MoEngage was also featured as a Leader in the IDC MarketScape: Worldwide Omni-Channel Marketing Platforms for B2C Enterprises 2023. Our team is the backbone of MoEngage, we manage TBs of data for multiple teams, which we store in more than 50 clusters and handle 500+ EC2 servers in over 5 regions. Our team is responsible for the installation, configuration, upgrade, and migration of databases. We work closely with developers of applications that run against the database to make sure that best practices are followed for good performance and results. We process, in real-time, more than 40 Billion events per month. On an average day, we send more than 3 Billion Intelligent push notifications through our systems and build stats for them, all in real-time. Processing Speed is super critical to everything we do. As a team member, you will be constantly challenged to save those extra milliseconds and nanoseconds from your processing time. We are a small and close-knit team, we believe in learning and growing together. Requirements : 4+ years of hands-on experience on NoSQL/SQL databases with at least 2 years as DBA on MongoDB Experienced in hosting maintaining and owning large MongoDB clusters on the cloud. Experience in one or more of the Time Series Databases - like InfluxDB, Prometheus, Victoria Metrics Bonus if experienced in managing ScyllaDB clusters, AWS ElastiCache Bonus if experienced with automation using Terraform and Ansible. Roles and Responsibilities : Enhancing the scalability and performance of existing database architecture - adding/removing shards regularly Performing database maintenance, migration, and upgrading hardware and software. Conducting diagnostic tests, evaluating performance metrics and ensuring high availability or uptime of database services Owning the reliability and availability of MongoDB and ScyllaDB infra in the cloud - Servers (EC2 Instances), storage, etc Monitor the overall health parameters of the clusters like CPU Utilisation, Memory utilization, Operation Execution times, Replication lag, and load balancing of data and queries, and identify the stress areas Documenting processes and complying with best practices in database management Automate Routine or manual DBA activities Ensure metrics, logs, and dashboards are available for all critical components. Why Join Us! At MoEngage, we are passionate about our team and technology - see below to know more about us. We handle more than a billion messages every day. Rest assured, you will be surrounded by really smart and passionate people as we scale much more to build a world-class technology team.

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies