Get alerts for new jobs matching your selected skills, preferred locations, and experience range.
5.0 - 10.0 years
15 - 25 Lacs
Pune, Gurugram, Bengaluru
Work from Office
Domain: IT & Services Position: ELK Architect Experience: 5-14Years Location: Anywhere in India Notice period-Immediate to 30 -40 days Your Team You are invited to be part of a global leader in technology services and consulting, focused on providing innovative solutions for digital transformation. Leverage your expertise in consulting, design, engineering, and operations to help clients achieve their boldest goals and build future-ready businesses. Work in a collaborative environment, bringing together skills across 40+ industries across 60+ countries. This combination of strong human capital and expansive geographic presence positions this enterprise as a formidable player in the IT services landscape. JD - Key Responsibilities - Design, implement, and configuration of Elastic stack, Kibana visualizations and Grafana for the Client. Present and demonstrate ELK / Grafana capabilities to the prospective clients Designs and optimizes ELK platform architecture for large-scale and distributed deployments Establishes best practices and development standards, and ensures that the team adopts them Maintains a close partnership with ELK / grafana on feature requests, upgrade planning, and product roadmap alignment Develops and customizes various dashboards and Builds advanced visualizations Performs assessment of Monitoring estate and derive at recommendations with quantified business benefits Good understanding and experience on: End to end ELK Stack / Grafana Enterprise & Cloud deployment Administering Production Systems, where Elastic Stack runs / where Grafana with data sources as Prometheus, Graphite, M3DB,etc. Data onboarding from multiple data sources and build Transformation framework Migration / upgrade planning & execution for ELK / Grafana platform End to end ELK / Grafana data Ingestion, enrichment, transformation and visualization Deployment & Administration of Elastic Stack version 5.0 and above & Grafana Stack version 7.x and above Docker, Kubernetes, Artifactory and Gitlab & cloud platforms (preferably Azure / AWS) Deeper understanding on VPC, Subnet, NI, LBs and Availability Zones & AWS Services limitations Hands on / Knowledge on ML modules with good insights on Event Aggregation / Event correlation / Anomaly detection areas Watchers for Alerting leveraging Painless scripts / Python Should have hands on experience with: Design Elasticsearch indices to efficiently store metric data by optimizing both performance and its growth into consideration. Design access control using X-Pack. Configuration of X-Pack including Shield, Watcher, Marvel, Graph, and Reporting. Configuration of Logstash, FileBeats, MetricsBeats and other ELK Stack components to collect and the store the data necessary to meet customer requirements efficiently. Inbound / Outbound Integrations IT & business systems and custom plugin creations Implementing Active directory integration, cross-cluster search & replication, Infra UI and Compact UI display. Setting up and configuring Grafana & Integration experience with different multiple data sources like 3rd party monitoring tools Devops / APIs,etc. Configuring dashboards, alerts and alarms Exposure to analyze from dashboard, recommend and problem solving skills to debug issues / integrate using Json, API, CI/CD pipeline using scripts Data visualization using Grafana for creating Grafana dashboards to display time-based data plots AWS Services Cloud Formation ASG, EC2 S3, Route53 CloudWatch
Posted 2 weeks ago
6.0 - 9.0 years
32 - 35 Lacs
Noida, Kolkata, Chennai
Work from Office
Dear Candidate, We are hiring a Rust Developer to build safe, concurrent, and high-performance applications for system-level or blockchain development. Key Responsibilities: Develop applications using Rust and its ecosystem (Cargo, Crates) Write memory-safe and zero-cost abstractions for systems or backends Build RESTful APIs, CLI tools, or blockchain smart contracts Optimize performance using async/await and ownership model Ensure safety through unit tests, benchmarks, and fuzzing Required Skills & Qualifications: Proficient in Rust , lifetimes , and borrowing Experience with Tokio , Actix , or Rocket frameworks Familiarity with WebAssembly , blockchain (e.g. Substrate) , or embedded Rust Bonus: Background in C/C++ , systems programming, or cryptography Soft Skills: Strong troubleshooting and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Srinivasa Reddy Kandi Delivery Manager Integra Technologies
Posted 2 weeks ago
2 - 5 years
4 - 7 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 1 month ago
3 - 7 years
13 - 17 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 1 month ago
3 - 7 years
13 - 17 Lacs
Bengaluru
Work from Office
Role Description Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 1 month ago
5 - 8 years
7 - 10 Lacs
Bengaluru
Work from Office
The financial Systems team in the FinTech business unit provides technical expertise to the finance department and is responsible for supporting SAP ERP/S4HANA, SAP BI, Native HANA solutions, and many other connected external systems. We want to change the way people work with enterprise systems, by building an application platform that supports simplification of business processes and empowers the finance community with better integrations and financial insights. This role is focused on make sure we develop Integration solutions for Booking Financial services and our different Business units and Brands. Key Job Responsibilities and Duties Help our team with providing best practices, enterprise architecture standards, and reusable integration patterns. Ability to take fast decisions in a very dynamic environment, with some level of uncertainty Driving and leading architecture design discussions and actively documenting them. Build template solutions on complex integration cases and liaise with our business stakeholders and traduce their business needs into Integration solutions. Maintaining and supporting existing integration interfaces. Create technical design and actual implementation of solutions. Continuously monitor and propose improvements and innovations in our integration landscape The quality, reusability and proficiency of all above aspects conforms the KPIs of this role Role Qualifications and Requirements Min 5+ years of experience in the IT industry having 2+ full implementation cycle in integration area. Good understanding of concepts such as Mulesoft, EDA, MOA, SOA, SOAP, REST APIs, RESTful, RPC/RFC, event streams such as Kafka and different integration patterns. Integration platform experience with Mule 4.x, CloudHub 1.0, (Ideally some experience on CloudHub 2.0, not essential) as well as strong development experience preferable with JAVA/J2EE and build tools like Maven, other integration platform experience, especially event driven integration, SAP CM, CPI is a plus.. Exposure to a variety of enterprise architectures from a large monolithic architecture to hybrid landscapes consisting of distributed systems and microservices and event-driven architectures. Experience of API led design using RAML/OAS and experience of supporting APIs through the full API lifecycle and system observability is a plus (Graphite, Grafana etc.) Strong analytical and problem-solving skills as well as good communication skills while being organized, flexible, proactive, and result-oriented. Pre-Employment Screening If your application is successful, your personal data may be used for a pre-employment screening check by a third party as permitted by applicable law. Depending on the vacancy and applicable law, a pre-employment screening may include employment history, education and other information (such as media information) that may be necessary for determining your qualifications and suitability for the position.
Posted 1 month ago
3 - 5 years
7 - 11 Lacs
Gurugram
Remote
Groundtruth looking for DevOps Engineer who can join us within 30 Days You will: Increase velocity of engineering teams by creating/deploying new stacks, services, and automations Work on projects to improve tooling, efficiency, and standardize/automate approaches (DRY) for commonly-used stacks/services Manage user access to services/systems via tools such as AWS IAM, terraform, and saltstack Participate in on-call rotation to handle critical and/or service-impacting issues Seek pragmatic opportunities to improve our infrastructure, processes, and operational activities Plan, provision, operate, and monitor cloud infrastructure for multiple areas of the business that you support. Design and assist with development and integration of monitoring dashboards, alerting solutions, and devops tools. Collaborate with Software Engineering to plan feature releases and to monitor and support applications including cost analysis and controls. Respond to system, application, security, and customer incidents conducting cause and impact analysis. Participate in on-call support rotation You have: This is our ideal wish list, but most people dont check every box on every job description. So, if you meet most of the criteria below and are excited about the opportunity, and willing to learn, wed love to hear from you. working in a DevOps roles supporting Engineering teams 4 year degree in Computer Science or related field and 3+ years of experience in software engineering OR 6+ years of experience in software development with no degree Experience working with multiple AWS technologies including IAM, EC2, ECS, S3, RDS, EMR, Glue, or similar Experience working for a geographically distributed company Knowledge of CI/CD tools and integration along with container and other microservice-related technologies Proficiency with Github, Github Actions, AWS CLI, and troubleshooting web services and distributed systems Experience in one or more of the following: Python, Bash/Shell, Go, Terraform (or other IaC tools) Experience with automation tools (Saltstack, Chef, Ansible) Experience with IaC tools (e.g. Terraform) Experience working with cloud (AWS, Azure, GCP) preferably with multi-region tenancy Experience with linux administration Experience with shell scripting/cron Nice to have Python3 coding experience (or similar) automation of cloud deployments/infra mgmt. experience with containerization (docker, kubernetes, etc) experience with networking set up (on prem or virtual) experience with monitoring/alerting tools (e.g. cloudwatch alarms, graphite, prometheus, etc) What we offer At GroundTruth, we want our employees to be comfortable with their benefits so they can focus on doing the work they love. Parental leave- Maternity and Paternity Flexible Time Offs (Earned Leaves, Sick Leaves, Birthday leave, Bereavement leave & Company Holidays) In Office Daily Catered Lunch Fully stocked snacks/beverages Health cover for any hospitalization. Covers both nuclear family and parents Tele-med for free doctor consultation, discounts on health checkups and medicines Wellness/Gym Reimbursement Pet Expense Reimbursement Childcare Expenses and reimbursements Employee referral program Education reimbursement program Skill development program Cell phone reimbursement (Mobile Subsidy program). Internet reimbursement/Postpaid cell phone bill/or both. Birthday treat reimbursement Employee Provident Fund Scheme offering different tax saving options such as Voluntary Provident Fund and employee and employer contribution up to 12% Basic Creche reimbursement Co-working space reimbursement National Pension System employer match Meal card for tax benefit Special benefits on salary account Interested one share update resume at laxmi.pal@groundtruth.com or if you are immediate joiner and having relevant experience please connect on 9220900537
Posted 1 month ago
2 - 5 years
4 - 7 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 2 months ago
3 - 7 years
13 - 17 Lacs
Bengaluru
Work from Office
Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 2 months ago
3 - 7 years
13 - 17 Lacs
Bengaluru
Work from Office
Role Description Site Reliability Engineer - Private Cloud - Our mission at Booking.com is to create transformative, innovative, and personalized travel experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and 3rd parties. About the team - Private cloud: The Private Cloud group operates, orchestrates, and optimizes Booking-managed cloud infrastructure. The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking.com. These platform instances, and the workloads running on them, are hosted both in Booking datacenters (on-premises) and on public cloud infrastructure (AWS). The Private Cloud platform has three primary internal customer-facing verticals: virtualization, containerization, and serverless, corresponding to the three types of workloads it supports. At the highest level, the Booking Private Cloud drives three primary business outcomes: Agility in provisioning and using cloud infrastructure. Efficiency in cost and utilization of cloud infrastructure, as well as toil reduction for developers and engineers. Trust in the safety, reliability, and performance of our cloud infrastructure. Years of Experience: 2years-5years Key Job Responsibilities and Duties: The core premise for the Booking SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here at Booking. You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers You will have the opportunity to collaborate with many of the worlds leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka etc What youll be Doing: Design, develop and implement systems software that improves the stability, scalability, availability and latency of the Booking.com products; Take ownership of one or more services and have the freedom to do what is best for our business and customers; Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again; Build effective monitoring to monitor the health of your system, and jump in to handle outages; Build and run capacity tests to handle the growth of your systems; Plan for reliability by designing systems to work across our multinational data centers; Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day; Share the on-call rotation and be an escalation contact for incidents (depending on level of role) What youll bring: Solid experience in at least one programming language. Experience with building, operating and maintaining scalable distributed systems, and with operations automation; Experience with Infrastructure as Code technologies; Knowledge of cloud computing fundamentals; Solid foundation in Linux administration and troubleshooting; Understanding of Service level agreements and objectives; Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable; Monitoring / observability technologies like Prometheus, Graphite, Grafana, Kibana, Elasticsearch are a plus; Good interpersonal skills Proficient command of the English language, both written and spoken
Posted 2 months ago
5 - 10 years
25 - 35 Lacs
Bengaluru
Hybrid
Were always looking for talented and creative engineers to join our team. Event & Streaming Group offers a relaxed but fast environment where creative and collaborative talented people are rewarded. We are very active and passionate about catching up and introducing cutting-edge technology from OSS (Open-Source Software). Our Solution for Data Engineering and Event Management are being used for various services in Rakuten, Inc and continue to grow, following up needs of system for data-driven strategy. Userss requirements and needs are changing continuously, and Our Solution are also evolving fast to catch up their needs and support. Role: We are in search of a talented Engineer, which would work with members in India and Japan. In Event & Streaming Group where are collecting and engineering tremendous data using data engineering solutions, you will get to play a core role in administrating, monitoring and problem resolution on current data engineering platform, and the cutting-edge data engineering technology R&D. Responsibilities: Administration and Maintenance for Data Pipeline System that transfer and wrangle terabyte of data from various service using ELK, Apache Kafka, Apache NiFi. Collaboration with SRE Tm in Japan and India. Implement Automated Operation System. L1/L2 Incident Response. Requirements: Excellent Hands-on experience with Linux . (At least more than 3-years) Must have experience in administrating and maintaining one of the following: Apache Kafka, ELK, NIFI Cluster in production . (At least more than 1-years) Apache Pulsar or Confluent Kafka (At least more than 1-years) Hands-on experience with one of Apache Pulsar or Confluent Kafka on K8S . (At least more than 1-years) Hands-on Experience with one of deployment system like Chef, Ansible , etc Hands-on Experience with one of metrics collection system like Prometheus, Graphite , etc Experience on one of programing languages in J ava (or Scala), Python, or ShellScript (At least more than 1-years) Must have experience in administrating and maintaining client-server backend system in production. (At least more than 1-years) Must be self-organized and gritty on continuous improvements of the platform Must be a self-starter and good collaborator with good communication skills. Preferred Knowledge, Skills and Abilities: Hands-on experience with HDP (HDFS, Hive/HiveLLAP, MapReduce, Spark on Yarn) or CDP Hands-on experience or great knowledge with Docker, Kubernetes . Hands-on experience or great knowledge with GCP, AWS, Azure Fluent or Business level of Japanese. Looking for immediate joiners / can join with in 30-days Rakuten is committed to cultivating and preserving a culture of inclusion and connectedness. We are able to grow and learn better together with a diverse team and inclusive workforce. The collective sum of the individual differences, life experiences, knowledge, innovation, self-expression, and talent that our employees invest in their work represents not only part of our culture, but our reputation and Rakutens achievement as well. In recruiting for our team, we welcome the unique contributions that you can bring in terms of their education, opinions, culture, ethnicity, race, sex, gender identity and expression, nation of origin, age, languages spoken, veterans status, color, religion, disability, sexual orientation, and beliefs.”
Posted 3 months ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
36723 Jobs | Dublin
Wipro
11788 Jobs | Bengaluru
EY
8277 Jobs | London
IBM
6362 Jobs | Armonk
Amazon
6322 Jobs | Seattle,WA
Oracle
5543 Jobs | Redwood City
Capgemini
5131 Jobs | Paris,France
Uplers
4724 Jobs | Ahmedabad
Infosys
4329 Jobs | Bangalore,Karnataka
Accenture in India
4290 Jobs | Dublin 2