About OnlineSales.ai
Built by ex-Amazon ad-tech experts, OnlineSales.ai offers a future-proof Retail Media Operating System - boosting Retailer s profitability by 7% of Sales! We are an Enterprise B2B SaaS startup, based out of Pune India. With OnlineSales.ais platform, Retailers activate and delight 10x more Brands by offering an omni-channel media buying experience, advanced targeting, analytics & 2x better ROAS. Tier 1 Retailers and Marketplaces globally are accelerating their Monetization strategy with OnlineSales.ai and are innovating ahead of the market by at least 2 years.
About the Role
We are seeking a highly skilled Staff DevOps Engineer to architect and maintain a highly available, global infrastructure capable of handling high QPS systems with 99.99% uptime. The role requires expertise in managing deployments across multiple regions, ensuring fault-tolerant systems, and driving scalability for mission-critical applications.
What will you do @OnlineSales?
-
Architect, manage, and scale Kubernetes clusters for high throughput and low latency across multiple global regions.
-
Design and maintain Infrastructure as Code (IaC) to support a fault-tolerant, globally distributed architecture.
-
Build and optimize CI/CD pipelines to ensure smooth, zero-downtime deployments.
-
Ensure 99.99% availability for high QPS applications by implementing robust monitoring, incident management, and failover strategies.
-
Manage multi-region deployments to enable low-latency, geo-redundant infrastructure.
-
Collaborate with cross-functional teams to ensure security, scalability, and operational efficiency.
-
Lead and mentor a high-performing DevOps team, fostering a culture of excellence and innovation.
You will be a great fit, if you have :
-
7-10 years of experience managing large-scale, high-availability systems.
-
Proven expertise in Kubernetes administration, including multi-region deployments and scaling for high QPS.
-
Deep experience with IaC tools like Terraform or CloudFormation.
-
Hands-on with CI/CD pipelines for global, multi-region deployments.
-
Strong understanding of cloud platforms (AWS, GCP, or Azure) and geo-redundant architecture.
-
Proficient in Linux, scripting (Bash, Python), and troubleshooting large-scale distributed systems.
-
Experience leading teams and solving complex, production-grade system challenges.
Why OnlineSales.ai?
-
Startup-y . We believe Startup is a mindset. It s about being scrappy, being nimble, solving tough problems with constrained resources, and more. It s about working hard and playing hard
-
Enterprise SaaS . Opportunity to work with an Enterprise Product SaaS firm with aspirations of growing 10x across the globe
-
AI-led Retail Tech . We are working to digitize & democratize one of the most exciting and growing verticals - Retail Tech leveraging data, machine learning, and automation (culmination of ad-tech, mar-tech, and analytics for Retail vertical)
-
Meaningful work . This is not just a job. You can find a job anywhere. This is a place for the bold to get paid who make a real impact on business
-
No red tape . Say goodbye to pointless meetings or political hoops to jump through. We re scrappy, believe in autonomy, and empower our teams to do whatever it takes to do the unthinkable
-
Problem Solving . We ignite the best in you. We exist not only to deliver meaningful innovation but to ignite and inspire the creative problem-solver in you
-
Quirky & fun . Enjoy new skills and hobbies like being a quiz master, playing board games, trying your hands on percussion, playing Djembe, and spreading love within the org!