Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

You Are

Passionate about deploying applications in a highly available manner and ensuring each release adheres to SLAsUsed to writing scripts to automate instead of doing manual taskAn enthusiastic, self-motivated, independent problem solver who knows when to execute andwhen to ask for help but more importantly who thrives in finding efficient, creative solutions to complex problems in fast paced environments.Successful in supporting applications that span multiple data centers.Savvy about which open source tools to leverage and what to build in-house.Packed with a useful bag of tricks for troubleshooting system and network issues.

You Will Be

Managing all aspects of operationsQualifying new releases with stress, load and performance tests.Documenting tasks as you go.Developing monitoring scripts and run books using Puppet, Chef or Ansible247 support for mission critical deployments.

Handling These Fine Details

OOLTP No SQL database, for high performance, low latency applications like real-time advertising.Large-scale distributed database architectures.World-class distributed system technology including clustered system, distributed database, distributed computing, distributed storage, etc.247 operations and high availability (on call/rotation)Automated QA with continuous testing (occasionally).

You Have

Excellent knowledge of scripting language Perl, Python or Ruby (Python is highly preferred)Over 4 years of operations experience in supporting 247 services with a scalable product that serves millions of actively subscribed users.Excellent experience in scripting for monitoring/deployment using Puppet, Chef or Ansible to automate task/deployment/installs is strongly preferredKnowledge of monitoring and graphing systems like Nagios, Zabbix, Graphite, etc.Demonstrated experience in handling new deployments that contain dozens to hundreds ofLinux based server systems.Proven experience in deploying application servers through the full product life cycle including rolling upgrades on running services for millions of subscribers.Knowledge in data center deployment (including multi OS environment set-up and management) is preferred.Driven determination and demonstrative methods to hunt down difficult bugs in production systems using system logs and other run-time statistics.Experience in working with counterparts in another time zone to provide 24X7 coverage.Experience in being on an on-call rotation list for 2nd level operations support.Experience in developing run books for use by 1st level operations support

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

Hyderabad, Telangana, India

Indore, Madhya Pradesh