Principal System Engineer

8 - 10 years

10 - 14 Lacs

Posted:-1 days ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

As a Site Reliability Engineer at Operative, you will be at the center of our efforts to build and design scalable software solutions for our clients. You will part of a team of SRE s whos mission is to enable platform observability, automation, improvements to deployment process and infrastructure as a code for our SaaS products running in AWS.
Your efforts will be critical to ensuring we are following the best practices such as infrastructure as code, security as code, use of deployment and maintenance automation at all stages of our SDLC. You will work closely with the software development, product and support teams and take direction from engineering leadership and architecture. This role will require people management skills and technical hands-on work.
Responsibilities
  • Collaborate with ProdOps teams and engineering stakeholders to understand their deliverables and help manage staff to enable your stakeholders achieve priority objectives and remove their blockers.
  • User industry best practices for CD, site reliability, and cloud infrastructure deployment and management.
  • Lead and collaborate on projects within the SRE/DevOps space.
  • Be a thought leader as it relates to SRE/DevOps across the R&D organization.
  • Automate the maintenance of highly scalable, fault-tolerant solutions in AWS.
  • Assist with compliance, evidence gathering, technical remediation for Operatives compliance and audit processes.
  • Meet KPIs and deliver on objectives that track and advance Operatives production operations maturity.
  • Act as an escalation point to assist engineers with debugging infrastructure and automation issues.
  • Ensure that sufficient monitoring and alerting is in place to help the broader engineering and support teams be more proactive at production support.
  • Help maintain and update live SaaS systems with 99.99% client uptime SLAs.
  • Work with the broader engineering and production teams to maintain 24x7x365 on-call support.
  • Work with awesome people on a daily basis.
  • Other duties as assigned.
Qualifications
  • Bachelors Degree in Computer Science or related field required, the company is willing to accept experience or a combination of education and experience in lieu of a degree
  • 5 years of combined experience in SRE, DevOps, software development, systems and/or network administration experience at an organization supporting dozens to hundreds of applications and/or servers, required
  • AWS experience is a must
  • At least 5 years of experience supporting custom software in a production environment
  • Minimum 2 years of experience in deployment / configuration management using tools like Chef, Ansible, Puppet, Octopus, Team Foundation Server; automation projects are an acceptable experience
  • Experience with Continuous Integration tools such as Jenkins or GitLab
  • Experience with automation/configuration management using either CloudFormation, Terraform, Ansible, or equivalents
  • Proven experience getting a SaaS product organization to true continuous deployment
  • Prior work experience with Container and Container Management frameworks (e.g., Docker, Kubernetes, AWS ECS)
  • Prior experience implementing cloud solutions and cloud security paradigms
  • Good understanding of key aspects of cloud infrastructure (security, scale, cost, etc.) in comparison with on-prem
  • Experience with log collection and analysis, builds and performance monitoring/tuning of infrastructure
  • Familiar with a wide variety of cloud services and open-source technologies is preferred
  • Experience with service-oriented architecture and/or microservices is a plus
  • Someone who has a passion for speed and efficiency through automation and reducing waste with a focus on quality, security, and metrics to drive continuous improvement
  • Must have in-depth experience managing Linux based workloads
  • Excellent communication skills
    EDUCATION, CERTIFICATION AND EXPERIENCE
    8+ years of relevant experience.
  • Bachelor s or master s degree in computer science or equivalent

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Operative India Private Limitd logo
Operative India Private Limitd

Advertising Technology

Bangalore

RecommendedJobs for You