Production Operations Engineer

0 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

Remote

Job Type

Full Time

Job Description

Key Responsibilities

● Monitor production systems and job pipelines; respond promptly to alerts and anomalies

● Troubleshoot operational issues in collaboration with the development team

● Investigate incidents using logs, metrics, and observability tools (e.g., Grafana, Kibana)

● Perform recovery actions such as restarting pods, rerunning jobs, or applying known mitigations

● Operate in Kubernetes environments to inspect, debug, and manage components

● Support deployment activities through post-release validations and basic checks

● Validate data quality and flag anomalies to the relevant engineering teams

● Maintain clear documentation of incidents, actions taken, and resolution outcomes

● Communicate effectively with remote teams for operational handoffs and follow-ups


Required Qualifications

● Experience in production operations, system support, or devops roles

● Solid Linux skills (e.g., file system navigation, log analysis, process/network troubleshooting)

● Hands-on experience with Kubernetes and Docker in production environments

● Familiarity with observability tools (e.g., Grafana, Kibana, Prometheus)

● English proficiency for reading, writing, and asynchronous communication

● Strong execution discipline and ability to follow structured operational procedures


Preferred Qualifications

● Scripting ability (Python or Shell) for log parsing and automation

● Basic SQL skills for data verification or debugging

● Experience with Hadoop and Flink pipelines for batch and stream processing is a strong plus

● Experience with large-scale distributed data systems or job scheduling frameworks

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
ACL Digital logo
ACL Digital

Information Technology and Services

Palo Alto

RecommendedJobs for You