Senior AWS Engineer – Observability & Monitoring (CloudWatch | Python | CI/CD)

7 - 9 years

0 Lacs

Posted: 1 week ago | Platform: Glassdoor

Work Mode

On-site

Job Type

Part Time

Job Description

    7 - 9 Years
    2 Openings
    Kochi, Trivandrum


Role description

We are seeking an experienced Senior AWS Engineer to design, implement, and optimize end-to-end observability solutions across multiple AWS-based systems. The ideal candidate will have deep expertise in AWS CloudWatch, AWS X-Ray, Lambda monitoring, and infrastructure-as-code practices, combined with a strong understanding of service-level objectives and agreements (SLOs/SLAs) and alerting integrations such as OpsGenie or PagerDuty.

This role is pivotal in shaping our observability strategy and mentoring junior engineers in building reliable, data-driven monitoring systems.

Key Responsibilities

  • Design and implement a comprehensive CloudWatch alarm and monitoring strategy across 13 AWS solutions.
  • Define and operationalize SLOs/SLAs for critical business workflows and services.
  • Instrument application code with custom metrics using Python and Embedded Metric Format (EMF) where visibility gaps exist.
  • Validate and enhance AWS X-Ray implementation to ensure full traceability and performance insight.
  • Create and maintain alerting and escalation workflows using OpsGenie (or PagerDuty).
  • Optimize existing CloudWatch dashboards (e.g., doc-manager, state-machine) and build new dashboards for remaining solutions.
  • Employ CloudFormation (or equivalent IaC tools) to manage and automate observability resources.
  • Mentor junior engineers, promoting best practices in observability, monitoring automation, and performance analysis.
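
As a rough illustration of the EMF instrumentation mentioned above, the sketch below builds one Embedded Metric Format record in Python and prints it from a Lambda-style handler. The namespace `ExampleApp`, the metric `DocumentsProcessed`, the `Service` dimension, and the helper `emf_record` are hypothetical names chosen for this example, not part of the role's actual codebase.

```python
import json
import time


def emf_record(namespace, metric_name, value, unit="Count", dimensions=None):
    """Build one EMF log record as a JSON string (hypothetical helper)."""
    dimensions = dimensions or {}
    record = {
        "_aws": {
            # EMF expects the timestamp in milliseconds since the epoch.
            "Timestamp": int(time.time() * 1000),
            "CloudWatchMetrics": [
                {
                    "Namespace": namespace,
                    # One dimension set listing the dimension keys used below.
                    "Dimensions": [list(dimensions.keys())],
                    "Metrics": [{"Name": metric_name, "Unit": unit}],
                }
            ],
        },
        # The metric value lives at the top level, keyed by the metric name.
        metric_name: value,
    }
    # Dimension values are also top-level members of the record.
    record.update(dimensions)
    return json.dumps(record)


def handler(event, context):
    # In Lambda, lines written to stdout are delivered to CloudWatch Logs,
    # where EMF-formatted records are extracted as metrics.
    print(emf_record("ExampleApp", "DocumentsProcessed", 1,
                     dimensions={"Service": "doc-manager"}))
    return {"status": "ok"}
```

Emitting metrics through log lines in this way avoids a synchronous metrics API call on the request path, which is one reason EMF is often preferred inside Lambda functions.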

Required Skills & Expertise

  • Advanced proficiency with AWS CloudWatch (alarms, dashboards, metrics, Logs Insights).
  • Strong understanding of AWS Lambda monitoring and optimization techniques.
  • Hands-on experience with AWS X-Ray for distributed tracing and performance debugging.
  • Expertise in CloudFormation or Infrastructure-as-Code (IaC) frameworks for observability infrastructure.
  • Proficiency in Python, especially for custom metric instrumentation and automation.
  • Knowledge of EMF (Embedded Metric Format) for structured metric data.
  • Proven experience defining and implementing SLOs/SLAs.
  • Familiarity with OpsGenie or PagerDuty for alerting and incident management workflows.

Skills

AWS, CloudWatch, Lambda, CloudFormation, Python

About UST

UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients’ organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact—touching billions of lives in the process.

UST Global

Information Technology Services

Oxnard
