Posted:-1 days ago|
Platform:
Work from Office
Full Time
Test product-specific use cases and validate end-to-end alerting workflows across monitoring systems.
Simulate incidents and test scenarios that trigger alerts in tools like Datadog, Prometheus, or similar monitoring platforms.
Verify that alerts raised in monitoring tools are correctly consumed and acted upon by downstream systems or automated workflows.
Understand alert rules so test cases are easier to design, execute, debug, and maintain (alert configuration will be handled by Developers/SREs, but QA must understand them).
Collaborate closely with engineering teams (Developers, SREs/DevOps) to improve detection, investigation, and automated incident response.
Analyze alert behaviour, validate incident pipelines, and ensure seamless integration across all monitoring and automation tools.
Identify gaps in monitoring, logging, and alert workflows and provide clear, actionable QA feedback.
Document test scenarios, alert behaviour, and monitoring workflows in a clear and reproducible manner.
Monitoring Tools Expertise: Hands-on experience with at least one major monitoring system (Datadog or Prometheus), including working with alerts, dashboards, and troubleshooting.
Alert Simulation & Validation: Ability to trigger, simulate, and validate alert events end-to-end.
Incident Workflow Understanding: Strong understanding of how alerts propagate through monitoring systems and how automated systems respond to them.
Automation Mindset: Ability to use or write simple scripts (Python, Shell, etc.) to simulate workloads or events that trigger alerts.
Communication & Problem Solving: Ability to collaborate effectively with Developers and SRE/DevOps teams to ensure monitoring accuracy.
Experience with automated incident investigation or remediation tools.
Familiarity with CI/CD pipelines and integrating monitoring validation into pipelines.
Understanding of observability fundamentals metrics, logs, traces.
Exposure to infrastructure or SRE environments.
Basic knowledge of Kubernetes, Docker, or cloud platforms (AWS/GCP/Azure).
Infracloud Technologies
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Now
Experience: Not specified
3.0 - 7.0 Lacs P.A.
4.0 - 8.0 Lacs P.A.
hyderabad
7.0 - 12.0 Lacs P.A.
noida, new delhi, pune
6.0 - 11.0 Lacs P.A.
8.0 - 10.0 Lacs P.A.
india
3.6 - 4.8 Lacs P.A.
kolkata
6.0 - 9.0 Lacs P.A.
hyderabad
10.0 - 20.0 Lacs P.A.
4.0 - 8.0 Lacs P.A.
ahmedabad, gujarat, india
Salary: Not disclosed