Technical Lead-Cloud & Infra Engg

3 - 5 years

2 - 9 Lacs

Posted:7 hours ago| Platform: GlassDoor logo

Apply

Work Mode

On-site

Job Type

Part Time

Job Description

Country/Region: IN
Requisition ID: 31238
Work Model:
Position Type:
Salary Range:
Location: INDIA - NOIDA- BIRLASOFT OFFICE

Title:Technical Lead-Cloud & Infra Engg

Description:

Area(s) of responsibility

Core Responsibilities

User & Access Management

  • Create, update, and delete user accounts.
  • Assign roles and permissions via OKTA groups:
    • Grafana_Admin_Assignment_Group (Admins)
    • Grafana_Editors_Assignment_Group (Regular users)
    • Grafana_SOC_Admin_Group and Grafana_SOC_Editor_Group for SOC environments.
  • Ensure admin access is granted only upon ARF approval.

Dashboard & Visualization Management

  • Create and manage dashboards using data sources like Prometheus, Loki, and Tempo.
  • Customize panels, variables, and layouts for dynamic filtering.
  • Add trace components using Tempo and trace IDs.

Alerting & Monitoring

  • Set up and manage alerts based on log and metric data.
  • Ensure alerts are configured correctly and notifications are sent to appropriate users.
  • Monitor the health and performance of the Grafana instance.

System Administration

  • Perform regular backups of Grafana configurations and data.
  • Restore data from backups when necessary.
  • Escalate issues to platform owners as needed.

Documentation & Compliance

  • Maintain documentation for Grafana configurations, dashboards, and processes.
  • Support audit and compliance requirements by ensuring traceability and access logs.

Stack Deployment & Maintenance

  • Deploy and manage Grafana stack with Prometheus, Loki, and Tempo using Docker Compose.
  • Configure Prometheus to scrape metrics and Loki for log aggregation.
  • Maintain and update docker-compose and Prometheus configuration files.

Required Qualifications

Education & Certifications

  • Bachelor’s degree in Computer Science, IT, or related field.
  • Certifications preferred: Grafana Cloud Admin, Prometheus Certified Associate, or equivalent.

Experience

  • 3–5 years of experience in monitoring and observability platforms.
  • Hands-on experience with Grafana, Prometheus, Loki, Tempo, and Docker.
  • Familiarity with OKTA, ARF workflows, and enterprise access control.

Skills

  • Strong troubleshooting and analytical skills.
  • Proficiency in scripting (Bash, Python) and automation tools (Ansible, Terraform).
  • Excellent communication and documentation abilities.
  • Willingness to work in 24x7 support environments and rotational shifts.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You