
4 Loki Jobs

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 8.0 years

20 - 30 Lacs

Hyderabad

Work from Office


About the Role

We are looking for a highly skilled Site Reliability Engineer (SRE) to lead the implementation and management of our observability stack across Azure-hosted infrastructure and .NET Core applications. This role will focus on configuring and managing OpenTelemetry, Prometheus, Loki, and Tempo, along with setting up robust alerting systems across all services including Azure infrastructure and MSSQL databases. You will work closely with developers, DevOps, and infrastructure teams to ensure the performance, reliability, and visibility of our .NET Core applications and cloud services.

Key Responsibilities

• Observability Platform Implementation:
  o Design and maintain distributed tracing, metrics, and logging using OpenTelemetry, Prometheus, Loki, and Tempo.
  o Ensure complete instrumentation of .NET Core applications for end-to-end visibility.
  o Implement telemetry pipelines for application logs, performance metrics, and traces.
• Monitoring & Alerting:
  o Develop and manage SLIs, SLOs, and error budgets.
  o Create actionable, noise-free alerts using Prometheus Alertmanager and Azure Monitor.
  o Monitor key infrastructure components, applications, and databases with a focus on reliability and performance.
• Azure & Infrastructure Integration:
  o Integrate Azure services (App Services, VMs, Storage, etc.) with the observability stack.
  o Configure monitoring for MSSQL databases, including performance tuning metrics and health indicators.
  o Use Azure Monitor, Log Analytics, and custom exporters where necessary.
• Automation & DevOps:
  o Automate observability configurations using Terraform, PowerShell, or other IaC tools.
  o Integrate telemetry validation and health checks into CI/CD pipelines.
  o Maintain observability as code for repeatable deployments and easy scaling.
• Resilience & Reliability Engineering:
  o Conduct capacity planning to anticipate scaling needs based on usage patterns and growth.
  o Define and implement disaster recovery strategies for critical Azure-hosted services and databases.
  o Perform load and stress testing to identify performance bottlenecks and validate infrastructure limits.
  o Support release engineering by integrating observability checks and rollback strategies in CI/CD pipelines.
  o Apply chaos engineering practices in lower environments to uncover potential reliability risks proactively.
• Collaboration & Documentation:
  o Partner with engineering teams to promote observability best practices in .NET Core development.
  o Create dashboards (Grafana preferred) and runbooks for system insights and incident response.
  o Document monitoring standards, troubleshooting guides, and onboarding materials.

Required Skills and Experience

• 4+ years of experience in SRE, DevOps, or infrastructure-focused roles.
• Deep experience with .NET Core application observability using OpenTelemetry.
• Proficiency with Prometheus, Loki, Tempo, and related observability tools.
• Strong background in Azure infrastructure monitoring, including App Services and VMs.
• Hands-on experience monitoring MSSQL databases (deadlocks, query performance, etc.).
• Familiarity with Infrastructure as Code (Terraform, Bicep) and scripting (PowerShell, Bash).
• Experience building and tuning alerts, dashboards, and metrics for production systems.

Preferred Qualifications

• Azure certifications (e.g., AZ-104, AZ-400).
• Experience with Grafana, Azure Monitor, and Log Analytics integration.
• Familiarity with distributed systems and microservice architectures.
• Prior experience in high-availability, regulated, or customer-facing environments.

Posted 1 week ago


9 - 14 years

30 - 35 Lacs

Pune

Work from Office


We are looking for a Lead Software Engineer.

You'll make a difference by:

Must have

• Lead Software Engineer who writes as well as understands requirements, and leads the implementation effort with the team as per the requirements.
• Experience in the software development lifecycle using Angular 17+, TypeScript 6+, Java 17+, Spring Boot 3+ and Maven/Gradle.
• Experience with streaming data with RabbitMQ.
• Experience working with Docker containerization techniques and Kubernetes cluster management.
• Experience working with GitLab CI.
• Experience with RESTful web services.
• Experience in unit testing with JUnit, Jasmine and Karma, and static code analysis with SonarQube.
• Experience in service monitoring with Grafana, Loki and Prometheus.
• Experience working with Agile technologies and releases.

Good to have

• Experience with Python 3+ and/or shell script.
• Experience in software design and documentation with UML notations based on OOAD.

Desired skills

• BE / B.Tech (Computer Science / Electronics / Instrumentation / Telecom) / MCA / ME or higher.
• 8-12 years of experience.
• 9+ years of experience is required.
• Good debugging and analytical skills.
• Analytical and problem-solving skills.
• Great communication skills.

Posted 2 months ago


6 - 10 years

18 - 20 Lacs

Chennai, Noida

Hybrid


For the Observability role that we are looking to fill, you can use the details below as a starting point to find the right resource.

The skill sets that we are looking for are as below:

1. Experience in AWS environments
2. Experience in Kubernetes environments as an administrator
3. Experience with Linux operating systems
4. Experience in Python and shell scripting is a must
5. Experience in Jenkins pipelines
6. Strong knowledge of DevOps principles
7. Preferably experience with open-source monitoring tools like Telegraf, Prometheus, Grafana, Loki
8. Experience in developing dashboards in Grafana using various data sources like Loki, Prometheus, AWS CloudWatch
9. Experience in using Git / Bitbucket
10. Knowledge of Agile methodologies

Keywords: DevOps, Docker, AWS, Azure, Kubernetes, Pipelines, Deployment, Python/Java/any language, Bash, Linux, Jenkins, Jira, Bitbucket

Posted 3 months ago


5 - 10 years

7 - 12 Lacs

Bengaluru

Work from Office


Role Purpose

The purpose of this role is to design, test and maintain software programs for operating systems or applications that need to be deployed at the client end, and to ensure they meet 100% of quality assurance parameters.

Do

JD - Key Responsibilities:

• Design, develop, and maintain high-performance, scalable applications using GoLang.
• Implement and manage observability tools including Otel, Grafana, Loki, Prometheus, and Tempo.
• Develop and maintain infrastructure automation scripts using Ansible.
• Collaborate with cross-functional teams to define, design, and ship new features.
• Ensure the performance, quality, and responsiveness of applications.
• Identify and correct bottlenecks and fix bugs.
• Help maintain code quality, organization, and automation.

Required Skills and Qualifications:

• Bachelor's degree in Computer Science, Engineering, or a related field.
• 5+ years of experience in software development with a strong focus on GoLang.
• Proficiency in observability tools such as Otel, Grafana, Loki, Prometheus, and Tempo.
• Experience with infrastructure automation using Ansible.
• Strong understanding of software development principles and design patterns.
• Excellent problem-solving skills and attention to detail.
• Ability to work independently and as part of a team.
• Strong communication skills.

Preferred Qualifications:

• Experience with cloud platforms such as AWS, Azure, or GCP.
• Familiarity with containerization technologies like Docker and Kubernetes.
• Knowledge of CI/CD pipelines and tools.

General responsibilities:

• Be instrumental in understanding the requirements and design of the product/software.
• Develop software solutions by studying information needs, systems flow, data usage and work processes.
• Investigate problem areas, following the software development life cycle.
• Facilitate root cause analysis of system issues and the problem statement.
• Identify ideas to improve system performance and impact availability.
• Analyze client requirements and convert them into a feasible design.
• Collaborate with functional teams or systems analysts who carry out the detailed investigation into software requirements.
• Confer with project managers to obtain information on software capabilities.
• Perform coding and ensure optimal software/module development.
• Determine operational feasibility by evaluating analysis, problem definition, requirements, software development and proposed software.
• Develop and automate processes for software validation by setting up and designing test cases/scenarios/usage cases, and executing these cases.
• Modify software to fix errors, adapt it to new hardware, improve its performance, or upgrade interfaces.
• Analyze information to recommend and plan the installation of new systems or modifications of an existing system.
• Ensure that code is error free, with no bugs or test failures.
• Prepare reports on programming project specifications, activities and status.
• Ensure all the codes are raised as per the norm defined for the project / program / account, with clear description and replication patterns.
• Compile timely, comprehensive and accurate documentation and reports as requested.
• Coordinate with the team on daily project status and progress, and document it.
• Provide feedback on usability and serviceability, trace the result to quality risk and report it to the concerned stakeholders.
• Status reporting and customer focus on an ongoing basis with respect to the project and its execution.
• Capture all the requirements and clarifications from the client for better quality work.
• Take feedback on a regular basis to ensure smooth and on-time delivery.
• Participate in continuing education and training to remain current on best practices, learn new programming languages, and better assist other team members.
• Consult with engineering staff to evaluate software-hardware interfaces and develop specifications and performance requirements.
• Document and demonstrate solutions by developing documentation, flowcharts, layouts, diagrams, charts, code comments and clear code.
• Document the necessary details and reports in a formal way for proper understanding of the software, from client proposal to implementation.
• Ensure good quality of interaction with the customer w.r.t. e-mail content, fault report tracking, voice calls, business etiquette etc.
• Timely response to customer requests and no instances of complaints either internally or externally.

Stakeholder Interaction

Stakeholder Type | Stakeholder Identification | Purpose of Interaction
Internal | Lead Software Developer and Project Manager | Regular reporting updates
Internal | Software Developers | For work coordination and support in providing testing solutions
External | Clients | Provide apt solutions and support as per the requirement

Display

Lists the competencies required to perform this role effectively:

Functional Competencies/Skill

• Leveraging Technology - Knowledge of current and upcoming technology along with expertise in programming (automation, tools and systems) to build efficiencies and effectiveness in own function/client organization - Competent
• Process Excellence - Ability to follow the standards and norms to produce consistent results, provide effective control and reduction of risk - Expert
• Technical knowledge - Knowledge of various programming languages, tools, quality management standards and processes - Expert

Competency Levels

• Foundation: Knowledgeable about the competency requirements. Demonstrates (in parts) frequently with minimal support and guidance.
• Competent: Consistently demonstrates the full range of the competency without guidance. Extends the competency to difficult and unknown situations as well.
• Expert: Applies the competency in all situations and serves as a guide to others as well.
• Master: Coaches others and builds organizational capability in the competency area. Serves as a key resource for that competency and is recognised within the entire organization.

Behavioral Competencies

Formulation, Prioritization, Innovation, Managing Complexity, Execution Excellence, Passion for Results

Deliver

No. | Performance Parameter | Measure
1 | Continuous Integration, Deployment and Monitoring of Software | 100% error-free onboarding and implementation, throughput %, adherence to the schedule/release plan
2 | Quality and CSAT | On-time delivery, manage software, troubleshoot queries, customer experience, completion of assigned certifications for skill upgradation
3 | MIS and Reporting | 100% on-time MIS and report generation

Posted 3 months ago
