Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 10.0 years
15 - 30 Lacs
Hyderabad, Pune, Bengaluru
Hybrid
Minimum of 5+ years of DevOps tools. Strong hands-on experience with Grafana, InfluxDB for monitoring and visualization. Experience the ETL tools like : Pentaho, Apache Hop. Experience with the visualization tools like : Grafana. Solid experience in Shell and Python scripting for automation. Experience in the Telco industry. Skills required Program languages: Python (Must). Databases: MySQL, Influxdb, Hive (big data (Must)) Server Ops: Management of Redhat Linux/Centos7, Flatcar (Must) Containerization and container platforms: Docker, Docker-compose (Must) Scripting: JavaScript, Shell, Bash (Must) • Monitoring tools: Grafana. (Must), tableau (Nice) Big data tools: (Nice). • DevOps/Design Tools : Draw io., JIRA, Confluence. • Software Management Tools: Maven (Nice) • CI/CD: Bitbucket, GitLab, Jenkins.
Posted 3 months ago
3.0 - 5.0 years
5 - 7 Lacs
Hyderabad
Hybrid
Urgent Requirement for Grafana, Employment:C2H Notice Period:Immediate We are seeking a skilled Database Specialist with strong expertise in Time-Series Databases, specifically Loki for logs, InfluxDB, and Splunk for metrics. The ideal candidate will have a solid background in query languages, Grafana, Alert Manager, and Prometheus. This role involves managing and optimizing time-series databases, ensuring efficient data storage, retrieval, and visualization. Key Responsibilities: Design, implement, and maintain time-series databases using Loki, InfluxDB, and Splunk to store and manage high-velocity time-series data. Develop efficient data ingestion pipelines for time-series data from various sources (e.g., IoT devices, application logs, metrics). Optimize database performance for high write and read throughput, ensuring low latency and high availability. Implement and manage retention policies, downsampling, and data compression strategies to optimize storage and query performance. Collaborate with DevOps and infrastructure teams to deploy and scale time-series databases in cloud or on-premise environments. Build and maintain dashboards and visualization tools (e.g., Grafana) for monitoring and analyzing time-series data. Troubleshoot and resolve issues related to data ingestion, storage, and query performance. Work with development teams to integrate time-series databases into applications and services. Ensure data security, backup, and disaster recovery mechanisms are in place for time-series databases. Stay updated with the latest advancements in time-series database technologies and recommend improvements to existing systems. Key Skills: Strong expertise in Time-Series Databases with Loki (for logs), InfluxDB, and Splunk (for metrics).
Posted 3 months ago
8.0 - 12.0 years
27 Lacs
Hyderabad, Pune, Bengaluru
Work from Office
We are looking for "Sr. IOT Engineer / SME" with Minimum 8 years experience Contact- Atchaya (95001 64554) Required Candidate profile Basic understanding of IoT data routing Experience with databases and storage systems like: InfluxDB, PostgreSQL, Redis Strong knowledge in Azure and Azure Kubernetes Service (AKS)
Posted 3 months ago
3.0 - 8.0 years
0 - 3 Lacs
Hyderabad
Work from Office
Job Summary: We are looking for a Machine Learning Engineer with strong data engineering capabilities to support the development and deployment of predictive models in a smart manufacturing environment. This role involves building robust data pipelines, developing high-accuracy ML models for defect prediction, and implementing automated control systems for real-time corrective actions on the production floor. Key Responsibilities: Data Engineering & Integration: Validate and ensure the correct flow of data from Influx DB/CDL to Smart box/Databricks. Assist data scientists in the initial modeling phase through reliable data provisioning. Provide ongoing support for data pipeline corrections and ad-hoc data extraction. ML Model Development for Defect Prediction: Develop 3 separate ML models for predicting 3 types of defects based on historical data. Predict defect occurrence within a 5-minute window using: Artificial sampling techniques Dimensionality reduction Deliver results with: Accuracy 95% Precision & recall 80% Feature importance insights Closed-Loop Control System Implementation: Prescribe machine setpoint changes based on model outputs to prevent defect occurrence. Design and implement a closed-loop system that includes: Real-time data fetching from production line PLCs (via Influx DB/CDL). Deployment of ML models on Smart box. Pipeline to output recommendations to the appropriate PLC tag. Retraining pipeline triggered by drift detection (cloud-based retraining when recommendations deviate from centerlines). Qualifications: Education: Bachelor's or Masters degree in Computer Science, Data Science, Electrical Engineering, or related field. Technical Skills: Proficient in Python and ML libraries (e.g., scikit-learn, XG Boost, pandas) Experience with: Influx DB and CDL for industrial data integration Smart box and Databricks for model deployment and data processing Real-time data pipelines and industrial control systems (PLCs) Model performance tracking and retraining pipelines Preferred: Experience in manufacturing analytics or predictive maintenance Familiarity with Industry 4.0 principles and edge/cloud hybrid architectures Soft Skills: Strong analytical and problem-solving abilities Effective communication with cross-functional teams (data science, automation, production) Attention to detail and focus on solution reliability
Posted 3 months ago
3.0 - 8.0 years
0 - 3 Lacs
Hyderabad
Work from Office
Job Overview: We are seeking a skilled and proactive Machine Learning Engineer to join our smart manufacturing initiative. You will play a pivotal role in building data pipelines, developing ML models for defect prediction, and implementing closed-loop control systems to improve production quality. Responsibilities: Data Engineering & Pipeline Support: Validate and ensure correct data flow from Influx DB/CDL to Smart box/Databricks platforms. Collaborate with data scientists to support model development through accurate data provisioning. Provide ongoing support in resolving data pipeline issues and performing ad-hoc data extractions. ML Model Development: Develop three distinct ML models to predict different types of defects using historical production data. Predict short-term outcomes (next 5 minutes) using techniques like artificial sampling and dimensionality reduction. Ensure high model performance: Accuracy 95%, Precision & Recall 80%. Extract and present feature importance to support model interpretability. Closed-loop Control Architecture: Implement end-to-end ML-driven automation to proactively correct machine settings based on model predictions. Key architecture components include: Real-time data ingestion from PLCs via Influx DB/CDL. Model deployment and inference on Smart box. Output pipeline to share actionable recommendations via PLC tags. Automated retraining pipeline in the cloud triggered by model drift or recommendation deviations. Qualifications: Proven experience with real-time data streaming from industrial systems (PLCs, Influx DB/CDL). Hands-on experience in building and deploying ML models in production. Strong understanding of data preprocessing, dimensionality reduction, and synthetic data techniques. Familiarity with cloud-based retraining workflows and model performance monitoring. Experience in smart manufacturing or predictive maintenance is a plus.
Posted 3 months ago
6.0 - 9.0 years
32 - 35 Lacs
Noida, Kolkata, Chennai
Work from Office
Dear Candidate, We are hiring a Rust Developer to build safe, concurrent, and high-performance applications for system-level or blockchain development. Key Responsibilities: Develop applications using Rust and its ecosystem (Cargo, Crates) Write memory-safe and zero-cost abstractions for systems or backends Build RESTful APIs, CLI tools, or blockchain smart contracts Optimize performance using async/await and ownership model Ensure safety through unit tests, benchmarks, and fuzzing Required Skills & Qualifications: Proficient in Rust , lifetimes , and borrowing Experience with Tokio , Actix , or Rocket frameworks Familiarity with WebAssembly , blockchain (e.g. Substrate) , or embedded Rust Bonus: Background in C/C++ , systems programming, or cryptography Soft Skills: Strong troubleshooting and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Srinivasa Reddy Kandi Delivery Manager Integra Technologies
Posted 3 months ago
4.0 - 9.0 years
10 - 20 Lacs
Chennai
Work from Office
We are seeking a highly skilled and experienced Senior Full Stack Developer to join our dynamic team. The ideal candidate will be proficient in both front-end and back-end technologies and capable of leading the design, development, and maintenance of scalable web applications. Key Responsibilities: Design and develop robust and scalable web applications using modern frameworks and technologies. Lead the full software development lifecycle from requirements gathering to deployment and maintenance. Collaborate with cross-functional teams including product managers, designers, and QA engineers. Optimize applications for maximum speed and scalability. Ensure code quality through test-driven development and code reviews. Stay current with emerging technologies and best practices. Key Skills & Technologies: Frontend Development : React.js, JavaScript, Next.js Backend Development : Node.js, NestJS Databases : MongoDB Other Technologies : Redis, Kafka, InfluxDB, WebSocket Additional Skills : Typescript, Architectural Design Preferred (but not required) Skills: Experience with Blockchain and Cryptocurrency technologies Knowledge of Artificial Intelligence concepts and applications Requirements: 5+ years of experience in full stack development. Strong problem-solving skills and attention to detail. Proven experience with RESTful APIs and modern application architectures. Excellent communication skills and the ability to mentor junior developers. Why Join Us? Work with a forward-thinking, innovative team. Competitive salary and benefits. Opportunities for growth and learning.
Posted 3 months ago
5.0 - 10.0 years
15 - 25 Lacs
Bengaluru
Hybrid
Job description Walk in interview on 31st May 25 - Azure Devops Engineer - Bangalore Years of Experience - 8 to 12 Years Work mode: Hybrid Interview Date - 31st May 2025, Saturday Time of Interview - 9.30 AM To 4.00 PM Kindly carry 2 hard copy of resume Interview Location: Arrow Electronics India Pvt Ltd, Rockline Seethalaxmi (SKAV) Building, Kasturba Road, Shanthala Nagar, Opp to Vishweshwaraya Museum, Bengaluru - 560001 What youll be doing: Principal Accountabilities Designs and develops software solutions to meet business requirements. Manages full software development life cycle including testing, implementation, and auditing. Performs product design, bug verification, and beta support, which may require research and analysis. Operates under moderate supervision. Usually reports to the Manager of Software Development. Execute, assess, and troubleshoot software programs and applications. Analyze and amend software errors in a timely and accurate fashion. Coding, developing, and documenting software specifications throughout the project life cycle. Participate in software upgrades, revisions, fixes and patches as mandated by the vendor. Job Complexity Requires in-depth knowledge and experience Solves complex problems; takes a new perspective using existing solutions Works independently; receives minimal guidance Acts as a resource for colleagues with less experience Represents the level at which career may stabilize for many years or even until retirement Contributes to process improvements Typically resolves problems using existing solutions Provides informal guidance to junior staff Works with minimal guidance What we are looking for: Typically requires 5–7 years of related experience with a 4 year degree; or 3 years and an advanced degree; or equivalent work experience. Experience with Influx DB and Flux language PowerShell + SQL query experience Experience with administrating Telegraph agents Experience as administrating Grafana Building dashboards Creating alerts Log collections (Loki logs) Experience with monitoring and alerting tools such as: InfluxDB Grafana Loki Logs Candidate should possess good communication skills Self-driven, Bottom line oriented and take ownership of tasks assigned Effective working relationships with all functional units of the organization, Ability to work as part of a cross-cultural team including flexibility to support multiple locations when necessary, Excellent interpersonal skills in areas such as teamwork, facilitation, and negotiation, and able to work independently or as part of a team. Excellent problem-solving skills and the ability to work efficiently Working knowledge of Azure DevOps GIT
Posted 3 months ago
6.0 - 9.0 years
32 - 35 Lacs
Noida, Kolkata, Chennai
Work from Office
Dear Candidate, We are hiring a Lua Developer to create lightweight scripting layers in games, embedded systems, or automation tools. Key Responsibilities: Develop scripts and integrations using Lua Embed Lua in C/C++ applications for extensibility Write custom modules or bindings for game engines or IoT devices Optimize Lua code for memory and execution time Integrate with APIs, data sources, or hardware systems Required Skills & Qualifications: Proficient in Lua and its integration with host languages Experience with Love2D , Corona SDK , or custom engines Familiarity with C/C++ , embedded Linux , or IoT Bonus: Game scripting or automation experience Soft Skills: Strong troubleshooting and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Srinivasa Reddy Kandi Delivery Manager Integra Technologies
Posted 3 months ago
12.0 - 20.0 years
45 - 65 Lacs
Bengaluru
Work from Office
Lead OT, IIoT, XR, and real-time data strategy across digital platforms and agile teams. Required Candidate profile 12–18 yrs in OT strategy, real-time data systems, Kafka/Spark, edge compute, global team management.
Posted 3 months ago
3.0 - 7.0 years
15 - 20 Lacs
Pune
Work from Office
What Youll Do - Configure and manage observability agents across AWS, Azure & GCP - Use IaC techniques and tools such as Terraform, Helm & GitOps, to automate deployment of Observability stack - Experience with different language stacks such as Java, Ruby, Python and Go - Instrument services using OpenTelemetry and integrate telemetry pipelines - Optimize telemetry metrics storage using time-series databases such as Mimir & NoSQL DBs - Create dashboards, set up alerts, and track SLIs/SLOs - Enable RCA and incident response using observability data - Secure the observability pipeline You Bring - BE/BTech/MTech (CS/IT or MCA), with an emphasis in Software Engineering - Strong skills in reading and interpreting logs, metrics, and traces - Proficiency with LGTM (Loki, Grafana, Tempo, Mimi) or similar stack, Jaeger, Datadog, Zipkin, InfluxDB etc. - Familiarity with log frameworks such as log4j, lograge, Zerolog, loguru etc. - Knowledge of OpenTelemetry, IaC, and security best practices - Clear documentation of observability processes, logging standards & instrumentation guidelines - Ability to proactively identify, debug, and resolve issues using observability data - Focused on maintaining data quality and integrity across the observability pipeline
Posted 3 months ago
6.0 - 9.0 years
8 - 11 Lacs
hyderabad
Hybrid
We are seeking a skilled Database Specialist with strong expertise in Time-Series Databases, specifically Loki for logs, InfluxDB, and Splunk for metrics. The ideal candidate will have a solid background in query languages, Grafana, Alert Manager, and Prometheus. This role involves managing and optimizing time-series databases, ensuring efficient data storage, retrieval, and visualization. Key Responsibilities: Design, implement, and maintain time-series databases using Loki, InfluxDB, and Splunk to store and manage high-velocity time-series data. Develop efficient data ingestion pipelines for time-series data from various sources (e.g., IoT devices, application logs, metrics). Optimize database performance for high write and read throughput, ensuring low latency and high availability. Implement and manage retention policies, downsampling, and data compression strategies to optimize storage and query performance. Collaborate with DevOps and infrastructure teams to deploy and scale time-series databases in cloud or on-premise environments. Build and maintain dashboards and visualization tools (e.g., Grafana) for monitoring and analyzing time-series data. Troubleshoot and resolve issues related to data ingestion, storage, and query performance. Work with development teams to integrate time-series databases into applications and services. Ensure data security, backup, and disaster recovery mechanisms are in place for time-series databases. Stay updated with the latest advancements in time-series database technologies and recommend improvements to existing systems. Key Skills: Strong expertise in Time-Series Databases with Loki (for logs), InfluxDB, and Splunk (for metrics).
Posted Date not available
5.0 - 9.0 years
15 - 25 Lacs
pune, chennai, bengaluru
Work from Office
Years of exp: 4 plus years (Relevant Exp ) Location: bang/Pune/Hyderabad/Chennai NP: Immediate/15 -20 days Passport Mandatory Role & responsibilities Grafana JD Splunk Query Skills (SPL - Search Processing Language) : Proficiency in creating complex searches, using commands like eval, stats, and regex for data filtering, aggregation, and visualization. Experience with Splunk components (e.g., Search and Reporting app), building reports, dashboards, and handling machine data analysis. InfluxDB Experience (Mimicking Splunk Queries) : Knowledge of InfluxDB as a scalable time-series database for metrics, events, and real-time data ingest. Ability to "mimic" Splunk-like queries using InfluxQL (InfluxDB's SQL-like query language) or Flux for data transformation, aggregation, and schema browsing. Grafana Dashboard Customization : Knowledge in building and modifying dashboards, including adding visualizations, variables, templating, and integrating data sources like InfluxDB or Prometheus. Customizing home dashboards, panels (e.g., graphs, tables), time settings (time-based data plots), andvrole-based views for better monitoring.
Posted Date not available
10.0 - 13.0 years
20 - 25 Lacs
pune
Work from Office
Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And were only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all. Here, we know that youre more than your work. Thats why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If youre passionate about our purpose people then we cant wait to support whatever gives you purpose. Were united by purpose, inspired by you. Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation. Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an automate everything mindset, helping us bring value to our customers by deploying services with incredible speed, consistency and availability. Primary/Essential Duties and Key Responsibilities: Proficient in Splunk/ELK, and Datadog. Experience with observability tools such as Prometheus/InfluxDB, and Grafana. Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages. Design, develop, and maintain observability tools and infrastructure. Collaborate with other teams to ensure observability best practices are followed. Develop and maintain dashboards and alerts for monitoring system health. Troubleshoot and resolve issues related to observability tools and infrastructure. Engage in and improve the lifecycle of services from conception to EOL, includingsystem design consulting, and capacity planning Define and implement standards and best practices related toSystem Architecture, Service delivery, metrics and the automation of operational tasks Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response. Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis Collaborate closely with engineering professionals within the organization to deliver reliable services Identify and eliminate operational toil by treating operational challenges as a software engineering problem Actively participate in incident response, including on-call responsibilities Partner with stakeholders to influence and help drive the best possible technical and business outcomes Guide junior team members and serve as a champion for Site Reliability Engineering Engineering degree, or a related technical discipline, and 10+years of experience in SRE. Experience coding in higher-level languages (e.g., Python, Javascript, C++, or Java) Knowledge of Cloud based applications & Containerization Technologies Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing Ability to analyze current technology utilized and engineering practices within the company and develop steps and processes to improve and expand upon them Working experience with industry standards like Terraform, Ansible. (Experience, Education, Certification, License and Training) Must have hands-on experience working within Engineering or Cloud. Experience with public cloud platforms (e.g. GCP, AWS, Azure) Experience in configuration and maintenance of applications & systems infrastructure. Experience with distributed system design and architecture Experience building and managing CI/CD Pipelines Where were going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet its our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! UKG is proud to be an equal opportunity employer and is committed to promoting diversity and inclusion in the workplace, including the recruitment process. Disability Accommodation For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com
Posted Date not available
10.0 - 13.0 years
20 - 25 Lacs
pune
Work from Office
Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And were only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all. Here, we know that youre more than your work. Thats why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If youre passionate about our purpose people then we cant wait to support whatever gives you purpose. Were united by purpose, inspired by you. Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation. Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an automate everything mindset, helping us bring value to our customers by deploying services with incredible speed, consistency and availability. Primary/Essential Duties and Key Responsibilities: Proficient in Splunk/ELK, and Datadog. Experience with observability tools such as Prometheus/InfluxDB, and Grafana. Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages. Design, develop, and maintain observability tools and infrastructure. Collaborate with other teams to ensure observability best practices are followed. Develop and maintain dashboards and alerts for monitoring system health. Troubleshoot and resolve issues related to observability tools and infrastructure. Engage in and improve the lifecycle of services from conception to EOL, includingsystem design consulting, and capacity planning Define and implement standards and best practices related toSystem Architecture, Service delivery, metrics and the automation of operational tasks Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response. Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis Collaborate closely with engineering professionals within the organization to deliver reliable services Identify and eliminate operational toil by treating operational challenges as a software engineering problem Actively participate in incident response, including on-call responsibilities Partner with stakeholders to influence and help drive the best possible technical and business outcomes Guide junior team members and serve as a champion for Site Reliability Engineering Engineering degree, or a related technical discipline, and 10+years of experience in SRE. Experience coding in higher-level languages (e.g., Python, Javascript, C++, or Java) Knowledge of Cloud based applications & Containerization Technologies Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing Ability to analyze current technology utilized and engineering practices within the company and develop steps and processes to improve and expand upon them Working experience with industry standards like Terraform, Ansible. (Experience, Education, Certification, License and Training) Must have hands-on experience working within Engineering or Cloud. Experience with public cloud platforms (e.g. GCP, AWS, Azure) Experience in configuration and maintenance of applications & systems infrastructure. Experience with distributed system design and architecture Experience building and managing CI/CD Pipelines Where were going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet its our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! UKG is proud to be an equal opportunity employer and is committed to promoting diversity and inclusion in the workplace, including the recruitment process. Disability Accommodation For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com
Posted Date not available
10.0 - 15.0 years
12 - 17 Lacs
pune
Work from Office
Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we're only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieveRead on. Here, we know that you're more than your work. That's why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose "” a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If you're passionate about our purpose "” people "”then we can't wait to support whatever gives you purpose. We're united by purpose, inspired by you. Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation. Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an "automate everything" mindset, helping us bring value to our customers by deploying services with incredible speed, consistency and availability. Primary/Essential Duties and Key Responsibilities: Proficient in Splunk/ELK, and Datadog. Experience with observability tools such as Prometheus/InfluxDB, and Grafana. Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages. Design, develop, and maintain observability tools and infrastructure. Collaborate with other teams to ensure observability best practices are followed. Develop and maintain dashboards and alerts for monitoring system health. Troubleshoot and resolve issues related to observability tools and infrastructure. Engage in and improve the lifecycle of services from conception to EOL, includingsystem design consulting, and capacity planning Define and implement standards and best practices related toSystem Architecture, Service delivery, metrics and the automation of operational tasks Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response. Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis Collaborate closely with engineering professionals within the organization to deliver reliable services Identify and eliminate operational toil by treating operational challenges as a software engineering problem Actively participate in incident response, including on-call responsibilities Partner with stakeholders to influence and help drive the best possible technical and business outcomes Guide junior team members and serve as a champion for Site Reliability Engineering Engineering degree, or a related technical discipline, and 10+years of experience in SRE. Experience coding in higher-level languages (e.g., Python, Javascript, C++, or Java) Knowledge of Cloud based applications & Containerization Technologies Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing Ability to analyze current technology utilized and engineering practices within the company and develop steps and processes to improve and expand upon them Working experience with industry standards like Terraform, Ansible. (Experience, Education, Certification, License and Training) Must have hands-on experience working within Engineering or Cloud. Experience with public cloud platforms (e.g. GCP, AWS, Azure) Experience in configuration and maintenance of applications & systems infrastructure. Experience with distributed system design and architecture Experience building and managing CI/CD Pipelines Where we're going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it's our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! Disability Accommodation UKGCareers@ukg.com
Posted Date not available
5.0 - 10.0 years
8 - 18 Lacs
hyderabad
Remote
We are looking for a skilled Grafana Developer with end-to-end expertise in designing, building, and managing Grafana dashboards and widgets. The ideal candidate will have strong experience in integrating Grafana with PostgreSQL/TimescaleDB, writing complex SQL queries for data visualization, and optimizing performance for large-scale datasets. Experience in Telecom or IoT domains is a plus. Key Responsibilities Design, develop, and maintain interactive Grafana dashboards and widgets for real-time and historical data visualization. Write complex SQL queries (PostgreSQL/TimescaleDB) to extract, transform, and aggregate data for Grafana visualizations. Experience in Grafana panels, variables, templating, annotations, and time ranges . Optimize database performance, including query tuning, indexing, and partitioning strategies for time-series data. Experience with data sources (Prometheus, InfluxDB, MySQL, PostgreSQL, TimescaleDB Elasticsearch, Loki, etc.). Custom plugins or extensions Experience in Grafana using Grafana Plugin SDK and Grafana Toolkit, Grafana DataFrames Work closely with stakeholders to understand business requirements and translate them into effective dashboards. Ensure high availability, scalability, and security of Grafana deployments. Required Skills & Qualifications 4+ years of hands-on experience in Grafana dashboard development (building, customizing, and managing dashboards). Strong expertise in PostgreSQL/TimescaleDB, including writing complex analytical queries for time-series data. Proficiency in SQL optimization, indexing, and performance tuning for large datasets. Experience with time-series databases and data modeling. Knowledge of REST APIs, JSON, and scripting (Python/Bash) for data processing. Familiarity with Telecom or IoT data (e.g., network KPIs, sensor data, logs) is a plus. Understanding of monitoring and observability best practices. Experience with Grafana Alerting and Annotations is desirable. Creating dynamic dashboards using variables ($variable syntax). Using custom variable types (Query, Custom, Interval, Datasource, etc.). Implementing reusable dashboard templates . Good to Have Experience in Telecom, IoT, or Industrial Monitoring projects. Knowledge of Prometheus, InfluxDB, or other time-series databases. Familiarity with Docker, Kubernetes, and cloud platforms (AWS/GCP/Azure). Experience with Grafana Loki for log aggregation. Automate dashboard provisioning and management using Infrastructure as Code (IaC) tools like Terraform or Ansible (good to have). Collaborate with DevOps and backend teams to ensure seamless data pipeline integration.
Posted Date not available
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
73564 Jobs | Dublin
Wipro
27625 Jobs | Bengaluru
Accenture in India
22690 Jobs | Dublin 2
EY
20638 Jobs | London
Uplers
15021 Jobs | Ahmedabad
Bajaj Finserv
14304 Jobs |
IBM
14148 Jobs | Armonk
Accenture services Pvt Ltd
13138 Jobs |
Capgemini
12942 Jobs | Paris,France
Amazon.com
12683 Jobs |