Diyar United Company

5 Job openings at Diyar United Company
Application Monitoring Specialist (Dynatrace) - Remote Egypt/India | India | 5 years | Not disclosed | Remote | Full Time

Key Responsibilities:

Monitoring Setup and Maintenance
• Design, deploy, and maintain Dynatrace OneAgent across multiple environments (cloud, on-prem, hybrid).
• Configure custom dashboards, synthetic monitors, custom metrics, and alerts for proactive monitoring.
• Establish application performance baselines and implement service-level objectives (SLOs) and indicators (SLIs).

Performance Monitoring & Troubleshooting
• Analyze application and infrastructure performance metrics to identify bottlenecks and issues.
• Investigate and troubleshoot performance anomalies across web, backend, database, and infrastructure layers.
• Provide root cause analysis (RCA) and recommendations for performance improvement.

Collaboration & Communication
• Work with DevOps, SREs, Developers, and QA teams to embed monitoring into CI/CD pipelines.
• Provide actionable insights during incident response and post-mortems.
• Act as a subject matter expert for Dynatrace across internal teams.

Reporting and Documentation
• Generate regular performance and uptime reports for stakeholders.
• Maintain detailed documentation for monitoring architecture, policies, and incident playbooks.

Required Skills & Qualifications:
• 5+ years of experience in application performance monitoring (APM), with at least 3 years in Dynatrace.
• Deep understanding of Dynatrace capabilities: Smartscape, PurePath, Synthetic Monitoring, Davis AI.
• Dynatrace certifications preferred (Associate, Professional, or Advanced).

Power BI Developer - Offshore and Onsite Qatar | Doha, Qatar | 4 - 5 years | Not disclosed | On-site | Full Time

Scope of Work:

Power BI Skills:
• Advanced Data Modeling: Expertise in creating and optimizing complex data models. Proficiency in managing data relationships and hierarchies.
• Power BI Desktop Mastery: Advanced skills in using Power BI Desktop for data visualization. Ability to create interactive and dynamic reports and dashboards.
• Data Transformation and ETL: Strong experience in data extraction, transformation, and loading (ETL) processes. Proficiency in using M language for data shaping.
• Advanced Analytics and DAX: Mastery of Data Analysis Expressions (DAX) for complex calculations. Experience with advanced analytics techniques and predictive modeling.
• Performance Optimization: Skills in optimizing report performance. Experience in troubleshooting and resolving performance issues.

The ideal candidate should have at least 4-5 years of experience in the domain, speak excellent English, and have strong communication skills. Additionally, prior experience with Microsoft Power Apps and integration between the two technologies would be a valuable advantage.

Sr. Infra Support Engineer - Remote / Immediate Joiner | Hyderabad, Telangana, India | 5 years | Not disclosed | Remote | Full Time

Basic Scope of Job

As a Cloud & Server Engineer, you will be responsible for the administration, support, and optimization of Azure cloud, on-prem server, and Kubernetes cluster environments. You will take ownership of incidents, execute infrastructure changes, and contribute to the design, implementation, and maintenance of core infrastructure services including cloud networking, storage, and backup solutions. You will also drive improvements in system performance, security, and cost optimization across both cloud and on-prem platforms.

Duties & Responsibilities

Cloud Management & Support
• Manage and support on-prem and Azure server infrastructure (VMs, OS, backups, storage, networking) and Kubernetes clusters with Rancher.
• Monitor cost and implement Azure governance practices (e.g., tagging, reserved instances).
• Maintain cloud security posture (e.g., pfSense, firewalls, identity/access).
• Automate operational tasks using scripting tools (PowerShell, Azure CLI, Logic Apps, Ansible).
• Perform patch management on Linux systems and ensure security compliance across environments.
• Monitor the system using tools such as Grafana, CheckMK, and Huawei DigitalView.
• Contribute to monthly/quarterly health reports and environment reviews.

Cloud Infra & Kubernetes Cluster Administration

1. Management & Maintenance
• Provision VMs; install and configure development, staging, and production environments.
• Keep the virtual environment up to date and healthy with routine maintenance and housekeeping activities, and coordinate with vendors to solve any infrastructure-related issues.
• Set up virtual machines based on the demands of various workloads, including assigning virtual CPUs, memory, and storage.
• Establish virtual networks, VLANs, and subnets to ensure that VMs and applications can communicate securely and efficiently.
• Manage virtual storage resources, ensuring high availability (HA), redundancy, and optimization based on different storage tiers (SSD, HDD).
• Support and manage M365.
• Manage user roles, privileges, and multi-factor authentication to ensure that only authorized personnel can make changes or access critical resources.
• Create and configure Kubernetes clusters.
• Set up authentication, authorization, and cluster monitoring and logging.
• Monitor cluster health and performance using Prometheus or Grafana.
• Set up centralized logging.
• Support the configuration of alerting (Prometheus).
• Perform cluster upgrades and patching.
• Upgrade Kubernetes versions and components.
• Apply security patches to Kubernetes and container runtimes.
• Support scaling and resource management.
• Set resource requests and limits for containers.
• Manage node and pod failure handling (rescheduling).
• Test disaster recovery and backups.
• Manage Secrets and sensitive data.
• Implement network policies for communication control.
• Provide support during application deployments.
• Set up Load Balancers and Services.

2. Performance Tuning
• Regularly monitor CPU, memory, storage, and network utilization to prevent bottlenecks or resource exhaustion.
• Run diagnostic tools to ensure system health and to preemptively address potential issues in hardware or software.

3. Backup and Recovery
• Design and implement regular backup strategies based on best practices.
• Backup Job Setup: Configure backup jobs to define the schedule, retention policy, and target repository for backup data.
• Backup Scheduling: Set up daily, weekly, or on-demand backups depending on the business needs.
• Backup Integrity Check: Regularly verify that backups are successful and free from errors by running backup verification jobs.
• SureBackup: Test backups in an isolated environment to ensure that they are recoverable and operational.
• Restore Testing: Periodically restore files or entire virtual machines (VMs) to validate that the restore process works smoothly and quickly.
• Replication Jobs: Configure replication of VMs to another site for disaster recovery (DR) purposes.
• Failover and Failback: Test and perform failover to replicated environments in the event of a disaster, and failback to the primary site once the issue is resolved.

4. Security and Access Control
• Manage user roles and privileges using least-privilege principles.
• Perform security hardening and compliance.
• Implement SIEM on the system.

5. Replication and High Availability
• Configure the infrastructure's native HA features for automatic failover of VMs, and use replication or DR tools to ensure business continuity in case of a site failure.

6. Monitoring and Alerting
• Use tools like Grafana and CheckMK to monitor health and performance.
• Monitor logs and alerts to detect anomalies or failures in the infrastructure.

7. Automation and Scripting
• Automate routine tasks using Ansible.
• Schedule recurring jobs with cron or orchestration tools like Airflow.

8. Documentation and Standards
• Maintain detailed documentation of cloud environments and procedures.
• Document incidents, solutions, and changes made during the troubleshooting process for accountability and future reference.

Analytics & Visualization
• Analyze complex datasets to uncover trends, patterns, and actionable insights.
• Translate stakeholder requirements into KPIs, reports, and dashboards.
• Design, build, and maintain dashboards.
• Manage reporting layers including deployment, version control, and performance tuning.
• Collaborate with Product Owners, Engineers, and Data Scientists to align data strategy with business goals.

Stakeholder Collaboration & Leadership
• Act as a liaison between technical teams and business stakeholders.
• Partner with internal teams to gather and refine reporting requirements.
• Mentor junior analysts and support a culture of data literacy.
• Standardize reporting across departments and ensure alignment with company-wide metrics.

Education & Qualification
• 5+ years of experience in infrastructure or cloud and Kubernetes administration roles.
• Strong experience in Unix OS and Azure IaaS components.
• Skilled in troubleshooting and resolving system and cloud performance issues.
• Experience with patching, backup (Veeam), DR (Azure Backup & ASR), and automation.
• Familiarity with scripting languages like Python or Bash and the automation tool Ansible.
• Familiarity with monitoring platforms (Grafana, CheckMK).
• Knowledge of ITIL processes and change management.
• Relevant certifications preferred (e.g., RHCSA, CKA, AZ-900, AZ-104, AZ-500, AZ-700, VMware Certified Technical Associate (VCTA), VMware Certified Professional (VCP), Microsoft 365 Certified: Fundamentals (MS-900)).

Please share CV to Z.Uddin@diyarme.com

Senior Provisioning Support Engineer - Remote | Hyderabad, Telangana, India | 5 years | Not disclosed | Remote | Full Time

Basic Scope of Job:

Assist in maintaining the Nokia Instalink provisioning systems, the Mobile Number Portability platform, and provisioning tools to ensure stable operation, and handle configuration requests that support the telecom operator's business requirements.

Principal Duties and Responsibilities:
• Manage and support the Nokia Instalink Provisioning System for seamless service provisioning (voice, data, broadband).
• Monitor provisioning workflows, logs, and service activations to identify and resolve issues promptly.
• Troubleshoot provisioning failures and escalate critical issues to vendors when required.
• Maintain integrations between Nokia Instalink and OSS/BSS platforms, ensuring smooth communication with upstream and downstream systems (e.g., CRM, billing systems, HSS, network elements).
• Test, validate, and implement new features, patches, and configurations.
• Investigate system alerts, errors, and failures to ensure seamless provisioning operations.
• Provide 24x7 support to internal customers for system and application-related issues.
• Perform system maintenance, backups, and upgrades to ensure platform availability.
• Configure, schedule, and monitor provisioning processes to meet SLAs/OLAs.
• Implement vendor-provided configuration changes and run acceptance tests for new solutions and services.
• Automate tasks and extract information by developing and implementing scripts.
• Perform regular reconciliations to ensure data integrity across provisioning systems.
• Analyze system performance, run traces, troubleshoot reported incidents, and resolve issues.
• Prepare and analyze reports, statistics, and trends for provisioning platforms.
• Support commercial teams with new requests, ideas, and marketing requirements.
• Facilitate project implementation across other divisions and departments.
• Continuously improve operational processes by studying system performance and recommending redesigns.
• Ensure compliance with company Information Security Policies, maintaining confidentiality, integrity, and security of all information assets.
• Meet quality, performance, and KPI targets while providing supporting documentation.

Education & Qualification:
• First degree or equivalent in a relevant discipline.
• Minimum of 2–5 years of experience in telecom provisioning systems support.
• Experience working with Nokia solutions (e.g., Instalink, NetAct).
• Strong background in troubleshooting provisioning failures and workflow disruptions.

Experience:
• Minimum of six (6) years relevant work experience.
• Hands-on experience with Nokia Instalink Provisioning Systems.
• Strong understanding of telecom provisioning processes and protocols.
• Proficiency in SQL for query building and troubleshooting.
• Knowledge of telecom technologies (2G/3G/4G/5G, IP networks, broadband systems).
• Experience with system integration and APIs.
• Proficiency in automation and scripting (e.g., Python, Shell).
• Experience with monitoring and ticketing tools (e.g., Nagios, ServiceNow, Jira).
• Familiarity with GSM networking principles and protocols.
• Strong knowledge of UNIX/Linux administration and shell scripting.
• Good understanding of databases (e.g., Oracle, SQL).
• Familiarity with technologies like SOAP, Java, and CORBA.
• Strong troubleshooting and problem-solving skills.
• Proficiency in Arabic, French, and English.

Interested candidates, please share your CV: Z.Uddin@diyarme.com