Lead System Engineer - Unix/Linux, VMware & Ansible

8 years

0 Lacs

Posted:17 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Overview

  • Blue Yonder is the proven leader in artificial intelligence and machine learning (AI/ML)-driven supply chain and retail solutions for 4,000 of the world’s leading retail, manufacturing, and logistics companies. Blue Yonder’s world-class client brands include 75 of the top 100 retailers, 77 of the top 100 consumer goods companies, and 8 of the top 10 global 3PLs. Running Blue Yonder, you can plan to deliver.
  • The Candidate should have prior background working in IT Infrastructure and should have a solid high-level understanding of the underlying IT principles (Systems, Storage, backup, IaC, SaaS, Virtualization, Kubernetes, Containers, CI/CD). Lead System Engineer act as an escalation point for critical issues, ensure systems are secure and compliant through proactive patching, and collaborate across teams to maintain reliable and resilient IT services.

Scope

  • Manage, configure, Optimize and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows servers and VMware virtualization environments (vSphere, ESXi, vCenter)
  • Maintain OS patching, upgrades, and compliance across environments.
  • Develop and maintain automation frameworks for system provisioning, configuration, and operations using tools such as Ansible, Terraform, or scripting (Python, Shell, PowerShell).
  • Implement self-service and automated workflows for routine operational tasks.
  • Drive continuous improvement by identifying opportunities to reduce manual work and enhance system efficiency.
  • Prioritize workload and resolve any technical issues/roadblocks
  • Solid skills in logical troubleshooting, communication, documentation and problem resolution
  • Create and update application run books & appropriate technical documentation
  • Ensure all release processes, policies and procedures are properly communicated and documented
  • Administer and optimize enterprise storage solutions, with a focus on NetApp ONTAP storage systems. This includes managing LUNs, volumes, SAN/NAS protocols, and performance tuning
  • Lead the strategy and operations for data protection using Commvault or similar backup solutions. This includes managing backup policies, performing data restores, and conducting regular disaster recovery testing.
  • Own the end-to-end patch management process for servers, virtualization, and storage.
  • Coordinate and execute patching schedules while minimizing downtime.
  • Conduct root cause analysis (RCA) and implement preventive measures.
  • Ensure effective monitoring, alerting, and incident response for critical infrastructure.
  • Participate in on-call support rotation for high-priority issues.
  • Actively engage in CI/CD, Agile and DevOps process, participate regularly in planning and releases
  • Assist in establishing and enforcing standards that will improve the ease of automating the build process and the development environments
  • Manage and maintain enterprise infrastructure tools as the primary subject matter expert
  • Automate, deploy and manage virtualization infrastructure
  • Provide support, and implementation of security policies, compliance, governance and best practices

Our Current Technical Environment

  • Operating System: Windows & Linux
  • Hyper converged Environment: VMWare
  • Programming languages: Python, PowerShell, and Shell scripting
  • Cloud Architecture: MS Azure (Terraform, ARM templates, AKS, Virtual Networks, Azure AD)
  • Configuration management tools: Ansible and Terraform
  • DevOps Tools: GIT, GitLab/GitHub and Docker
  • Storage: NetApp

What You’ll Do

  • Manage, configure, Optimize and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows servers and VMware virtualization environments (vSphere, ESXi, vCenter)
  • Maintain OS patching, upgrades, and compliance across environments.
  • Develop and maintain automation frameworks for system provisioning, configuration, and operations using tools such as Ansible, Terraform, or scripting (Python, Shell, PowerShell).
  • Implement self-service and automated workflows for routine operational tasks.
  • Drive continuous improvement by identifying opportunities to reduce manual work and enhance system efficiency.
  • Prioritize workload and resolve any technical issues/roadblocks
  • Solid skills in logical troubleshooting, communication, documentation and problem resolution
  • Create and update application run books & appropriate technical documentation
  • Ensure all release processes, policies and procedures are properly communicated and documented
  • Administer and optimize enterprise storage solutions, with a focus on NetApp ONTAP storage systems. This includes managing LUNs, volumes, SAN/NAS protocols, and performance tuning
  • Lead the strategy and operations for data protection using Commvault or similar backup solutions. This includes managing backup policies, performing data restores, and conducting regular disaster recovery testing.
  • Own the end-to-end patch management process for servers, virtualization, and storage.
  • Coordinate and execute patching schedules while minimizing downtime.
  • Conduct root cause analysis (RCA) and implement preventive measures.
  • Ensure effective monitoring, alerting, and incident response for critical infrastructure.
  • Participate in on-call support rotation for high-priority issues.
  • Actively engage in CI/CD, Agile and DevOps process, participate regularly in planning and releases
  • Assist in establishing and enforcing standards that will improve the ease of automating the build process and the development environments
  • Manage and maintain enterprise infrastructure tools as the primary subject matter expert
  • Automate, deploy and manage virtualization infrastructure

What We Are Looking For

  • Bachelor’s degree in computer science, MIS or engineering related field or equivalent work experience
  • 8+ years of combined related work experience
  • 6+ years of experience in Unix/Linux system engineering (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows.
  • 5+ years of experience with VMware technologies (vSphere, ESXi, vCenter).
  • 3+ years working experience with Ansible configuration and orchestration
  • Strong scripting and automation skills (Python, Bash, Shell, PowerShell, Ansible or Terraform).
  • Solid knowledge of storage, networking, backup, and security concepts.
  • Experience managing hybrid environments (on-premises + cloud, preferably Azure).
  • Experience with container platforms (Docker, Kubernetes).
  • Experience in Cloud Technologies – Private, Public, Hybrid, IaaS+, PaaS, SaaS
  • Experience working with CI/CD tools and Git
  • Intermediate knowledge of Networking (VLAN, sub netting, routing and switching)
  • Ability to interact with various levels of professionals
  • Ability to work under pressure in a fast-paced environment and meet tight deadlines
  • Ability to act independently to drive IT goals and changes
  • Identify and escalate situations requiring urgent attention
  • Proficiency in operating system and software
  • Willing to work under different technologies and take up new technology responsibilities outside the core skills
  • Demonstrable experience with Continuous Integration/Delivery principles (ci/cd) and implementation
  • Strong Scripting experience like Python, Bash Shell, PowerShell, etc.
  • Solid understanding of Restful APIs
  • Advanced troubleshooting methodology
  • Ability to judge priorities and adjust their work accordingly
  • Experience with the implementation and use of different Application (APM) and/or Infrastructure monitoring tools.
  • Being able to work cross platform, with Windows and Linux. This helps understand hybrid platform environment and thus helps design considerations. Certification is preferable (RHCE or likewise).
  • Knowledge of ITIL processes (incident, problem, change management).
  • Knowledge of protocols: HTTP, SSL, SSH, WINRM, JMS, JDBC, REST API (ServiceNow and AWX/Tower), etc.
  • Familiarity with observability and analysis solutions such as Elastic and Datadog.
  • Experience in automation of key functions, including back-up, continuous integration, provisioning is a huge plus
  • Fluent English and high oral and written communication

Our Values

If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success – and the success of our customers. Does your heart beat like ours? Find out here: Core ValuesAll qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Blue Yonder logo
Blue Yonder

Supply Chain Management/Technology

Scottsdale

RecommendedJobs for You