Associate Architect-SRE-3

0 years

0 Lacs

Posted:1 day ago| Platform: SimplyHired logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Description

At CDW, we make it happen, together. Trust, connection, and commitment are at the heart of how we work together to deliver for our customers. It’s why we’re coworkers, not just employees. Coworkers who genuinely believe in supporting our customers and one another. We collectively forge our path forward with a level of commitment that speaks to who we are and where we’re headed. We’re proud to share our story and Make Amazing Happen at CDW.

Site Reliability Engineer (SRE) – Managed Services

A culture of diverse perspectives. Coworkers who collaborate to go above and beyond. Motivated individuals who lead by example. This is what CDW is about. Our legacy of innovative thinking and vision for customer-centric technology position us for continued success in our industry—and for you in your career.

As part of our Managed Services organization, you’ll play a critical role in ensuring the reliability, scalability, and performance of our systems, bridging the gap between software engineering and infrastructure operations.


Tap your technical reliability skills in this technical role at CDW.

You will be responsible for maintaining the operational excellence and reliability of Managed Services software and infrastructure—on-premises and in the cloud. You’ll drive improvements through automation, monitoring, and deep technical troubleshooting, while mentoring less experienced staff and providing escalation paths for critical issues. You’ll enforce best practices in system architecture, reliability, and service performance.


What you will do:

Work with a variety of tools and technologies to ensure the reliability and performance of the Managed Services organization. Drive initiatives to optimize and secure infrastructure operations while contributing to scalability and continuous improvement efforts.

  • Maintain and ensure operational excellence of systems and applications supporting Managed Services infrastructure.
  • Troubleshoot and resolve issues in a fast-paced, distributed environment, focusing on root cause analysis and post-incident reviews.
  • Build and maintain observability frameworks using tools like Prometheus, OpenTelemetry, and Dynatrace to ensure reliable monitoring of systems and applications.
  • Define and track Service Level Objectives (SLOs), Service Level Indicators (SLIs), and manage Error Budgets to balance reliability and innovation.
  • Use automation tools like Ansible and scripting languages like Python to reduce operational toil and optimize processes.
  • Manage and troubleshoot Kubernetes clusters and containerized environments, ensuring smooth service-to-service communication.
  • Diagnose and resolve networking issues across OSI layers 1-3 on systems using including packet capture analysis.
  • Oversee PKI certificate management and utilize tools like HashiCorp Vault for secrets management.
  • Collaborate with cross-functional teams to identify system improvements and resolve critical issues.
  • Provide mentoring and guidance to junior engineers, ensuring knowledge sharing and professional growth.


Required Qualifications and Skills:

Core Technical Skills:

  • Proficiency in Linux administration (system tuning, SSH, log analysis with tools like grep and regex).
  • Strong understanding of networking protocols and troubleshooting (Layer 1-3).
  • Hands-on experience with Kubernetes and container orchestration.
  • Automation experience with Ansible and scripting proficiency in Python.
  • Knowledge of PKI certificate management and HashiCorp Vault or similar tools for secrets management.
  • Expertise in monitoring and observability tools like Prometheus, Grafana, or Dynatrace.

Reliability Engineering Skills:

  • Experience defining and managing SLIs, SLOs, and Error Budgets.
  • Proven ability to instrument and analyze system performance metrics.
  • Deep familiarity with troubleshooting distributed systems and microservices architecture.

Soft Skills:

  • Strong initiative and curiosity for deep-diving into complex issues.
  • Excellent communication skills to convey solutions and collaborate across teams.
  • Ability to prioritize and resolve competing priorities in high-pressure situations.


Preferred Qualifications:

  • Experience with observability frameworks like OpenTelemetry.
  • Familiarity with ITIL frameworks and best practices in incident and problem management.
  • Background in enterprise environments, navigating corporate processes and bureaucracy.
  • Security experience in areas like RBAC, least privilege principles, and secure software development lifecycle processes.
  • Certifications in Kubernetes or relevant technologies.


What you can expect from us:

Diverse, award-winning culture and work/life benefits.

  • An inclusive culture: Empowering you to bring your true self and your best ideas to the table.
  • Learning and growth opportunities: Career development resources, skills-development training, and robust advancement opportunities.
  • Comprehensive benefits, including health, dental, and vision coverage, paid vacation, coworker stock purchase programs, tuition reimbursement, and coworker discounts.


Who we are:

We make technology work so people can do great things.

CDW is a Fortune 500 technology solutions provider to business, government, education, and healthcare organizations in the United States, Canada, and the United Kingdom. We help customers navigate and succeed in an ever-changing world by providing them with the technology advice and solutions they need—when, where, and how they need them.

CDW is an equal-opportunity employer committed to a diverse and inclusive workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

We make technology work so people can do great things.

CDW is a leading multi-brand provider of information technology solutions to business, government, education and healthcare customers in the United States, the United Kingdom and Canada. A Fortune 500 company and member of the S&P 500 Index, CDW helps its customers to navigate an increasingly complex IT market and maximize return on their technology investments. Together, we unite. Together, we win. Together, we thrive.

CDW is an equal opportunity employer. All qualified applicants will receive consideration for employment without regards to race, color, religion, sex, sexual orientation, gender identity, national origin, disability status, protected veteran status or any other basis prohibited by state and local law.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You