Position Summary... You are right for the job if you are comfortable with System design, Architecture, deep technical Linux, networking topics, and distributed architectures. You will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You will excel if you have enthusiasm for digging deep, and a flare for sharp technical communication, prioritization, and organization. You will work directly with our Software Engineering teams to build our next generation always up cloud-based e-commerce/Retail and Enterprise platform.
What youll do... About the team
Site Reliability and Engineering group focuses on producing mission-critical infrastructure, tools, and processes that will ensure highest levels of availability and reliability of all our websites. SRE s drives standardization and service focused instrumentation, provides subject matter expertise, resolves break/fix scenarios, engaging broader teams as necessary; and partners/leads to achieve continuous improvement. In addition SRE s contributes to command-and-control related activities focused on restoration of complex outages, and rapid restoration.
What You ll Do
Programming/Tooling and Automation experience in one or more of the following languages: Golang, Java, Python, Typescript, Node and Shell . Good understanding of Kafka internals , SQL/noSQL databases like Cassandra , Elasticsearch and Postgress and In-Memory Caching frameworks like Memcached . Influence, design and create new architectures, standards, and methods for large-scale enterprise systems. Design, write and build tools to improve the reliability, latency, availability and scalability of Walmart e-commerce/Retail and Enterprise products.
- Engender reliability and availability starting with metrics and measurements.
- Enable scaling by providing tools, developing training and/or augmenting processes.
- Build tools/automate to prevent re-occurrence of problem to mission critical products/services.
Participate in capacity planning, demand forecasting, software performance analysis and system tuning. Engage with enterprise and business/infrastructure functions to establish, track, and optimize operational metrics and targets in line with SRE principles (SLO/SLI, Latency percentiles , error budgets, tech debt and setup alert guidelines )
What You ll bring
Bachelors Degree or Master s Degree with 12+ years of experience in Computer Science or related field. Proficiency in any of the programming languages like Java, GoLang, etc Experience in designing, investigating, analysing, and troubleshooting large-scale enterprise systems. Methodical and systematic problem-solving approach, combined with a solid awareness of ownership, initiative, and drive. Fluency with running services at scale; In depth understanding of Unix systems internals and networking. Experience with IaaS and PaaS providers such as AWS, AZURE OpenStack, GCP Experience with containerisation and container platforms. (e.g., Docker, Kubernetes, Docker EE, OpenShift, Mesosphere). Experience with enterprise monitoring solutions like AppDynamics, New Relic, Prometheus, Graphite, Grafana, Nagios, Sensu and Splunk
.
.
.
.
.
.
Equal Opportunity Employer
Walmart, Inc., is an Equal Opportunities Employer - By Choice. We believe we are best equipped to help our associates, customers and the communities we serve live better when we really know them. That means understanding, respecting and valuing unique styles, experiences, identities, ideas and opinions - while being inclusive of all people.
Minimum Qualifications...
Minimum Qualifications:Option 1: Bachelors degree in computer science, computer engineering, computer information systems, software engineering, or related area and 4 years experience in software engineering or related area.Option 2: 6 years experience in software engineering or related area.
Preferred Qualifications...
Master s degree in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, or related area and 2 years experience in software engineering or related area
Primary Location...