Position Summary...We are looking for an experienced Senior Software Engineer (AI/ML) with a strong foundation in software engineering, distributed systems, and cloud-native technologies. The ideal candidate will design and implement AI/ML-driven solutions that power intelligent observability, real-time monitoring, and autonomous remediation across large-scale systems.What you'll do...
About The Team
We are building a 
unified observability platform
 that delivers 
360 visibility
 across distributed systems with minimal instrumentation overhead. The platform seamlessly integrates with existing environments and workflows, leveraging 
AI-driven insights
 to 
detect, predict, and resolve issues
 in real time.Our goal is to enable 
self-healing systems
 through 
AI agents
 that autonomously diagnose and trigger remediation actions with minimal human intervention.
What You'll Do
Core Development
- Design, develop, and deploy scalable AI/ML models for anomaly detection, forecasting, and root-cause analysis. 
- Build and optimize real-time inference APIs and services integrating ML pipelines into production. 
- Develop data pipelines for large-scale telemetry, logs, metrics, and traces using event-driven architectures. 
- Automate model training, evaluation, and deployment pipelines (MLOps). 
- Continuously monitor model performance and optimize for accuracy, latency, and cost. 
- Work closely with platform and SRE teams to build AI-powered automation and observability workflows. 
 
Software & Systems Engineering
- Build high-performance backend systems using Golang and modern design patterns. 
- Architect distributed and fault-tolerant systems with strong fundamentals in concurrency, scalability, and resilience. 
- Design multi-cloud applications using Kubernetes, Docker, and infrastructure-as-code tools. 
- Implement service discovery, load balancing, and failure recovery mechanisms. 
- Contribute to CI/CD, observability, and automation frameworks for production systems. 
 
Data & Messaging
- Design data flows using Kafka, Pub/Sub, or similar event streaming platforms. 
- Work with SQL (PostgreSQL/MySQL) and NoSQL (MongoDB, Cassandra, ClickHouse) databases for structured and unstructured data. 
- Implement efficient data serialization, compression, and query optimization for large-scale data. 
 
Collaboration & Technical Leadership
- Collaborate with SRE, DevOps, and Product teams to integrate AI/ML features into observability workflows. 
- Write clear design documents, architecture diagrams, and technical proposals. 
- Contribute to long-term technical strategy and roadmap decisions. 
- Mentor junior engineers on best practices in backend, ML systems, and distributed computing. 
 
What You'll bring
(3 to 5 brief pointers about the qualifications, exposures and experiences required for the role)
Experience & Education
- 510 years of software engineering experience, including 24 years in AI/ML engineering. 
- Proven experience deploying ML models end-to-end (data ingestion ? training ? inference ? monitoring). 
- Strong coding skills in Golang (or Python with willingness to learn Go). 
- Bachelor's or Master's degree in Computer Science, Engineering, or related field. 
- Strong understanding of algorithms, data structures, and system design. 
 
AI/ML Expertise (Expert Level)
- Experience with ML frameworks such as TensorFlow, PyTorch, or Scikit-learn. 
- Hands-on experience with time-series modeling, anomaly detection, or forecasting. 
- Exposure to LLMs, RAG pipelines, or agentic workflows for automation. 
- Familiarity with MLOps tools like Kubeflow, MLflow, Vertex AI, or SageMaker. 
 
Data & Messaging Systems
- Proficiency with Kafka, Pub/Sub, or similar distributed messaging systems. 
- Hands-on with SQL/NoSQL databases and schema design for performance at scale. 
 
Software Engineering Best Practices
- Expertise in designing RESTful or gRPC APIs and scalable microservices. 
- Strong focus on testing, CI/CD pipelines, and production readiness. 
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry). 
 
Nice To Have Skills
- Experience in real-time observability, AIOps, or incident management platforms. 
- Knowledge of distributed consensus (Raft, Paxos) and event sourcing. 
- Contributions to open-source ML, observability, or infrastructure projects. 
- Familiarity with LLM orchestration frameworks (LangChain, Haystack, Semantic Kernel). 
 
About Walmart Global Tech
Imagine working in an environment where one line of code can make life easier for hundreds of millions of people. That's what we do at Walmart Global Tech. We're a team of software engineers, data scientists, cybersecurity expert's and service professionals within the world's leading retailer who make an epic impact and are at the forefront of the next retail disruption. People are why we innovate, and people power our innovations. We are people-led and tech-empowered.We train our team in the skillsets of the future and bring in experts like you to help us grow. We have roles for those chasing their first opportunity as well as those looking for the opportunity that will define their career. Here, you can kickstart a great career in tech, gain new skills and experience for virtually every industry, or leverage your expertise to innovate at scale, impact millions and reimagine the future of retail.
Flexible, hybrid work
We use a hybrid way of working with primary in office presence coupled with an optimal mix of virtual presence. We use our campuses to collaborate and be together in person, as business needs require and for development and networking opportunities. This approach helps us make quicker decisions, remove location barriers across our global team, be more flexible in our personal lives.
Benefits
Beyond our great compensation package, you can receive incentive awards for your performance. Other great perks include a host of best-in-class benefits maternity and parental leave, PTO, health benefits, and much more.
Belonging
We aim to create a culture where every associate feels valued for who they are, rooted in respect for the individual. Our goal is to foster a sense of belonging, to create opportunities for all our associates, customers and suppliers, and to be a Walmart for everyone.At Walmart, our vision is everyone included. By fostering a workplace culture where everyone isand feelsincluded, everyone wins. Our associates and customers reflect the makeup of all 19 countries where we operate. By making Walmart a welcoming place where all people feel like they belong, we're able to engage associates, strengthen our business, improve our ability to serve customers, and support the communities where we operate.
Equal Opportunity Employer
Walmart, Inc., is an Equal Opportunities Employer  By Choice. We believe we are best equipped to help our associates, customers and the communities we serve live better when we really know them. That means understanding, respecting and valuing unique styles, experiences, identities, ideas and opinions  while being inclusive of all people.
Minimum Qualifications...
Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. 
Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 3 years experience in software engineering or related area.Option 2: 5 years experience in software engineering or related area.
Preferred Qualifications...
Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications. 
Master's degree in computer science, information technology, engineering, information systems, cybersecurity, or related area and 1 year's experience leading information security or cybersecurity projects, We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart's accessibility standards and guidelines for supporting an inclusive culture.Information Technology - CISCO Certification - Certification
Primary Location...
G, 1, 3, 4, 5 Floor, Building 11, Sez, Cessna Business Park, Kadubeesanahalli Village, Varthur Hobli , India R-2322658