Azure Core builds the foundational platform for Microsoft Azure across compute, storage, networking, management, and resilience. The teams focus on delivering platform quality, availability, capacity, efficiency, and customer-facing control planes that are highly reliable and scalable.The Azure Core Business Continuity and Disaster Recovery (BCDR) team provides end-to-end protection and recovery for infrastructure as a service (IaaS), databases, and cloud-native services. Our investments and roadmaps include backup, site recovery, cross-region and zonal resiliency, ransomware recovery, and large-scale management. The team partners across Azure Core and collaborates with field and customer teams to raise the resiliency baseline and deliver unified management experiences.This role offers the opportunity to work on critical technologies that ensure business continuity and security for customers worldwide, while contributing to innovation in cloud resiliency and recovery solutions.Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
- Own a customer problem area in BCDR (e.g., zonal/region recovery workflows, ransomware safe backups, posture & drills) and deliver clear PRDs, OKRs, and roadmaps aligned to Azure Core quality and resiliency goals.
- Translate customer/regulatory needs (RTO/RPO, test failover, evidence) into platform features and guidance, partnering across product groups and other stakeholders like Finance, Field, Business Planning, Support.
- Drive end to end scenario design for backup, restore, replication and recovery—including cross zone/region patterns—and validate with telemetry, experiments, and customer previews.
- Partner closely with engineering to plan, deliver and safely roll out features using Azure Core safe deployment practices; ensure SLI/SLO definitions and instrumentation support customer outcomes.
- Own livesite quality for your scenarios, and post incident learning to improve patterns and product guardrails.
- Land the product with customers and field: content, demos, and best practice guidance aligned to the Well Architected reliability pillar and BCDR essentials.
- Collaborate cross Azure (Compute, Storage, Networking, Security/Defender, Data) to unblock dependencies and deliver cohesive experiences at scale. Contribute to PM craft: crisp writing, storyboards, data informed decisions, and inclusive collaboration.
Qualifications
Required Qualifications:
- Bachelor's Degree AND 2+ years' experience in product/service/project/program management or software development
- OR equivalent experience.
- 1+ year(s) experience with shipping cloud platform or enterprise features end to end (backlog→PRD→delivery) with engineering/design and measurable customer impact.
- 1+ year(s) experience of BCDR or reliability concepts (e.g., RTO/RPO, test failover, cross zone/region patterns) and ability to translate them into customer experiences and docs.
- 1+ year(s) experience with data driven decision making (telemetry/experiments, field/customer feedback) and strong written communication.
Other Qualifications
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications
- Bachelor's Degree AND 4+ years of experience in product/service/project/program management
- OR software development
- OR equivalent experience.
- 1+ year(s) experience with Azure Backup, Azure Site Recovery, or adjacent resiliency experiences and security & ransomware recovery concepts (immutability, clean room recovery, threat informed restores).
- 1+ year(s) experience with IaaS/AKS/Database scenarios or platform services that require high availability and disaster recovery.
- 1+ year(s) experience collaborating across Compute/Storage/Networking/Security platform teams and with field/customer stakeholders.
#azurecorejobs
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.