
Principal Software Engineer - AI Driven Configuration & Experimentation Platform

Microsoft
United States, Washington, Redmond
Oct 28, 2025
Overview
ECS (Experiments and Configuration Service) is the backbone of Microsoft's experimentation and configuration ecosystem, powering safe rollouts and controlled experimentation across M365, Copilot, and Azure. We are expanding beyond core experimentation into next-generation platforms for change inventory intelligence and AI-powered RCA agents, aiming to redefine how engineers troubleshoot, learn from incidents, and continuously improve service reliability.

As a Principal Software Engineer in AI Driven Configuration & Experimentation Platform, you will lead the design and evolution of large-scale distributed systems that empower thousands of developers across Microsoft. You'll collaborate with partner teams, influence long-term strategy, and shape the architecture for high-reliability experimentation, change management, and AI-driven operational quality.

This opportunity will allow you to: Drive company-wide impact by defining technical strategy and standards for experimentation, change inventory, and incident analysis; Partner with leaders across engineering and product to solve systemic challenges in safe rollouts, telemetry, and the automation of root cause analysis (RCA); Mentor engineers and raise the bar for engineering quality, design rigor, and AI-augmented developer experience.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Partners with appropriate stakeholders to determine user requirements for a set of scenarios.
Leads identification of dependencies and the development of design documents for a product, application, service, or platform.
Leverages subject-matter knowledge of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple groups' project plans, release plans, and work items.
Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions and working on-call to monitor the system/product/service for degradation, downtime, or interruptions.
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products, while also driving consistency in monitoring and operations at scale and sharing knowledge with other engineers.
Leads technical strategy and architecture for ECS, shaping the future of experimentation, configuration, and change intelligence platforms used across M365, Copilot, and Azure.