Advancing failure prediction and mitigation: introducing Narya

added by DotNetKicks
3/11/2021 4:09:39 PM

"This post continues our Advancing Reliability series, highlighting initiatives underway to constantly improve the reliability of the Azure platform. In 2018, we shared steps we're taking to improve virtual machine (VM) resiliency using live migration. In 2019, we shared how we're further improving virtual machine resiliency with Project Tardigrade, which identifies host failures and recovers from them through memory-preserving soft kernel reboots."
