Analyzing the GitHub outage

added by DotNetKicks
11/12/2018 2:30:00 PM


The next part I find really interesting is that the system GitHub uses for managing topologies is not consistent, even though it is required to be. The problem is the inherent delay between the moment a failure actually occurs and the moment their orchestrator reorganizes the cluster in response. During that window, the published topology no longer matches the real state of the cluster.
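To make that window concrete, here is a minimal toy sketch in Python. It is not GitHub's orchestrator or its real topology. The node names (`db-east`, `db-west`), the `Cluster` class, and the tick-based `simulate` function are all hypothetical, chosen only to show how writes keep following a stale topology until the reorganization step runs.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Cluster:
    """Toy model: the published topology can lag behind the real node state."""
    # Hypothetical node names; this is what the orchestrator last published.
    primary_in_topology: str = "db-east"
    healthy: Dict[str, bool] = field(
        default_factory=lambda: {"db-east": True, "db-west": True}
    )

    def fail(self, node: str) -> None:
        # The failure takes effect immediately; the topology is not touched.
        self.healthy[node] = False

    def reorganize(self) -> None:
        # Orchestrator step: only runs once the failure has been detected.
        if not self.healthy[self.primary_in_topology]:
            self.primary_in_topology = next(
                name for name, ok in self.healthy.items() if ok
            )

    def route_write(self) -> str:
        # Clients trust the published topology, whether or not it is stale.
        target = self.primary_in_topology
        return f"ok:{target}" if self.healthy[target] else f"misrouted:{target}"


def simulate(detection_delay_ticks: int, total_ticks: int = 5) -> List[str]:
    """Fail the primary at tick 1 and reorganize after the given delay."""
    cluster = Cluster()
    results = []
    for tick in range(total_ticks):
        if tick == 1:
            cluster.fail("db-east")
        if tick == 1 + detection_delay_ticks:
            cluster.reorganize()
        results.append(cluster.route_write())
    return results


if __name__ == "__main__":
    print(simulate(detection_delay_ticks=2))
```

With a delay of two ticks, the writes issued at ticks 1 and 2 are still routed to the failed node. With a delay of zero, none are. The length of that gap is exactly the inconsistency described above.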