Something Wrong with Facebook 2019
By
Ega Wahyudi
—
Wednesday, January 1, 2020
—
What's Wrong With Facebook
Something Wrong With Facebook
The crucial defect that created this interruption to be so serious was a regrettable handling of a mistake condition. An automated system for confirming setup values wound up causing much more damage than it fixed.
The intent of the automated system is to check for setup values that are invalid in the cache and replace them with updated values from the consistent shop. This works well for a short-term trouble with the cache, yet it does not function when the consistent store is void.
Today we made a modification to the relentless copy of an arrangement worth that was taken invalid. This implied that every single client saw the invalid worth and also tried to repair it. Because the repair includes making a query to a collection of data sources, that collection was quickly bewildered by thousands of countless inquiries a second.
To make matters worse, each time a client got a mistake attempting to inquire one of the databases it translated it as an invalid worth, as well as erased the corresponding cache trick. This implied that even after the original issue had actually been taken care of, the stream of questions continued. As long as the data sources failed to service a few of the demands, they were creating even more requests to themselves. We had gotten in a feedback loophole that didn't permit the data sources to recuperate.
The means to stop the feedback cycle was quite uncomfortable - we needed to stop all web traffic to this data source collection, which suggested shutting off the website. As soon as the databases had recouped and the source had actually been dealt with, we gradually allowed more people back onto the site.
This obtained the site back up and also running today, as well as for now we've shut off the system that tries to correct setup values. We're checking out new designs for this arrangement system complying with style patterns of various other systems at Facebook that deal even more beautifully with feedback loopholes and also short-term spikes.
We ask forgiveness again for the website interruption, and also we want you to understand that we take the performance and integrity of Facebook very seriously.