It happens pretty rarely, but sometimes our platform is affected too by operational faults and malfunctions. Of course we try to keep these hiccups to an absolute minimum. Our content is stored and supplied from two different data processing centers for instance, which means that our users can be re-routed from one to the other dependently on demand and performance. All systems have a failover cluster operation guaranteeing high performance and stability, as well as having sufficient reserves to cope better with peak loads. Making sure our platform is both available and fast is our number one priority.
Sometimes though, like last Wednesday’s morning for instance, we do encounter unforeseen problems. There isn’t an IT system anywhere that is 100% fault-tolerant – there will always be systematic single points of failure that will send part or all of the entire system crashing with them. In this latest case a defect network component corrupted a mechanism, which was specifically required to safeguard against failure (Spanning Tree Protocol). We will be analyzing and intercepting this error to make sure it doesn’t crop up again.
A look at the figures for the past twelve months shows that we are very close to our high targets, an average uptime ratio of 99.95%. In other words, this means that XING should not be offline for more than four hours and 2 minutes over the entire year. Or to put it yet another way: This is equivalent to around 44 seconds a day. Crashes affecting just certain parts of the platform have also been included in this statistic.
Here are the availability figures for the past twelve months:
| Month | Availability |
| Nov 08 | 99,88% |
| Dez 08 | 99,75% |
| Jan 09 | 99,60% |
| Feb 09 | 99,97% |
| Mrz 09 | 99,80% |
| Apr 09 | 99,93% |
| Mai 09 | 99,87% |
| Jun 09 | 99,96% |
| Jul 09 | 99,95% |
| Aug 09 | 99,83% |
| Sep 09 | 99,76% |
| Oct 09 | 99,91% |
| Average | 99,85% |
To form a meaningful graph from these figures the vertical axis needs to be scaled up dramatically in order to see any fluctuations at all:
Despite this record, every minute of downtime is a blow to us – as we know how important it is that our members can send messages, make contacts and network at any time. That’s why we’ll keep doing everything in our power to maximize stability and availability of our platform.
Link to this article:
http://blog.xing.com/2009/11/xing-uptimes-downtimes-%e2%80%93-and-what-we-do/trackback/






XING´s official twitter account
Leave a comment
Hi Johannes,
Your blog reminded me about a feedback from one of my client. He said, “Anup, IT is amazing when it works but big pain in the rear when it doesn’t”. Ups and downs are part of any aspect. But if you feel that you use some help, do drop me a mail and i”ll try getting our team to help you with it. Keep up the great job. Cheers!