What happened?
One of external services, our system is using, was experiencing issues. This affected our web servers which were overloaded and finally crashed as the result of it.
What we did?
When our developers identified the issue, they fixed it by restarting failed services and servers for which it took some time to get back to the normal operating state.
Impact
During the estimated 2h time frame (12.30 to 14.30 CEST) of the incident no bookings have been able to be processed as well as our customers had issues with accessing our web and mobile application.
Learnings
In order to prevent similar incidents in the future we added additional monitoring system to the before mentioned external service as well as servers that the service is used on. With this in place, we will have a better overview in case similar issue happens again and this will allow us to act faster and minimize the impact this has on our customers.
We apologize for any inconvenience this might have caused you.