Incident Date: December 2nd 2024
Incident Duration: Approximately 5 hours (intermittently)
Affected Services: TrekkSoft Backoffice, POS Desk, TrekkSoft Mobile App (mPOS)
Incident Description:
On Monday, Dec 2nd, at 12:13 PM CET, system slowness was reported, leading to an inability to process bookings. Earlier that morning, an alert indicated web response times exceeding 1.0 seconds for over 10 minutes.
Impact:
Investigation revealed it was caused by resource-heavy queries, making the system unresponsive.
Resolution:
This issue coincided with increased demand for year-end reports, likely contributing to the slowdown. Queues for reporting and scheduling were overloaded, with pending messages being gradually consumed after being moved to temporary queues.
Preventive Measures:
Further monitoring and optimization are ongoing to prevent similar issues.