Major cloud provider outage

Incident Report for Templafy

Postmortem

Investigation

The issue was detected at 5:09 PM CET on October 29, 2025, when monitoring and user reports indicated a complete outage affecting the Templafy Hive application.
The engineering team immediately began investigating and identified that the outage coincided with a major Microsoft Azure incident impacting Azure Front Door and Networking Infrastructure services. This outage caused disruptions in key platform dependencies, resulting in failures across both Hive and Templafy One environments.

During the investigation, several mitigation strategies were explored to bypass Azure Front Door; however, the widespread nature of the cloud provider outage also affected internal build components, preventing successful deployments.

Mitigation

The engineering team initiated mitigation efforts aimed at reducing dependency on Azure Front Door by preparing temporary routing adjustments and evaluating possible bypass configurations.
Despite these efforts, the Azure networking layer continued to experience instability, limiting the effectiveness of the mitigation steps. Before the workaround could be fully implemented, Azure Front Door services began recovering, allowing normal connectivity and operations to gradually resume.

Resolution

By 9:52 PM CET on October 29, 2025, Microsoft had restored stability across Azure Front Door and related infrastructure components.
Once recovery was confirmed, Templafy Hive operations returned to normal without additional intervention. Continuous monitoring verified that all services were fully operational, and no residual impact remained.

Post-Incident Actions

While the root cause was outside Templafy’s control, the engineering team identified opportunities to improve detection and response to third-party outages:

  • Introduce additional availability checks for external CDN and Front Door endpoints.

Impact and Scope

The outage affected all customers using Templafy Hive and Templafy One, due to dependencies on Azure’s networking and Front Door infrastructure.
The impact was global across all production clusters during the Azure service interruption window between 5:09 PM CET and 9:52 PM CET on October 29, 2025.

We sincerely apologize for the disruption caused by this event. Although the issue originated from our cloud provider, we understand the importance of continuous reliability. We are reinforcing our monitoring and resilience strategies to minimize the impact of future third-party outages and maintain the dependable service our customers expect.

Posted Oct 31, 2025 - 10:12 CET

Resolved

The incident has been resolved, and further information will be provided in a postmortem shortly.

We apologize for the impact to affected customers.
Posted Oct 30, 2025 - 10:18 CET

Update

We are continuing to monitor for any further issues.
Posted Oct 29, 2025 - 21:31 CET

Monitoring

Engineers are monitoring the recovery of Microsoft's Azure Front Door services, which is leading to gradual recovery of our services. Full access for all users is expected to resume gradually.
Posted Oct 29, 2025 - 20:18 CET

Update

We're currently investigating an issue affecting Library App builds. The issue is related to a wider Microsoft Azure outage. Our engineers are monitoring the situation and working to restore normal functionality as soon as possible.
Posted Oct 29, 2025 - 19:27 CET

Identified

We have identified an issue that affects all customers.
Further updates will be posted here soon.
Posted Oct 29, 2025 - 17:20 CET
This incident affected: Templafy Hive (Account Management, Add-in Management, AI Assistant, Authentication, Email Signature, Engage, Integrations, Library & Dynamics, Productivity, Resource Management, User Management).