You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/reliability/reliability-ai-search.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -81,7 +81,7 @@ When an availability zone experiences an outage, your search service continues t
81
81
82
82
+**Traffic rerouting**: When a zone fails, Azure AI Search detects the failure and routes requests to active replicas in the surviving zones.
83
83
84
-
### Failback
84
+
### Zone recovery
85
85
86
86
When the availability zone recovers, Azure AI Search automatically restores normal operations and begins routing traffic to available replicas across all zones, including the recovered zone.
Copy file name to clipboardExpand all lines: articles/reliability/reliability-aks.md
+6-2Lines changed: 6 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -114,7 +114,11 @@ There's no extra charge to enable availability zone support in AKS. You pay for
114
114
115
115
AKS also attempts to rebalance the pods across the healthy zones. If you choose to manually scale your node pool in a zone-down scenario, your pods might remain in the *Pending* state when there are no nodes available in the healthy zones. Scaling out in the remaining zones is also subject to the availability of quota and capacity for the VM SKU that you use.
116
116
117
-
-**Notification:** AKS doesn't notify you when a zone is down. You can use your node or pod health metrics to monitor the health of your nodes and pods.
117
+
-**Notification:** AKS doesn't notify you when a zone is down. However, you can use [Azure Resource Health](/azure/service-health/resource-health-overview) to monitor for the health of your cluster. You can also use [Azure Service Health](/azure/service-health/overview) to understand the overall health of the AKS service, including any zone failures.
118
+
119
+
Set up alerts on these services to receive notifications of zone-level problems. For more information, see [Create Service Health alerts in the Azure portal](/azure/service-health/alerts-activity-log-service-notifications-portal) and [Create and configure Resource Health alerts](/azure/service-health/resource-health-alert-arm-template-guide).
120
+
121
+
You can also use your node or pod health metrics to monitor the health of your nodes and pods.
118
122
119
123
-**Active requests:** Any active requests might experience disruptions. Some requests can fail, and latency might increase while your workload fails over to another zone.
120
124
@@ -126,7 +130,7 @@ There's no extra charge to enable availability zone support in AKS. You pay for
126
130
127
131
For more information, see [Zone resiliency considerations for AKS](/azure/aks/aks-zone-resiliency).
128
132
129
-
### Failback
133
+
### Zone recovery
130
134
131
135
When the availability zone recovers, failback behavior depends on the component:
Copy file name to clipboardExpand all lines: articles/reliability/reliability-api-management.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -197,9 +197,9 @@ This section describes what to expect when API Management instances are configur
197
197
198
198
-*Zonal:* For zonal instances, when a zone is unavailable, your instance is unavailable. If you have a secondary instance in another availability zone, you're responsible for rerouting traffic to that secondary instance.
199
199
200
-
### Failback
200
+
### Zone recovery
201
201
202
-
The failback behavior depends on the availability zone configuration that your instance uses.
202
+
The zone recovery behavior depends on the availability zone configuration that your instance uses.
203
203
204
204
-**Automatic and zone-redundant:** For instances that are configured to use automatic availability zone support or are manually configured to use zone redundancy, when the availability zone recovers, API Management automatically restores units in the availability zone and reroutes traffic between your units as normal.
205
205
@@ -294,7 +294,7 @@ This section describes what to expect when API Management instances are configur
294
294
295
295
-**Traffic rerouting:** If a region goes offline, API requests are automatically routed around the failed region to the next closest gateway.
296
296
297
-
### Failback
297
+
### Region recovery
298
298
299
299
When the primary region recovers, API Management automatically restores units in the region and reroutes traffic between your units.
Copy file name to clipboardExpand all lines: articles/reliability/reliability-application-gateway-v2.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -177,9 +177,9 @@ The following section describes what to expect when Application Gateway v2 is co
177
177
178
178
-*Zonal:* When a zone is unavailable, your gateway is unavailable. If you have a secondary gateway in another availability zone, you're responsible for rerouting traffic to that secondary gateway.
179
179
180
-
### Failback
180
+
### Zone recovery
181
181
182
-
The failback behavior depends on the availability zone configuration that your gateway uses:
182
+
The zone recovery behavior depends on the availability zone configuration that your gateway uses:
183
183
184
184
-*Zone-redundant:* When the affected availability zone recovers, Application Gateway automatically:
@@ -79,27 +78,27 @@ There's no additional cost to use zone redundancy for Azure Bastion.
79
78
80
79
### Configure availability zone support
81
80
82
-
**New resources:** When you deploy a new Azure Bastion resource in a [region that supports availability zones](#regions-supported), you select the specific zones that you want to deploy to. For zone redundancy, you must select multiple zones.
83
-
84
-
>[!IMPORTANT]
85
-
> You can't change the availability zone setting after you deploy your Azure Bastion resource.
81
+
-**New resources:** When you deploy a new Azure Bastion resource in a [region that supports availability zones](#regions-supported), you select the specific zones that you want to deploy to. For zone redundancy, you must select multiple zones.
86
82
87
-
[!INCLUDE [Availability zone numbering](./includes/reliability-availability-zone-numbering-include.md)]
83
+
[!INCLUDE [Availability zone numbering](./includes/reliability-availability-zone-numbering-include.md)]
88
84
89
-
**Migration:** It's not possible to change the availability zone configuration of an existing Azure Bastion resource. Instead, you need to create an Azure Bastion resource with the new configuration and delete the old one.
85
+
-**Existing resources:** It's not possible to change the availability zone configuration of an existing Azure Bastion resource. Instead, you need to create an Azure Bastion resource with the new configuration and delete the old one.
90
86
91
87
### Normal operations
92
88
93
89
This section describes what to expect when Azure Bastion resources are configured for availability zone support and all availability zones are operational.
94
90
95
-
**Traffic routing between zones:** When you initiate an SSH or RDP session, it can be routed to an Azure Bastion instance in any of the availability zones you selected.
91
+
-**Traffic routing between zones:** When you initiate an SSH or RDP session, it can be routed to an Azure Bastion instance in any of the availability zones you selected.
96
92
97
-
If you configure zone redundancy on Azure Bastion, a session might be sent to an Azure Bastion instance in an availability zone that's different from the virtual machine you're connecting to. In the following diagram, a request from the user is sent to an Azure Bastion instance in zone 2, although the virtual machine is in zone 1:
93
+
If you configure zone redundancy on Azure Bastion, a session might be sent to an Azure Bastion instance in an availability zone that's different from the virtual machine you're connecting to. In the following diagram, a request from the user is sent to an Azure Bastion instance in zone 2, although the virtual machine is in zone 1:
98
94
95
+
<!-- Art Library Source# ConceptArt-0-000-015- -->
96
+
:::image type="content" source="./media/bastion/bastion-instance-zone-traffic.png" alt-text="Diagram that shows Azure Bastion with three instances. A user request goes to an Azure Bastion instance in zone 2 and is sent to a VM in zone 1." border="false":::
99
97
100
-
:::image type="content" source="./media/bastion/bastion-instance-zone-traffic.png" alt-text="Diagram that shows Azure Bastion with three instances. A user request goes to an Azure Bastion instance in zone 2 and is sent to a VM in zone 1." border="false":::
98
+
>[!TIP]
99
+
>In most scenarios, the amount of cross-zone latency isn't significant. However, if you have unusually stringent latency requirements your workloads, you should deploy a dedicated single-zone Azure Bastion instance in the virtual machine's availability zone. Keep in mind that this configuration doesn't provide zone redundancy, and we don't recommend it for most customers.
101
100
102
-
In most scenarios, the small amount of cross-zone latency isn't significant. However, if you have unusually stringent latency requirements for your Azure Bastion workloads, you should deploy a dedicated single-zone Azure Bastion instance in the virtual machine's availability zone. This configuration doesn't provide zone redundancy, and we don't recommend it for most customers.
101
+
-**Data replication between zones:** Because Azure Bastion doesn't store state, there's no data to replicate between zones.
103
102
104
103
### Zone-down experience
105
104
@@ -113,13 +112,9 @@ This section describes what to expect when an Azure Bastion resource is configur
113
112
114
113
-**Traffic rerouting:** When you use zone redundancy, new connections use Azure Bastion instances in the surviving availability zones. Overall, Azure Bastion remains operational.
115
114
116
-
### Failback
117
-
118
-
When the availability zone recovers, Azure Bastion:
115
+
### Zone recovery
119
116
120
-
- Automatically restores instances in the availability zone.
121
-
- Removes any temporary instances created in the other availability zones.
122
-
- Reroutes traffic between your instances as normal.
117
+
When the availability zone recovers, Azure Bastion automatically restores instances in the availability zone, and reroutes traffic between your instances as normal.
123
118
124
119
### Testing for zone failures
125
120
@@ -135,7 +130,7 @@ If you have a disaster recovery site in another Azure region, be sure to deploy
135
130
136
131
## Service-level agreement
137
132
138
-
The service-level agreement (SLA) for Azure Bastion describes the expected availability of the service and the conditions that must be met to achieve that availability expectation. To understand those conditions, it's important that you review the [SLA for Online Services](https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services).
Copy file name to clipboardExpand all lines: articles/reliability/reliability-container-registry.md
+22-10Lines changed: 22 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,7 @@ author: chasedmicrosoft
6
6
ms.topic: reliability-article
7
7
ms.custom: subject-reliability
8
8
ms.service: azure-container-registry
9
-
ms.date: 07/23/2025
9
+
ms.date: 08/22/2025
10
10
#Customer intent: As an engineer responsible for business continuity, I want to understand the details of how Azure Container Registry works from a reliability perspective and plan disaster recovery strategies in alignment with the exact processes that Azure services follow during different kinds of situations.
11
11
---
12
12
@@ -73,14 +73,18 @@ For client applications that use Container Registry, implement appropriate retry
73
73
74
74
Zone redundancy protects your container registry against single zone failures by distributing registry data and operations across multiple availability zones within the region. Container image pull and push operations continue to function during zone outages, with automatic failover to healthy zones.
75
75
76
-
Zone redundancy is enabled by default for all Azure Container Registries in regions that support availability zones, making your resources more resilient automatically and at no additional cost. This enhancement applies to all SKUs including Basic and Standard and has been rolled out to both new and existing registries in supported regions.
76
+
Zone redundancy is enabled by default for all registries in regions that support availability zones, making your resources more resilient automatically and at no additional cost. This enhancement applies to all service tiers, including Basic and Standard, and has been applied to both new and existing registries.
77
77
78
-
>[!IMPORTANT]
79
-
>The Azure portal and CLI may not yet reflect the zone redundancy update accurately. The `zoneRedundancy` property in your registry’s configuration might still show as false even though zone redundancy is active for all registries in supported regions. We’re actively updating the portal and API surfaces to reflect this default behavior more transparently. All previously enabled features will continue to function as expected.
78
+
> [!IMPORTANT]
79
+
> The Azure portal and other tooling might not yet reflect the zone redundancy update accurately.
80
+
>
81
+
> The `zoneRedundancy` property in your registry’s configuration might still show as `false`, but zone redundancy is active for all registries in supported regions.
82
+
>
83
+
> We're actively updating the portal and API surfaces to reflect this default behavior more transparently. All previously enabled features continue to function as expected.
80
84
81
85
### Region support
82
86
83
-
Zone-redundant registries can only be deployed into [a region that supports availability zones](./regions-list.md).
87
+
Zone-redundant registries can be deployed into [any region that supports availability zones](./regions-list.md).
84
88
85
89
### Considerations
86
90
@@ -98,7 +102,7 @@ Zone redundancy is included with container registries at no extra cost.
98
102
99
103
- To migrate your artifacts between registries, you can [create a transfer pipeline](/azure/container-registry/container-registry-transfer-prerequisites). Alternatively, you can [import container images to a container registry](/azure/container-registry/container-registry-import-images).
100
104
101
-
- If your registry uses [geo-replication](#multi-region-support) in a region that supports Availablity Zones. Your replica will be zoneredundant by default. For more information, see [Create a zone-redundant replica in Container Registry](/azure/container-registry/zone-redundancy-replica). After a geo-replication is created, you can only change the zone redundancy setting by deleting and recreating the replication.
105
+
- If your registry uses [geo-replication](#multi-region-support) in a region that supports availability zones, the replica in that region will be zone-redundant automatically. For more information, see [Create a zone-redundant replica in Container Registry](/azure/container-registry/zone-redundancy-replica). After a geo-replication is created, you can only change the zone redundancy setting by deleting and recreating the replication.
102
106
103
107
-**Disable zone redundancy.** Zone redundancy can't be disabled.
104
108
@@ -122,7 +126,11 @@ When a zone becomes unavailable, Container Registry automatically handles the fa
122
126
123
127
-**Detection and response:** The Container Registry platform automatically detects failures in an availability zone and initiates a response. The service automatically routes traffic to the remaining healthy zones. No manual intervention is required to initiate a zone failover.
124
128
125
-
-**Notification:** Zone failure events can be monitored through Azure Service Health and through registry availability metrics in Azure Monitor. Set up alerts on these services to receive notifications about zone-level problems.
129
+
-**Notification**: Azure Container Registry doesn't notify you when a zone is down. However, you can use [Azure Service Health](/azure/service-health/overview) to understand the overall health of the Azure Container Registry service, including any zone failures.
130
+
131
+
Set up alerts to receive notifications of zone-level problems. For more information, see [Create Service Health alerts in the Azure portal](/azure/service-health/alerts-activity-log-service-notifications-portal).
132
+
133
+
You can also monitor registry availability metrics in Azure Monitor.
126
134
127
135
-**Active requests:** When an availability zone is unavailable, any requests in progress that are connected to resources in the faulty availability zone are terminated. They need to be retried.
128
136
@@ -132,7 +140,7 @@ When a zone becomes unavailable, Container Registry automatically handles the fa
132
140
133
141
-**Traffic rerouting:** The platform automatically reroutes traffic to healthy zones without requiring you to make any configuration changes.
134
142
135
-
### Failback
143
+
### Zone recovery
136
144
137
145
When the affected availability zone recovers, Container Registry automatically distributes operations across all available zones, including the recovered zone. The service rebalances traffic and data distribution without requiring manual intervention or causing service disruption.
138
146
@@ -206,7 +214,11 @@ When a region becomes unavailable, container operations can continue to use alte
206
214
207
215
-**Detection and response:** Container Registry monitors the health of each regional replica and is responsible for redirecting traffic to another region.
208
216
209
-
-**Notification:** Region health can be monitored through Azure Service Health. Set up alerts to receive notifications of region-level problems. You can also monitor registry availability metrics for each regional endpoint to detect problems.
217
+
-**Notification**: Azure Container Registry doesn't notify you when a region is down. However, you can use [Azure Service Health](/azure/service-health/overview) to understand the overall health of the Azure Container Registry service, including any region failures.
218
+
219
+
Set up alerts to receive notifications of region-level problems. For more information, see [Create Service Health alerts in the Azure portal](/azure/service-health/alerts-activity-log-service-notifications-portal).
220
+
221
+
You can also monitor registry availability metrics for each regional endpoint in Azure Monitor.
210
222
211
223
-**Active requests:** Any active requests currently in flight to an unavailable region will fail and must be retried so that they can be directed to a healthy region.
212
224
@@ -218,7 +230,7 @@ When a region becomes unavailable, container operations can continue to use alte
218
230
219
231
-**Traffic rerouting:** When a region becomes unavailable, container operations are automatically routed to another replica in a healthy region. Clients don't need to change the endpoint in which they interact with the registry. Microsoft automatically handles routing, failover, and failback.
220
232
221
-
### Failback
233
+
### Region recovery
222
234
223
235
When a region recovers, data plane operations automatically resume for that regional endpoint through Traffic Manager routing. The service synchronizes any changes that occur during the outage by using asynchronous replication with eventual consistency.
0 commit comments