Skip to content

Commit fd15d90

Browse files
authored
Clarify deployment patterns and availability terms
Updated descriptions of deployment patterns for clarity and consistency in availability terminology.
1 parent f2d58ca commit fd15d90

1 file changed

Lines changed: 3 additions & 23 deletions

File tree

learn-pr/wwl-azure/implement-resilient-ai-ready-infrastructure/includes/2-configure-microsoft-foundry-hubs-region.md

Lines changed: 3 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -12,11 +12,11 @@ The hub itself doesn't replicate data between regions—that responsibility belo
1212

1313
## Deployment patterns for production resilience
1414

15-
Three deployment patterns address different availability requirements and cost constraints. A single-region hub provides 99.9% availability based on Azure's service-level agreement for individual components, suitable for development environments where temporary unavailability during regional outages is acceptable. With this approach, recovery requires manual intervention to recreate resources in a different region and restore data from backups, typically taking several hours.
15+
Three deployment patterns address different availability requirements and cost constraints. A single-region hub provides high availability based on Azure's individual components suitable for development environments where temporary unavailability during regional outages is acceptable. With this approach, recovery requires manual intervention to recreate resources in a different region and restore data from backups, typically taking several hours.
1616

17-
Active-passive deployment creates a secondary hub in a standby region with replicated storage and container images but minimal or no compute resources provisioned until failover occurs. This pattern achieves 99.95% availability by maintaining infrastructure readiness while optimizing costs—you only pay for compute in the secondary region when you scale up during an outage. Your application layer monitors primary hub health and switches traffic to the secondary hub when it detects failures, then provisions compute resources on demand to handle the redirected workload.
17+
Active-passive deployment creates a secondary hub in a standby region with replicated storage and container images but minimal or no compute resources provisioned until failover occurs. This pattern achieves high availability by maintaining infrastructure readiness while optimizing costs—you only pay for compute in the secondary region when you scale up during an outage. Your application layer monitors primary hub health and switches traffic to the secondary hub when it detects failures, then provisions compute resources on demand to handle the redirected workload.
1818

19-
Active-active deployment runs production workloads simultaneously across both hubs with full compute capacity in each region. This pattern delivers 99.99% availability and reduces latency for global users by routing requests to the nearest hub, but costs double because you maintain full infrastructure in both regions continuously. Organizations choose this pattern when they need geographic load distribution for regulatory compliance, data residency requirements, or minimal-latency access from multiple continents. At the same time, active-active increases operational complexity because you must coordinate model deployments and configuration changes across both hubs to maintain consistency.
19+
Active-active deployment runs production workloads simultaneously across both hubs with full compute capacity in each region. This pattern delivers high availability and reduces latency for global users by routing requests to the nearest hub, but costs double because you maintain full infrastructure in both regions continuously. Organizations choose this pattern when they need geographic load distribution for regulatory compliance, data residency requirements, or minimal-latency access from multiple continents. At the same time, active-active increases operational complexity because you must coordinate model deployments and configuration changes across both hubs to maintain consistency.
2020

2121
:::image type="content" source="../media/geographic-load-distribution-regulatory-compliance.png" alt-text="Diagram showing how the active-active approach increases operational stability.":::
2222

@@ -28,26 +28,6 @@ During failover, the secondary hub uses its connections to geo-replicated storag
2828

2929
:::image type="content" source="../media/foundry-multi-region-hub-architecture.png" alt-text="Diagram showing how dependent resource connections and failover routing between primary and secondary regions.":::
3030

31-
```mermaid
32-
graph TD
33-
A[Microsoft Foundry Hub - Primary Region] -->|Contains| B[AI Projects]
34-
A -->|Connects to| C[Azure Storage Account]
35-
A -->|Connects to| D[Azure Container Registry]
36-
A -->|Connects to| E[Compute Clusters]
37-
F[Microsoft Foundry Hub - Secondary Region] -->|Contains| G[AI Projects]
38-
F -->|Connects to| H[Geo-Replicated Storage]
39-
F -->|Connects to| I[Geo-Replicated ACR]
40-
F -->|Connects to| J[Compute Clusters]
41-
K[Application Layer] -->|Routes requests| A
42-
K -->|Failover routes| F
43-
C -.->|Replicates data| H
44-
D -.->|Replicates images| I
45-
```
46-
47-
*Microsoft Foundry multi-region hub architecture showing dependent resource connections and failover routing between primary and secondary regions*
48-
49-
Alt text: Architecture diagram showing two Microsoft Foundry hubs deployed in primary and secondary regions. The primary hub in the top section contains AI projects and connects to storage account, container registry, and compute clusters in the same region. The secondary hub in the bottom section mirrors this structure with its own AI projects and regional resources. Dotted lines show data replication flowing from primary storage to geo-replicated storage and from primary container registry to geo-replicated registry. An application layer at the bottom routes inference requests to the primary hub with a failover path to the secondary hub when needed.
50-
5131
## More resources
5232

5333
- [Microsoft Foundry hub management](/azure/ai-studio/how-to/create-azure-ai-resource) - Step-by-step guide for creating and configuring hubs with dependent resources

0 commit comments

Comments
 (0)