Skip to content

Commit 97f9cec

Browse files
authored
Merge pull request #311757 from Padmalathas/HPC-TOC-Revamp
Update HPC migration and benchmarking sections in TOC - Part 1
2 parents 4c28724 + 84785b4 commit 97f9cec

25 files changed

Lines changed: 844 additions & 209 deletions
Lines changed: 103 additions & 63 deletions
Original file line numberDiff line numberDiff line change
@@ -1,72 +1,112 @@
1-
- name: High-Performance Computing on-premises to cloud lift and shift
1+
- name: High-Performance Computing on Azure
22
href: index.yml
3-
- name: Get started
3+
############################
4+
# SECTION 1: HPC MIGRATION
5+
############################
6+
- name: Migrate HPC workloads to Azure
47
expanded: true
58
items:
6-
- name: Overview
7-
href: lift-and-shift-overview.md
8-
- name: Migration guide
9-
expanded: true
10-
items:
11-
- name: Proof-of-concept migration guide
12-
href: lift-and-shift-proof-of-concept.md
13-
- name: Production-level environment migration guide
14-
expanded: true
9+
10+
- name: Get started
11+
items:
12+
- name: Plan your on-premises HPC migration to Azure
13+
href: lift-and-shift-overview.md
14+
15+
- name: Migration guide
1516
items:
16-
- name: Overview
17-
href: lift-and-shift-production-level-overview.md
18-
- name: Deployment step 1 - Basic infrastructure
17+
- name: Build a proof-of-concept HPC cluster on Azure
18+
href: lift-and-shift-proof-of-concept.md
19+
20+
- name: Production deployment guide
1921
items:
2022
- name: Overview
2123
href: lift-and-shift-step-1-overview.md
22-
- name: Resource group
23-
href: lift-and-shift-step-1-resource-group.md
24-
- name: Network access
25-
href: lift-and-shift-step-1-networking.md
26-
- name: Storage
27-
href: lift-and-shift-step-1-storage.md
28-
- name: Deployment step 2 - Base services
29-
items:
30-
- name: Overview
31-
href: lift-and-shift-step-2-overview.md
32-
- name: Job scheduler
33-
href: lift-and-shift-step-2-job-scheduler.md
34-
- name: Resource orchestrator
35-
href: lift-and-shift-step-2-resource-orchestrator.md
36-
- name: Identity management
37-
href: lift-and-shift-step-2-identity.md
38-
- name: Accounting
39-
href: lift-and-shift-step-2-accounting.md
40-
- name: Monitoring
41-
href: lift-and-shift-step-2-monitor.md
42-
- name: Deployment step 3 - Storage
43-
items:
44-
- name: Overview
45-
href: lift-and-shift-step-3-overview.md
46-
- name: Storage
47-
href: lift-and-shift-step-3-storage.md
48-
- name: Data migration
49-
href: lift-and-shift-step-3-data-migration.md
50-
- name: Deployment step 4 - Compute nodes
51-
items:
52-
- name: Overview
53-
href: lift-and-shift-step-4-overview.md
54-
- name: VM images
55-
href: lift-and-shift-step-4-vm-images.md
56-
- name: Deployment step 5 - End user entry point
57-
items:
58-
- name: Overview
59-
href: lift-and-shift-step-5-overview.md
60-
- name: End-user entry point
61-
href: lift-and-shift-step-5-end-user-entry-point.md
62-
- name: HPC Performance and Benchmarking
24+
25+
- name: Set up resource groups, networking, and foundational storage
26+
displayName: resource groups, VNet, NSG, Bastion, basic storage
27+
items:
28+
- name: Resource group configuration
29+
href: lift-and-shift-step-1-resource-group.md
30+
31+
- name: Network access and security configuration
32+
href: lift-and-shift-step-1-networking.md
33+
34+
- name: Basic storage account setup
35+
href: lift-and-shift-step-1-storage.md
36+
37+
- name: Configure CycleCloud, schedulers, identity, and monitoring
38+
displayName: CycleCloud, Slurm, PBS, LSF, Azure AD, Grafana, cost management
39+
items:
40+
- name: Resource orchestration with Azure CycleCloud
41+
href: lift-and-shift-step-2-resource-orchestrator.md
42+
43+
- name: Job scheduler deployment and migration
44+
href: lift-and-shift-step-2-job-scheduler.md
45+
46+
- name: Identity management and user authentication
47+
href: lift-and-shift-step-2-identity.md
48+
49+
- name: Monitoring with Azure Monitor and Grafana
50+
href: lift-and-shift-step-2-monitor.md
51+
52+
- name: Cost accounting and budget tracking
53+
href: lift-and-shift-step-2-accounting.md
54+
55+
- name: Configure HPC storage and migrate data
56+
displayName: Lustre, ANF, Azure NetApp Files, Blob, AzCopy, rsync, Data Box
57+
items:
58+
- name: Choose the right storage for your HPC workload
59+
href: /azure/virtual-machines/workload-guidelines-best-practices-storage.md
60+
61+
- name: HPC storage options (Lustre, ANF, Blob, Azure Files)
62+
href: lift-and-shift-step-3-storage.md
63+
64+
- name: Migrate data to Azure (Data Box, AzCopy, rsync)
65+
href: lift-and-shift-step-3-data-migration.md
66+
67+
- name: Provision HPC and GPU compute nodes
68+
displayName: VM SKUs, HPC images, AlmaLinux, Ubuntu, CUDA, InfiniBand
69+
items:
70+
- name: Compute node selection and quota planning
71+
href: lift-and-shift-step-4-overview.md
72+
73+
- name: VM image configuration (Azure HPC images)
74+
href: lift-and-shift-step-4-vm-images.md
75+
76+
- name: Enable user access and job submission
77+
displayName: VPN, Bastion, VDI, web portal, SSH, remote visualization
78+
items:
79+
- name: End user entry access
80+
href: lift-and-shift-step-5-overview.md
81+
82+
- name: User entry access configuration and job submission
83+
href: lift-and-shift-step-5-end-user-entry-point.md
84+
85+
86+
###########################################
87+
# SECTION 2: HPC PERFORMANCE & BENCHMARKING
88+
###########################################
89+
- name: HPC and AI performance and benchmarking
6390
expanded: true
6491
items:
65-
- name: Performance and benchmarking overview
66-
href: ./performance-benchmarking/overview.md
67-
- name: HPC workload best practices and storage solutions
68-
href: ./performance-benchmarking/hpc-storage-options.md
69-
- name: HPC performance and benchmarking applications
70-
href: ./performance-benchmarking/high-performance-computing-performance-benchmarking-applications.md
71-
- name: Performance optimization for HPC and AI VMs
72-
href: ./performance-benchmarking/optimize-performance.md
92+
93+
- name: HPC and AI benchmarking overview
94+
href: performance-benchmarking/overview.md
95+
96+
- name: Best practices for HPC workloads and storage performance
97+
href: performance-benchmarking/hpc-storage-options.md
98+
99+
- name: HPC and AI benchmarking applications reference
100+
href: performance-benchmarking/high-performance-computing-performance-benchmarking-applications.md
101+
102+
- name: Optimize performance for HPC and AI virtual machines
103+
href: performance-benchmarking/optimize-performance.md
104+
105+
- name: Run your first benchmark using STREAM
106+
href: performance-benchmarking/stream-benchmark.md
107+
108+
- name: AI training
109+
items:
110+
111+
- name: Choose your platform for AI training and fine-tuning
112+
href: performance-benchmarking/platform-selection-best-practices-for-hpc-ai-models.md

articles/high-performance-computing/lift-and-shift-overview.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "End-to-end lift and shift architecture overview"
3-
description: Learn how to conduct a lift and shift migration of HPC infrastructure and workloads from an on-premises environment to the cloud.
2+
title: Plan your on-premises HPC migration to cloud
3+
description: Learn how to plan and execute a lift-and-shift migration of HPC workloads from on-premises to cloud, including personas, architecture components, and best practices.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/12/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: "As an HPC administrator, I want to understand the lift and shift process for migrating on-premises infrastructure to the cloud, so that I can efficiently transition workloads while maintaining system performance and management."
1115
---
1216

13-
# End-to-end lift and shift architecture overview
17+
# Plan your on-premises HPC migration to cloud
1418

1519
"Lift and shift" in the context of High-Performance Computing (HPC) mostly refers to the process of migrating an on-premises environment and workload to the cloud. Ideally, modifications are kept to a minimum (for example, applications, job schedulers, and their configurations should remain mostly the same). Adjustments on storage and hardware are natural to happen because resources are different from on-premises to cloud platforms. With the lift and shift approach, organizations can start benefiting from the cloud more quickly.
1620

articles/high-performance-computing/lift-and-shift-production-level-overview.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,10 +3,14 @@ title: "Production level environment migration overview"
33
description: Learn what a high performance computing production-level environment migration entails.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: "As an IT administrator, I want to migrate my high-performance computing environment to the cloud, so that I can enhance performance, scalability, and efficiency while managing component dependencies effectively during the deployment process."
1115
---
1216

articles/high-performance-computing/lift-and-shift-proof-of-concept.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Proof-of-concept migration overview"
3-
description: Learn what a high performance computing proof-of-concept migration entails.
2+
title: Build a proof-of-concept HPC cluster on Azure
3+
description: Deploy a proof-of-concept HPC cluster on Azure using CycleCloud and Slurm to validate your migration before moving to production.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: As an IT manager, I want to understand the steps for conducting a proof-of-concept migration of my HPC infrastructure to the cloud, so that I can evaluate its benefits and performance before transitioning to a full production environment.
1115
---
1216

13-
# Proof-of-concept migration overview
17+
# Build a proof-of-concept HPC cluster on Azure
1418

1519
When you move an HPC infrastructure from the on-premises environment to the cloud, there are various aspects to be taken into account. This document provides guidance on how to create such HPC environment in the cloud. We recommend
1620
a two-phase approach. First, a proof-of-concept, and then a production-level environment. Once the production environment is up and running, only certain components should be modified over time, including changes on VM types and storage capabilities to best meet the varying requirements of users, projects, and business.

articles/high-performance-computing/lift-and-shift-step-1-networking.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Network access configuration"
3-
description: Learn how to configure network access during a migration of high performance computing architecture.
2+
title: Configure networking and security for Azure HPC clusters
3+
description: Set up virtual networks, subnets, NSGs, Azure Bastion, and VPN Gateway for secure HPC cluster access on Azure.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: As an IT administrator managing high performance computing resources, I want to configure network access during migration so that I can ensure secure and efficient communication between computational resources while maintaining proper security protocols.
1115
---
1216

13-
# Network access configuration
17+
# Configure networking and security for Azure HPC clusters
1418

1519
A key aspect of your high performance computing migration is the configuration of network access. Proper network access configuration ensures secure, efficient, and uninterrupted communication between computational resources. This part of the guide covers the needs, tools, services, and best practices associated with your network access configuration.
1620

articles/high-performance-computing/lift-and-shift-step-1-overview.md

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Basic infrastructure migration overview"
2+
title: Production HPC environment on Azure - deployment overview
33
description: Get a high level idea of the steps involved when migrating basic infrastructure components during a migration of high performance computing architecture.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: "As an HPC architect, I want to understand the foundational components required for cloud migration, so that I can successfully establish a landing zone and manage resources effectively for high-performance computing workloads."
1115
---
1216

13-
# Basic infrastructure migration overview
17+
# Production HPC environment on Azure - deployment overview
1418

1519
The critical foundational components required to establish a landing zone in the cloud for an HPC environment are outlined here. The focus is on setting up resource groups, networking, and basic storage, which serve as the backbone of a successful HPC lift-and-shift deployment.
1620

articles/high-performance-computing/lift-and-shift-step-1-resource-group.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Resource group configuration"
3-
description: Learn how to configure resource groups during a migration of high performance computing architecture.
2+
title: Configure resource groups for Azure HPC environments
3+
description: Set up and organize Azure resource groups for effective management, access control, and cost tracking in HPC deployments.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: As a cloud architect, I want to configure resource groups effectively during HPC migrations, so that I can optimize resource management, control access, and improve cost tracking for various workloads.
1115
---
1216

13-
# Resource group configuration
17+
# Configure resource groups for Azure HPC environments
1418

1519
A key aspect of this process is the configuration of resource groups. Resource groups in Azure serve as containers that hold related resources for an Azure solution. In an HPC environment, organizing resources into appropriate resource groups is essential for effective management, access control, and cost tracking. This part of the guide covers the needs and best practices associated with your resource groups.
1620

articles/high-performance-computing/lift-and-shift-step-1-storage.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Storage options"
3-
description: Learn what basic storage infrastructure options to consider during a migration of high performance computing architecture.
2+
title: Set up foundational storage accounts for HPC on Azure
3+
description: Create and configure basic Azure Storage accounts for logs, backups, and general data management in HPC environments.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: As a cloud architect, I want to establish a basic storage infrastructure for high-performance computing migration, so that I can ensure effective data management, security, and resource accessibility during the deployment process.
1115
---
1216

13-
# Storage options
17+
# Set up foundational storage accounts for HPC on Azure
1418

1519
In any Azure subscription, setting up basic storage is essential for managing data, applications, and resources effectively. While more advanced and HPC-specific storage configurations are addressed separately, a solid foundation of basic storage is crucial for general resource management and initial deployment needs. This part of the guide covers the needs, tools, services, and best practices associated with your basic storage infrastructure.
1620

articles/high-performance-computing/lift-and-shift-step-2-accounting.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Accounting configuration"
3-
description: Learn how to configure accounting during a migration of high performance computing architecture.
2+
title: Configure cost accounting and budget tracking for Azure HPC
3+
description: Set up cost management and accounting to track HPC resource usage, project costs, and budget forecasting on Azure.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: "As an HPC administrator, I want to configure accounting systems during migration, so that I can ensure efficient resource utilization, optimize costs, and maintain compliance with organizational policies."
1115
---
1216

13-
# Accounting configuration
17+
# Configure cost accounting and budget tracking for Azure HPC
1418

1519
A key aspect of your high performance computing migration is the configuration of accounting systems. Your accounting components ensure efficient resource utilization, cost management, and compliance. This part of the guide covers the needs, tools, services, and best practices associated with your accounting systems.
1620

articles/high-performance-computing/lift-and-shift-step-2-identity.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,20 @@
11
---
2-
title: "Identity management configuration"
3-
description: Learn how to configure identity management during a migration of high performance computing architecture.
2+
title: Configure identity management for Azure HPC clusters
3+
description: Set up user authentication and identity management for HPC environments using Azure Active Directory and on-premises directory services.
44
author: tomvcassidy
55
ms.author: tomcassidy
6-
ms.date: 04/10/2025
6+
ms.date: 02/19/2026
77
ms.topic: how-to
88
ms.service: azure-virtual-machines
99
ms.subservice: hpc
10+
ms.collections:
11+
- hpc-migration-content
12+
- migration
13+
- onprem-to-azure
1014
# Customer intent: As an IT administrator overseeing high performance computing migration, I want to configure identity management systems, so that I can ensure secure access to resources and comply with organizational policies during the migration process.
1115
---
1216

13-
# Identity management configuration
17+
# Configure identity management for Azure HPC clusters
1418

1519
A key aspect of your high performance computing migration is identity management configuration. Your identity management system ensures secure access to resources, seamless integration, and compliance with organizational policies. It should allow the creation and deletion of users and groups, allow updating and resetting of passwords, and support single sign-on. This part of the guide covers the needs, tools, services, and best practices associated with identity management.
1620

0 commit comments

Comments
 (0)