Commit e152ae2

Merge pull request #53970 from MicrosoftDocs/NEW-introduction-sql-server-2025-data-virtualization
Syncing to Main from release branch
2 parents fe9520b + 66686a5 commit e152ae2

34 files changed

Lines changed: 1208 additions & 0 deletions
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.introduction
title: Introduction
metadata:
  title: Introduction
  description: "Learn about data virtualization in SQL Server 2025."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 5
content: |
  [!include[](includes/1-introduction.md)]
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.polybase
title: Introduction to PolyBase
metadata:
  title: Introduction to PolyBase
  description: "Learn about PolyBase and the evolution of PolyBase from SQL Server 2016 to SQL Server 2025."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 8
content: |
  [!include[](includes/2-polybase.md)]
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.polybase-credentials-data-sources
title: PolyBase credentials and data sources
metadata:
  title: PolyBase credentials and data sources
  description: "This unit goes over PolyBase objects, supported data sources, and PolyBase operations."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 6
content: |
  [!include[](includes/3-polybase-credentials-data-sources.md)]
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.exercise-query-parquet-file
title: Exercise - Use PolyBase to query a Parquet file
metadata:
  title: Exercise - Use PolyBase to query a Parquet file
  description: "Complete this exercise to learn how to use PolyBase to query a Parquet file in an external data source and manipulate the data."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 10
content: |
  [!include[](includes/4-exercise-query-parquet-file.md)]
Lines changed: 16 additions & 0 deletions
@@ -0,0 +1,16 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.exercise-connect-azure-sql-database-use-polybase
title: Exercise - Create an external table from a database in Azure SQL Database
metadata:
  title: Exercise - Create an external table from a database in Azure SQL Database
  description: "An exercise using data virtualization to connect to Azure SQL Database."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - sfi-ropc-nochange
    - build-2023
durationInMinutes: 10
content: |
  [!include[](includes/5-exercise-connect-azure-sql-database-use-polybase.md)]
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.create-external-table-as-select
title: CREATE EXTERNAL TABLE AS SELECT (CETAS)
metadata:
  title: CREATE EXTERNAL TABLE AS SELECT (CETAS)
  description: "Learn about the use cases for CETAS, its structure, and how to enable it in SQL Server 2025."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 8
content: |
  [!include[](includes/6-create-external-table-as-select.md)]
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.exercise-create-external-table-as-select
title: Exercise - CREATE EXTERNAL TABLE AS SELECT
metadata:
  title: Exercise - CREATE EXTERNAL TABLE AS SELECT
  description: "An exercise on using CREATE EXTERNAL TABLE AS SELECT (CETAS)."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 10
content: |
  [!include[](includes/7-exercise-create-external-table-as-select.md)]
Lines changed: 49 additions & 0 deletions
@@ -0,0 +1,49 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.knowledge-check
title: "Module assessment"
metadata:
  title: "Knowledge check"
  description: "Knowledge check"
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
  module_assessment: true
durationInMinutes: 3
content: "Choose the best response for each of the following questions."
quiz:
  title: Knowledge check - Choose the best answer
  questions:
    - content: "SQL Server 2025 can access external data sources by:"
      choices:
        - content: "A. Using the REST API to connect to S3-compatible object storage providers."
          isCorrect: false
          explanation: "SQL Server 2025 can also query Delta table files on REST API data sources by using a SELECT statement."
        - content: "B. Querying Delta table files."
          isCorrect: false
          explanation: "SQL Server 2025 can also access S3-compatible object storage providers through the REST API."
        - content: "C. A and B."
          isCorrect: true
          explanation: "SQL Server 2025 can access S3-compatible object storage providers and query Delta table files through the REST API."
    - content: "PolyBase allows SQL Server to connect to:"
      choices:
        - content: "A. SQL Server."
          isCorrect: false
          explanation: "PolyBase also allows SQL Server to connect to Oracle, Azure Blob Storage, Azure Data Lake Storage, and Teradata."
        - content: "B. Oracle."
          isCorrect: false
          explanation: "PolyBase also allows SQL Server to connect to SQL Server, Azure Blob Storage, Azure Data Lake Storage, and Teradata."
        - content: "C. Azure Blob Storage."
          isCorrect: false
          explanation: "PolyBase also allows SQL Server to connect to SQL Server, Oracle, Azure Data Lake Storage, and Teradata."
        - content: "D. Azure Data Lake Storage."
          isCorrect: false
          explanation: "PolyBase also allows SQL Server to connect to SQL Server, Oracle, Azure Blob Storage, and Teradata."
        - content: "E. Teradata."
          isCorrect: false
          explanation: "PolyBase also allows SQL Server to connect to SQL Server, Oracle, Azure Blob Storage, and Azure Data Lake Storage."
        - content: "F. A, B, C, D, and E."
          isCorrect: true
          explanation: "PolyBase allows SQL Server to connect to SQL Server, Oracle, Azure Blob Storage, Azure Data Lake Storage, Teradata, and more."
Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
### YamlMime:ModuleUnit
uid: learn.wwl.sql-server-data-virtualization.summary
title: Summary
metadata:
  title: Summary
  description: "Summary of what was learned in this data virtualization and PolyBase module."
  ms.date: 02/26/2026
  author: HugoMSFT
  ms.author: hudequei
  ms.topic: unit
  ms.custom:
    - build-2023
durationInMinutes: 2
content: |
  [!include[](includes/9-summary.md)]
Lines changed: 58 additions & 0 deletions
@@ -0,0 +1,58 @@
Data virtualization in SQL Server 2025 is the ability to access data where it lives. It integrates data at query time, without replicating or moving the original data.

This training module reviews the data virtualization options in SQL Server 2025, including:

- PolyBase services
- REST API connectors for Azure Data Lake Storage, Azure Blob Storage, and Amazon S3-compatible object storage, which open new data sources to data virtualization
- Transact-SQL (T-SQL) statements used for data virtualization, including OPENROWSET, CREATE EXTERNAL TABLE (CET), and CREATE EXTERNAL TABLE AS SELECT (CETAS)

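As a rough illustration of the T-SQL statements listed above, the following sketch registers an S3-compatible endpoint and reads a Parquet file from it, first ad hoc with OPENROWSET and then through an external table. It assumes PolyBase is installed on the instance. Every object name (`s3_cred`, `s3_ds`, `ParquetFormat`, `dbo.ext_sales`), the bucket path, the column list, and the values in angle brackets are hypothetical placeholders, not names used by this module.

```sql
-- Sketch only: all names, paths, and columns below are hypothetical.

-- Turn on PolyBase for the instance (assumes the feature is installed).
EXEC sp_configure @configname = 'polybase enabled', @configvalue = 1;
RECONFIGURE;
GO

-- A database master key protects the credential secret.
IF NOT EXISTS (SELECT 1 FROM sys.symmetric_keys WHERE name = '##MS_DatabaseMasterKey##')
    CREATE MASTER KEY ENCRYPTION BY PASSWORD = '<strong_password>';
GO

-- Credential for the S3-compatible endpoint.
CREATE DATABASE SCOPED CREDENTIAL s3_cred
WITH IDENTITY = 'S3 Access Key',
     SECRET = '<access_key_id>:<secret_key>';
GO

-- External data source that points at the endpoint.
CREATE EXTERNAL DATA SOURCE s3_ds
WITH (
    LOCATION = 's3://<host>:<port>/',
    CREDENTIAL = s3_cred
);
GO

-- OPENROWSET: ad hoc query, no table definition required.
SELECT TOP (10) *
FROM OPENROWSET(
    BULK '/sales-bucket/2025/sales.parquet',
    FORMAT = 'PARQUET',
    DATA_SOURCE = 's3_ds'
) AS sales;
GO

-- CREATE EXTERNAL TABLE (CET): a reusable table definition over the same file.
CREATE EXTERNAL FILE FORMAT ParquetFormat
WITH (FORMAT_TYPE = PARQUET);
GO

CREATE EXTERNAL TABLE dbo.ext_sales
(
    sale_id     INT,
    customer_id INT,
    sale_date   DATE,
    amount      DECIMAL(18, 2)
)
WITH (
    LOCATION = '/sales-bucket/2025/sales.parquet',
    DATA_SOURCE = s3_ds,
    FILE_FORMAT = ParquetFormat
);
GO
```

OPENROWSET suits one-off exploration, while an external table gives applications a stable name and schema to query. In both cases the data stays in the object store.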

## Principles of data virtualization

Data virtualization relies on three principles:

- **Data abstraction:** Hides the complexities of the underlying data systems, formats, and structures from the applications and users that access the data.

- **Zero replication:** Unlike traditional extract-transform-load (ETL), data virtualization doesn't need to collect the data into a separate repository to transform it to the destination format. It handles transformation and aggregation on the fly.

- **Real-time data:** Because data virtualization connects to the data source at query time, it always uses the latest available data.

## Benefits of data virtualization

Data virtualization has the following major benefits:

- **No data movement:** Accesses data in its current location.

- **T-SQL language:** Lets you work in the familiar T-SQL language, with all of its commands and enhancements.

- **One source for all your data:** Uses SQL Server 2025 as a single data source and data hub for all required data, hiding data complexity from applications. Database administrators and data engineers can maintain a single environment.

- **Security:** Uses SQL Server security features for granular permissions, credential management, and control.

- **Cost flexibility:** Is available in all SQL Server 2025 editions.

## Data virtualization use cases

SQL Server 2025 offers the following major data virtualization use cases:

- **In-database analytics:** Combine virtualized data with the SQL Server capabilities you already know.
- **Data offload and export:** Offload or export data to other data sources.
- **Data hub:** Use SQL Server as a centralized hub to connect, protect, and query different data sources and files, hiding the complexity from applications. There's no need to use an ETL tool to aggregate, copy, or move the data to a staging area.

:::image type="content" source="../media/sql-server-data-hub.png" alt-text="Screenshot of SQL Server as a data hub for data virtualization." border="false":::

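The data hub and offload use cases might look like the following sketch. It assumes that an external table `dbo.ext_sales`, an external data source `s3_ds`, and an external file format `ParquetFormat` already exist, and that `dbo.Customers` and `dbo.Orders` are local tables. All of these names, their columns, and the archive path are hypothetical.

```sql
-- Sketch only: every object and column name here is hypothetical.

-- Data hub: join a local table with external data in a single query.
SELECT c.customer_name,
       SUM(s.amount) AS total_sales
FROM dbo.Customers AS c
JOIN dbo.ext_sales AS s
    ON s.customer_id = c.customer_id
GROUP BY c.customer_name;
GO

-- Offload: CETAS needs the export option turned on for the instance.
EXEC sp_configure @configname = 'allow polybase export', @configvalue = 1;
RECONFIGURE;
GO

-- Write orders dated before 2020 to the external location as Parquet files
-- and register them as a new external table.
CREATE EXTERNAL TABLE dbo.ext_orders_archive
WITH (
    LOCATION = '/archive-bucket/orders/pre-2020/',
    DATA_SOURCE = s3_ds,
    FILE_FORMAT = ParquetFormat
)
AS
SELECT order_id, customer_id, order_date, amount
FROM dbo.Orders
WHERE order_date < '2020-01-01';
GO
```

After the CETAS statement completes, the exported rows stay queryable through `dbo.ext_orders_archive`. Removing them from `dbo.Orders` is a separate step that this sketch doesn't perform.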

## Learning objectives

After you complete this module, you:

- Understand the benefits and principles of data virtualization.
- Know what PolyBase is and how to use its capabilities.
- Are familiar with object storage solutions and SQL Server 2025 support for S3-compatible object storage.
- Know how to install and configure PolyBase on SQL Server 2025.
- Know how to access and query external data by using PolyBase in SQL Server 2025.

## Prerequisites

- Basic working knowledge of SQL Server 2025
- Fundamental knowledge of T-SQL and SQL query execution
- SQL Server 2025 installed
- SQL Server Management Studio (SSMS) installed
