Commit 535e522

Merge pull request #54105 from weslbo/freshness-update
Freshness update
2 parents f9ee2cc + 76c6507 commit 535e522

48 files changed

Lines changed: 123 additions & 101 deletions

Large commits have some content hidden by default.

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/1-introduction.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Introduction
 metadata:
 title: Introduction
 description: "Introduction"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/2-azure-data-lake-gen2.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Understand Azure Data Lake Storage Gen2
 metadata:
 title: Understand Azure Data Lake Storage Gen2
 description: "Understand Azure Data Lake Storage Gen2"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/3-create-data-lake-account.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Enable Azure Data Lake Storage Gen2 in Azure Storage
 metadata:
 title: Enable Azure Data Lake Storage Gen2 in Azure Storage
 description: "Enable Azure Data Lake Storage Gen2 in Azure Storage"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/4-azure-data-lake-and-blob-storage.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Compare Azure Data Lake Store to Azure Blob storage
 metadata:
 title: Compare Azure Data Lake Store to Azure Blob storage
 description: "Compare Azure Data Lake Store to Azure Blob storage"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/5-stages-for-processing-big-data.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Understand the stages for processing big data
 metadata:
 title: Understand the Stages for Processing Big Data
 description: "Understand the stages for processing big data"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/6-use-cases.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Use Azure Data Lake Storage Gen2 in data analytics workloads
 metadata:
 title: Use Azure Data Lake Storage Gen2 in Data Analytics Workloads
 description: "Use Azure Data Lake Storage Gen2 in data analytics workloads"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/7-knowledge-check.yml

Lines changed: 8 additions & 8 deletions
@@ -4,7 +4,7 @@ title: Module assessment
 metadata:
 title: Module Assessment
 description: "Knowledge check"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit
@@ -18,21 +18,21 @@ quiz:
 choices:
 - content: "A document database hosted in Azure Cosmos DB."
 isCorrect: false
-explanation: "Incorrect. Azure Data Lake Storage Gen2 does not store data in Azure Cosmos DB."
-- content: "An HDFS-compatible file system hosted in Azure Storage."
+explanation: "Incorrect. Azure Data Lake Storage Gen2 doesn't store data in Azure Cosmos DB."
+- content: "A hierarchical file system hosted in Azure Blob Storage."
 isCorrect: true
-explanation: "Correct. Azure Data Lake Storage Gen2 stores data in an HDFS compatible file system in an Azure Storage blob container."
-- content: "A relational data warehouse hosted in Azure Synapse Analytics."
+explanation: "Correct. Azure Data Lake Storage Gen2 stores data in a hierarchical file system built on Azure Blob Storage, accessible through open APIs compatible with modern analytics platforms such as Azure Databricks and Microsoft Fabric."
+- content: "A relational data warehouse hosted in Microsoft Fabric."
 isCorrect: false
-explanation: "Incorrect. Azure Data Lake Storage Gen2 does not store data in an Azure Synapse Analytics SQL database."
+explanation: "Incorrect. Azure Data Lake Storage Gen2 doesn't store data in a Microsoft Fabric warehouse."
 - content: "What option must you enable to use Azure Data Lake Storage Gen2?"
 choices:
 - content: "Global replication"
 isCorrect: false
-explanation: "Incorrect. Global replication does not enable Azure Data Lake Storage Gen2 containers."
+explanation: "Incorrect. Global replication doesn't enable Azure Data Lake Storage Gen2 containers."
 - content: "Data encryption"
 isCorrect: false
-explanation: "Incorrect. data encryption does not enable Azure Data Lake Storage Gen2 containers."
+explanation: "Incorrect. Data encryption doesn't enable Azure Data Lake Storage Gen2 containers."
 - content: "Hierarchical namespace"
 isCorrect: true
 explanation: "Correct. To enable Azure Data Lake Storage Gen2 containers, you must turn on the Hierarchical namespace option."

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/8-summary.yml

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ title: Summary
 metadata:
 title: Summary
 description: "Summary"
-ms.date: 08/21/2025
+ms.date: 04/03/2026
 author: weslbo
 ms.author: wedebols
 ms.topic: unit

learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/includes/2-azure-data-lake-gen2.md

Lines changed: 6 additions & 4 deletions
@@ -8,21 +8,23 @@ Azure Data Lake Storage combines a file system with a storage platform to help y

 Data Lake Storage is designed to deal with this variety and volume of data at exabyte scale while securely handling hundreds of gigabytes of throughput. With this, you can use Data Lake Storage Gen2 as the basis for both real-time and batch solutions.

-### Hadoop compatible access
+### Open analytics platform access

-A benefit of Data Lake Storage is that you can treat the data as if it's stored in a Hadoop Distributed File System (HDFS). With this feature, you can store the data in one place and access it through compute technologies including Azure Databricks, Azure HDInsight, and Azure Synapse Analytics without moving the data between environments. The data engineer also has the ability to use storage mechanisms such as the parquet format, which is highly compressed and performs well across multiple platforms using an internal columnar storage.
+A benefit of Data Lake Storage is that it exposes a hierarchical file system through open APIs, enabling you to store data in one place and access it through modern compute technologies including Azure Databricks and Microsoft Fabric without moving the data between environments. Data engineers can also use open file formats such as Parquet and Delta Lake, which are highly compressed, support schema enforcement, and perform well across multiple analytics platforms.

 ### Security

-Data Lake Storage supports access control lists (ACLs) and Portable Operating System Interface (POSIX) permissions that don't inherit the permissions of the parent directory. In fact, you can set permissions at a directory level or file level for the data stored within the data lake, providing a much more secure storage system. This security is configurable through technologies such as Hive and Spark or utilities such as Azure Storage Explorer, which runs on Windows, macOS, and Linux. All data that is stored is encrypted at rest by using either Microsoft or customer-managed keys.
+Azure Data Lake Storage uses a layered access control model. Azure role-based access control (Azure RBAC) lets you grant coarse-grained access—such as read or write access to all data in a container—to users, groups, and service principals. Azure Attribute-based access control (Azure ABAC) refines those role assignments by adding conditions, such as restricting access to data with a specific tag. For precise, file-level control, access control lists (ACLs) with Portable Operating System Interface (POSIX) permissions let you set permissions at the directory or file level.
+
+Permissions aren't automatically inherited from parent directories after a child item is created. However, you can configure default permissions on a parent directory, which are then applied to new child items at the time they're created. You can manage these settings using utilities such as Azure Storage Explorer, which runs on Windows, macOS, and Linux. All data that is stored is encrypted at rest by using either Microsoft-managed or customer-managed keys.

 ### Performance

 Azure Data Lake Storage organizes the stored data into a hierarchy of directories and subdirectories, much like a file system, for easier navigation. As a result, data processing requires less computational resources, reducing both the time and cost.

 ### Data redundancy

-Data Lake Storage takes advantage of the Azure Blob replication models that provide data redundancy in a single data center with locally redundant storage (LRS), or to a secondary region by using the Geo-redundant storage (GRS) option. This feature ensures that your data is always available and protected if catastrophe strikes.
+Data Lake Storage inherits all Azure Blob Storage replication models. Locally redundant storage (LRS) keeps multiple copies within a single data center, while zone-redundant storage (ZRS) replicates data across availability zones in the same region. For broader geographic protection, geo-redundant storage (GRS) or read-access geo-redundant storage (RA-GRS) replicates data to a secondary region. For the highest level of resilience, geo-zone-redundant storage (GZRS or RA-GZRS) combines zone and geographic redundancy. This range of options ensures your data is always available and protected regardless of the scale of disruption.

 > [!TIP]
 > Whenever planning for a data lake, a data engineer should give thoughtful consideration to structure, data governance, and security. This should include consideration of factors that can influence lake structure and organization, such as:
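
Note on the Security change in includes/2-azure-data-lake-gen2.md: the new wording says that default permissions on a parent directory are applied to child items when they are created, and are not inherited afterwards. The following is a minimal in-memory sketch of that rule only, not the Azure SDK or the service's implementation. The `Node` class, the `group:analysts` entry, and the file names are hypothetical, and the model deliberately ignores the umask and mask handling that the real service applies.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Node:
    """Toy model of a directory or file with POSIX-style ACL entries."""
    name: str
    is_dir: bool = False
    # Access ACL: principal -> permission string such as "r-x" (hypothetical format).
    acl: Dict[str, str] = field(default_factory=dict)
    # Default ACL: template copied onto new children (meaningful for directories).
    default_acl: Dict[str, str] = field(default_factory=dict)
    children: Dict[str, "Node"] = field(default_factory=dict)

    def create_child(self, name: str, is_dir: bool = False) -> "Node":
        # The parent's default ACL is copied once, at creation time.
        child = Node(name, is_dir, acl=dict(self.default_acl))
        if is_dir:
            # A child directory also carries the template forward for its own children.
            child.default_acl = dict(self.default_acl)
        self.children[name] = child
        return child


raw = Node("raw", is_dir=True, default_acl={"group:analysts": "r-x"})
first = raw.create_child("2025.parquet")

# Changing the parent's default ACL later does not touch existing children.
raw.default_acl["group:analysts"] = "rwx"
second = raw.create_child("2026.parquet")

print(first.acl)   # {'group:analysts': 'r-x'}
print(second.acl)  # {'group:analysts': 'rwx'}
```

Copying the template at creation time, rather than resolving permissions by walking up the tree on each access, is what makes later changes to the parent invisible to existing items in this sketch.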
Lines changed: 2 additions & 2 deletions
@@ -1,9 +1,9 @@
-Azure Data Lake Storage Gen2 isn't a standalone Azure service, but rather a configurable capability of a **StorageV2 (General Purpose V2)** Azure Storage.
+Azure Data Lake Storage Gen2 isn't a standalone Azure service, but rather a configurable capability of an Azure Storage account. You can enable it on a **Standard general-purpose v2** account (the most common choice) or a **Premium block blob** account for workloads that require higher throughput and lower latency.

 To enable Azure Data Lake Storage Gen2 in an Azure Storage account, you can select the option to **Enable hierarchical namespace** in the **Advanced** page when creating the storage account in the Azure portal:

 ![Screenshot of Advanced Settings for Creating Storage Account.](../media/3-create-storage-account-advanced.png)

 Alternatively, if you already have an Azure Storage account and want to enable the Azure data Lake Storage Gen2 capability, you can use the **Data Lake Gen2 upgrade** wizard in the Azure portal page for your storage account resource.

-![Screenshot of Advanced Settings for Creating Storage Account.](../media/3-data-lake-upgrade.png)
+![Screenshot of upgrading a Storage Account to Data Lake Gen2.](../media/3-data-lake-upgrade.png)
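
Note on the change above: the updated text names two account types that can have the hierarchical namespace enabled, and describes the portal route only. As a rough sketch of the scripted equivalent, the Python below assembles an `az storage account create` command for either account type without running it. The account name `examplelake001` and resource group `example-rg` are placeholders, the helper `build_create_command` is hypothetical, and the LRS SKUs are an assumption chosen for illustration (any supported redundancy option could be substituted).

```python
import shlex

# Account types named in the updated unit, mapped to Azure CLI --kind/--sku values.
# The LRS SKUs are illustrative assumptions, not a recommendation.
ACCOUNT_TYPES = {
    "standard-gpv2": {"kind": "StorageV2", "sku": "Standard_LRS"},
    "premium-block-blob": {"kind": "BlockBlobStorage", "sku": "Premium_LRS"},
}


def build_create_command(name, resource_group, account_type="standard-gpv2"):
    """Return the argv list for creating a storage account with hierarchical namespace on."""
    opts = ACCOUNT_TYPES[account_type]
    return [
        "az", "storage", "account", "create",
        "--name", name,
        "--resource-group", resource_group,
        "--kind", opts["kind"],
        "--sku", opts["sku"],
        # This flag corresponds to the "Enable hierarchical namespace" portal option.
        "--enable-hierarchical-namespace", "true",
    ]


# Print the command instead of executing it, so nothing is created by accident.
print(shlex.join(build_create_command("examplelake001", "example-rg")))
```

Returning an argument list rather than a single string keeps the command safe to pass to `subprocess.run` later without shell quoting issues.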
