You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/7-knowledge-check.yml
+8-8Lines changed: 8 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ title: Module assessment
4
4
metadata:
5
5
title: Module Assessment
6
6
description: "Knowledge check"
7
-
ms.date: 08/21/2025
7
+
ms.date: 04/03/2026
8
8
author: weslbo
9
9
ms.author: wedebols
10
10
ms.topic: unit
@@ -18,21 +18,21 @@ quiz:
18
18
choices:
19
19
- content: "A document database hosted in Azure Cosmos DB."
20
20
isCorrect: false
21
-
explanation: "Incorrect. Azure Data Lake Storage Gen2 does not store data in Azure Cosmos DB."
22
-
- content: "An HDFS-compatible file system hosted in Azure Storage."
21
+
explanation: "Incorrect. Azure Data Lake Storage Gen2 doesn't store data in Azure Cosmos DB."
22
+
- content: "A hierarchical file system hosted in Azure Blob Storage."
23
23
isCorrect: true
24
-
explanation: "Correct. Azure Data Lake Storage Gen2 stores data in an HDFS compatible file system in an Azure Storage blob container."
25
-
- content: "A relational data warehouse hosted in Azure Synapse Analytics."
24
+
explanation: "Correct. Azure Data Lake Storage Gen2 stores data in a hierarchical file system built on Azure Blob Storage, accessible through open APIs compatible with modern analytics platforms such as Azure Databricks and Microsoft Fabric."
25
+
- content: "A relational data warehouse hosted in Microsoft Fabric."
26
26
isCorrect: false
27
-
explanation: "Incorrect. Azure Data Lake Storage Gen2 does not store data in an Azure Synapse Analytics SQL database."
27
+
explanation: "Incorrect. Azure Data Lake Storage Gen2 doesn't store data in a Microsoft Fabric warehouse."
28
28
- content: "What option must you enable to use Azure Data Lake Storage Gen2?"
29
29
choices:
30
30
- content: "Global replication"
31
31
isCorrect: false
32
-
explanation: "Incorrect. Global replication does not enable Azure Data Lake Storage Gen2 containers."
32
+
explanation: "Incorrect. Global replication doesn't enable Azure Data Lake Storage Gen2 containers."
33
33
- content: "Data encryption"
34
34
isCorrect: false
35
-
explanation: "Incorrect. data encryption does not enable Azure Data Lake Storage Gen2 containers."
35
+
explanation: "Incorrect. Data encryption doesn't enable Azure Data Lake Storage Gen2 containers."
36
36
- content: "Hierarchical namespace"
37
37
isCorrect: true
38
38
explanation: "Correct. To enable Azure Data Lake Storage Gen2 containers, you must turn on the Hierarchical namespace option."
Copy file name to clipboardExpand all lines: learn-pr/data-ai-cert/introduction-to-azure-data-lake-storage/includes/2-azure-data-lake-gen2.md
+6-4Lines changed: 6 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,21 +8,23 @@ Azure Data Lake Storage combines a file system with a storage platform to help y
8
8
9
9
Data Lake Storage is designed to deal with this variety and volume of data at exabyte scale while securely handling hundreds of gigabytes of throughput. With this, you can use Data Lake Storage Gen2 as the basis for both real-time and batch solutions.
10
10
11
-
### Hadoop compatible access
11
+
### Open analytics platform access
12
12
13
-
A benefit of Data Lake Storage is that you can treat the data as if it's stored in a Hadoop Distributed File System (HDFS). With this feature, you can store the data in one place and access it through compute technologies including Azure Databricks, Azure HDInsight, and Azure Synapse Analytics without moving the data between environments. The data engineer also has the ability to use storage mechanisms such as the parquet format, which is highly compressedand performs well across multiple platforms using an internal columnar storage.
13
+
A benefit of Data Lake Storage is that it exposes a hierarchical file system through open APIs, enabling you to store data in one place and access it through modern compute technologies including Azure Databricksand Microsoft Fabric without moving the data between environments. Data engineers can also use open file formats such as Parquet and Delta Lake, which are highly compressed, support schema enforcement, and perform well across multiple analytics platforms.
14
14
15
15
### Security
16
16
17
-
Data Lake Storage supports access control lists (ACLs) and Portable Operating System Interface (POSIX) permissions that don't inherit the permissions of the parent directory. In fact, you can set permissions at a directory level or file level for the data stored within the data lake, providing a much more secure storage system. This security is configurable through technologies such as Hive and Spark or utilities such as Azure Storage Explorer, which runs on Windows, macOS, and Linux. All data that is stored is encrypted at rest by using either Microsoft or customer-managed keys.
17
+
Azure Data Lake Storage uses a layered access control model. Azure role-based access control (Azure RBAC) lets you grant coarse-grained access—such as read or write access to all data in a container—to users, groups, and service principals. Azure Attribute-based access control (Azure ABAC) refines those role assignments by adding conditions, such as restricting access to data with a specific tag. For precise, file-level control, access control lists (ACLs) with Portable Operating System Interface (POSIX) permissions let you set permissions at the directory or file level.
18
+
19
+
Permissions aren't automatically inherited from parent directories after a child item is created. However, you can configure default permissions on a parent directory, which are then applied to new child items at the time they're created. You can manage these settings using utilities such as Azure Storage Explorer, which runs on Windows, macOS, and Linux. All data that is stored is encrypted at rest by using either Microsoft-managed or customer-managed keys.
18
20
19
21
### Performance
20
22
21
23
Azure Data Lake Storage organizes the stored data into a hierarchy of directories and subdirectories, much like a file system, for easier navigation. As a result, data processing requires less computational resources, reducing both the time and cost.
22
24
23
25
### Data redundancy
24
26
25
-
Data Lake Storage takes advantage of the Azure Blob replication models that provide data redundancy in a single data center with locally redundant storage (LRS), or to a secondary region by using the Geo-redundant storage (GRS) option. This feature ensures that your data is always available and protected if catastrophe strikes.
27
+
Data Lake Storage inherits all Azure Blob Storage replication models. Locally redundant storage (LRS) keeps multiple copies within a single data center, while zone-redundant storage (ZRS) replicates data across availability zones in the same region. For broader geographic protection, geo-redundant storage (GRS) or read-access geo-redundant storage (RA-GRS) replicates data to a secondary region. For the highest level of resilience, geo-zone-redundant storage (GZRS or RA-GZRS) combines zone and geographic redundancy. This range of options ensures your data is always available and protected regardless of the scale of disruption.
26
28
27
29
> [!TIP]
28
30
> Whenever planning for a data lake, a data engineer should give thoughtful consideration to structure, data governance, and security. This should include consideration of factors that can influence lake structure and organization, such as:
Azure Data Lake Storage Gen2 isn't a standalone Azure service, but rather a configurable capability of a **StorageV2 (General Purpose V2)** Azure Storage.
1
+
Azure Data Lake Storage Gen2 isn't a standalone Azure service, but rather a configurable capability of an Azure Storage account. You can enable it on a **Standard general-purpose v2** account (the most common choice) or a **Premium block blob** account for workloads that require higher throughput and lower latency.
2
2
3
3
To enable Azure Data Lake Storage Gen2 in an Azure Storage account, you can select the option to **Enable hierarchical namespace** in the **Advanced** page when creating the storage account in the Azure portal:
4
4
5
5

6
6
7
7
Alternatively, if you already have an Azure Storage account and want to enable the Azure data Lake Storage Gen2 capability, you can use the **Data Lake Gen2 upgrade** wizard in the Azure portal page for your storage account resource.
8
8
9
-

9
+

0 commit comments