# Microsoft JDBC driver for Microsoft Fabric Data Engineering
JDBC (Java Database Connectivity) is a widely adopted standard that enables client applications to connect to and work with data from databases and big data platforms.
The Microsoft JDBC Driver for Fabric Data Engineering lets you connect, query, and manage Spark workloads in Microsoft Fabric with the reliability and simplicity of the JDBC standard. Built on Microsoft Fabric's Livy APIs, the driver provides secure and flexible Spark SQL connectivity to your Java applications and BI tools. This integration allows you to submit and execute Spark code directly without needing to create separate Notebook or Spark Job Definition artifacts. The driver is compatible with popular JDBC clients such as DbVisualizer and DBeaver, as well as BI tools that support JDBC connectivity, including Tableau.
## Key Features
- **JDBC 4.2 Compliant**: Full implementation of the JDBC 4.2 specification
- **Microsoft Entra ID Authentication**: Multiple authentication flows including interactive, client credentials, and certificate-based authentication
- **Enterprise Connection Pooling**: Built-in connection pooling with health monitoring, automatic recovery, and HikariCP integration (see the pooling sketch after this list)
- **Spark SQL Native Query Support**: Direct execution of Spark SQL statements without translation
- **Comprehensive Data Type Support**: Support for all Spark SQL data types including complex types (ARRAY, MAP, STRUCT)
- **Asynchronous Result Set Prefetching**: Background data loading for improved performance
- **Circuit Breaker Pattern**: Protection against cascading failures with automatic retry
- **Auto-Reconnection**: Transparent session recovery on connection failures
- **Advanced Retry Logic**: Retry with exponential backoff and session recovery for improved resilience
- **Proxy Support**: HTTP and SOCKS proxy configuration for enterprise environments
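
Because the driver is JDBC 4.2 compliant, it can sit behind any standard `DataSource` wrapper. The following minimal sketch shows one way to front the driver with HikariCP; the JDBC URL scheme shown here is an illustrative assumption, so check the documentation bundled with the driver download for the exact URL syntax and connection properties.

```java
import com.zaxxer.hikari.HikariConfig;
import com.zaxxer.hikari.HikariDataSource;

import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.Statement;

public class FabricPoolingSketch {
    public static void main(String[] args) throws Exception {
        HikariConfig config = new HikariConfig();
        // Hypothetical URL: the real scheme and parameters are defined
        // in the driver's own documentation.
        config.setJdbcUrl("jdbc:fabricspark://<workspace-id>/<lakehouse-id>");
        config.setMaximumPoolSize(5);

        // The pool hands out connections backed by the Fabric driver.
        try (HikariDataSource dataSource = new HikariDataSource(config);
             Connection connection = dataSource.getConnection();
             Statement statement = connection.createStatement();
             ResultSet results = statement.executeQuery("SHOW SCHEMAS")) {
            while (results.next()) {
                System.out.println(results.getString(1));
            }
        }
    }
}
```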
## Prerequisites
Before using the Microsoft JDBC Driver for Microsoft Fabric Data Engineering, ensure the following prerequisites are in place:
- **Java Development Kit (JDK)**: Version 11 or higher (Java 21 recommended)
- **Microsoft Fabric Access**: Access to a Microsoft Fabric workspace
- **Microsoft Entra ID credentials**: Appropriate credentials for authentication
- **Workspace and Lakehouse IDs**: GUID identifiers for your Fabric workspace and lakehouse
## Download and Installation
Microsoft JDBC Driver for Microsoft Fabric Data Engineering version 1.0.0 supports Java 11, 17, and 21. We're continually improving Java connectivity support and recommend that you work with the latest version of the Microsoft JDBC driver.
* [Download Microsoft JDBC Driver for Microsoft Fabric Data Engineering (zip)](https://download.microsoft.com/download/5e763393-274e-48c5-a55a-0375340bc520/ms-sparksql-jdbc-1.0.0.zip)
* [Download Microsoft JDBC Driver for Microsoft Fabric Data Engineering (tar)](https://download.microsoft.com/download/5e763393-274e-48c5-a55a-0375340bc520/ms-sparksql-jdbc-1.0.0.tar)
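
After you extract the downloaded package and add the driver JAR to your application's classpath, usage follows the standard JDBC pattern. The sketch below is an illustration only: the JDBC URL scheme and the property keys for the workspace and lakehouse IDs are placeholders, so use the exact names documented in the driver package.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;
import java.util.Properties;

public class FabricJdbcSketch {
    public static void main(String[] args) throws Exception {
        Properties properties = new Properties();
        // Placeholder property keys: consult the driver documentation for
        // the actual keys used for workspace, lakehouse, and authentication.
        properties.setProperty("workspaceId", "<workspace-id>");
        properties.setProperty("lakehouseId", "<lakehouse-id>");

        // Hypothetical URL scheme for illustration only.
        String url = "jdbc:fabricspark://onelake";

        try (Connection connection = DriverManager.getConnection(url, properties);
             Statement statement = connection.createStatement();
             ResultSet results = statement.executeQuery(
                     "SELECT * FROM my_table LIMIT 10")) {
            while (results.next()) {
                System.out.println(results.getString(1));
            }
        }
    }
}
```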
**File:** `docs/data-engineering/spark-odbc-driver.md`

author: ms-arali
ms.reviewer: arali
ms.topic: how-to
ms.date: 03/18/2026
ai-usage: ai-assisted
---
# Microsoft ODBC driver for Microsoft Fabric Data Engineering (Preview)
The Microsoft ODBC Driver for Fabric Data Engineering lets you connect, query, and manage Spark workloads in Microsoft Fabric.
## Key features
- **ODBC 3.x compliant**: Full implementation of ODBC 3.x specification
- **Microsoft Entra ID authentication**: Multiple authentication flows including Azure CLI, interactive, client credentials, certificate-based, and access token authentication
- **Spark SQL query support**: Direct execution of Spark SQL statements
- **Comprehensive data type support**: Support for all Spark SQL data types including complex types (ARRAY, MAP, STRUCT)
- **Session reuse**: Built-in session management for improved performance
- **Large table support**: Optimized handling for large result sets with configurable page sizes
- **Async prefetch**: Background data loading for improved performance
- **Proxy support**: HTTP proxy configuration for enterprise environments
- **Multi-schema Lakehouse support**: Connect to a specific schema within a Lakehouse
- **OneLake integration**: Access Lakehouse data stored in Microsoft OneLake, including tables across multiple schemas, through a unified ODBC interface without separate storage configuration
- **Environment items support**: Attach Fabric environment items during job execution to apply workspace libraries, Spark properties, and variables to each session
- **Custom Spark configuration**: Pass Spark configuration properties directly through the connection string to tune session behavior
> [!NOTE]
> In open-source Apache Spark, database and schema are used synonymously. For example, running `SHOW SCHEMAS` or `SHOW DATABASES` in a Fabric Notebook returns the same result — a list of all schemas in the Lakehouse.
Before using the Microsoft ODBC Driver for Microsoft Fabric Data Engineering, ensure the following prerequisites are in place:
- **Operating System**: Windows 10/11 or Windows Server 2016+
- **Microsoft Fabric Access**: Access to a Microsoft Fabric workspace
- **Microsoft Entra ID credentials**: Appropriate credentials for authentication
- **Workspace and Lakehouse IDs**: GUID identifiers for your Fabric workspace and lakehouse
- **Azure CLI** (optional): Required for Azure CLI authentication method
You can attach a Fabric environment item to the Spark session started by the driver. The selected environment's libraries, Spark properties, and variables are automatically applied when the session is created.
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| EnvironmentId | UUID | None | Fabric environment item identifier (GUID) to apply during Spark session creation |
**Example connection string with an environment item:**

```
DRIVER={Microsoft ODBC Driver for Microsoft Fabric Data Engineering};WorkspaceId=<workspace-id>;LakehouseId=<lakehouse-id>;AuthFlow=AZURE_CLI;EnvironmentId=<environment-id>
```

> [!NOTE]
> The environment is applied when the Spark session starts. If you also specify custom Spark configuration properties, session-level properties take precedence over the environment defaults.

#### Custom Spark configuration

You can pass Spark configuration properties directly in the connection string. Any parameter prefixed with `spark.` is automatically applied to the Spark session at creation time, allowing you to override workspace or runtime defaults.

**Example Spark configurations:**

```
spark.sql.shuffle.partitions=200
spark.sql.adaptive.enabled=true
spark.sql.autoBroadcastJoinThreshold=10485760
```

**Example connection string with custom Spark properties:**

```
DRIVER={Microsoft ODBC Driver for Microsoft Fabric Data Engineering};WorkspaceId=<workspace-id>;LakehouseId=<lakehouse-id>;AuthFlow=AZURE_CLI;spark.sql.shuffle.partitions=200;spark.sql.adaptive.enabled=true
```

> [!NOTE]
> Spark configuration properties are applied when the session is created. They apply to all queries run within that session and override environment or runtime defaults for the same properties.
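
The two mechanisms can be combined in one connection string. The example below reuses only parameters shown earlier on this page: the attached environment supplies its libraries and defaults, and the `spark.`-prefixed property overrides the matching setting for the session.

```
DRIVER={Microsoft ODBC Driver for Microsoft Fabric Data Engineering};WorkspaceId=<workspace-id>;LakehouseId=<lakehouse-id>;AuthFlow=AZURE_CLI;EnvironmentId=<environment-id>;spark.sql.shuffle.partitions=200
```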
## DSN configuration
### Create a system DSN
1. **Open the ODBC Data Source Administrator**

   ```
   %SystemRoot%\System32\odbcad32.exe
   ```

1. **Create New System DSN**
   - Go to "System DSN" tab
   - Select "Add"
   - Select "Microsoft ODBC Driver for Microsoft Fabric Data Engineering"
   - Select "Finish"

1. **Configure DSN Settings**
   - **Data Source Name**: Enter a unique name (e.g., `FabricODBC`)
   - **Description**: Optional description
   - **Workspace ID**: Your Fabric workspace GUID
   - **Lakehouse ID**: Your Fabric lakehouse GUID
   - **Authentication**: Select authentication method
   - **Environment ID** (optional): Enter the GUID of the Fabric environment item to attach during session creation
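
After the DSN is saved, applications can refer to it by name instead of repeating the full connection string. Assuming the DSN was named `FabricODBC` as in the example above, a minimal connection string looks like the following; in standard ODBC fashion, keywords appended after the DSN reference typically override the values stored in the DSN.

```
DSN=FabricODBC
```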
**File:** `docs/data-engineering/tutorial-lakehouse-introduction.md`

The following image shows the source, destination, and data transformation:
* **Consume**: Power BI can consume data from the lakehouse for reporting and visualization. Each lakehouse has a built-in TDS endpoint called the *SQL analytics endpoint* for easy connectivity and querying of data in the lakehouse tables from other reporting tools. You can also use Direct Lake over OneLake to let Power BI query lakehouse tables directly without import or a dedicated semantic model refresh cycle. Additionally, you can make your data available to non-Microsoft reporting tools by using the TDS/SQL analytics endpoint to connect and run SQL queries for analytics.
For Spark SQL workloads specifically, ODBC-compatible clients can connect using the [Microsoft ODBC Driver for Microsoft Fabric Data Engineering (Preview)](./spark-odbc-driver.md) with Microsoft Entra ID authentication (interactive, Azure CLI, service principal, certificate, or access token).
In today's data-driven world, maintaining up-to-date and accurate data models is crucial for informed business decisions. As data evolves, it's essential to refresh these models regularly to ensure that reports and dashboards reflect the most current information. Manual refreshes can be time-consuming and prone to errors, which is where Apache Airflow's orchestration, scheduling, and monitoring capabilities come into play. By leveraging Airflow, organizations can automate the refresh process of Power BI semantic models, ensuring timely and accurate data updates with minimal manual intervention.
This tutorial shows how to automate Power BI semantic model refreshes using Apache Airflow in Data Factory in Microsoft Fabric. You configure a connection, create a DAG (Directed Acyclic Graph), and schedule automatic refreshes so your reports and dashboards always reflect current data.
## Prerequisites
To get started, you must complete the following prerequisites:
:::image type="content" source="media/apache-airflow-jobs/configure-airflow-environment.png" lightbox="media/apache-airflow-jobs/configure-airflow-environment.png" alt-text="Screenshot to Add Airflow requirement.":::
## Create an Apache Airflow connection to Power BI
1. Select **View Airflow connections** to see all configured connections.
:::image type="content" source="media/apache-airflow-jobs/view-apache-airflow-connection.png" lightbox="media/apache-airflow-jobs/view-apache-airflow-connection.png" alt-text="Screenshot to view Apache Airflow connection.":::
2. Add a new connection. You can use the `Generic` connection type. Enter the following fields:
   - **Connection ID**: A unique identifier for the connection.
   - **Connection Type**: Generic
   - **Login**: The Client ID of your service principal.
   - **Password**: The Client secret of your service principal.
   - **Extra**: The Tenant ID of your service principal, in JSON form: `{"tenantId": "<tenant-id>"}`
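
As a concrete illustration, a completed connection might look like the following; the connection ID `powerbi_conn` and all bracketed values are placeholders you replace with your own service principal details.

```
Connection ID: powerbi_conn
Connection Type: Generic
Login: <client-id>
Password: <client-secret>
Extra: {"tenantId": "<tenant-id>"}
```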
**File:** `docs/data-factory/format-avro.md`

title: How to configure Avro format in the pipeline of Data Factory in Microsoft Fabric
description: This article explains how to configure Avro format in the pipeline of Data Factory in Microsoft Fabric.
ms.reviewer: jianleishen
ms.topic: how-to
ms.date: 04/24/2026
ms.custom:
- template-how-to
---
# Avro format in Data Factory in [!INCLUDE [product-name](../includes/product-name.md)]
Avro is a row-based data serialization format commonly used in Apache Hadoop workloads. This article outlines how to configure Avro format in a copy activity pipeline in Data Factory in [!INCLUDE [product-name](../includes/product-name.md)].
## Supported capabilities
Under **Advanced** settings in the **Destination** tab, the following Avro format settings are supported:
- **Max rows per file**: When writing data into a folder, you can choose to write to multiple files and specify the maximum rows per file.
- **File name prefix**: Applicable when **Max rows per file** is configured. Specify the file name prefix when writing data to multiple files, resulting in this pattern: `<fileNamePrefix>_00000.<fileExtension>`. If not specified, the file name prefix is auto-generated. This property doesn't apply when the source is a file-based store or a partition-option-enabled data store.
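
For example, with **Max rows per file** configured and a file name prefix of `data`, the generated Avro files follow the documented pattern:

```
data_00000.avro
data_00001.avro
data_00002.avro
```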
## Avro copy activity properties
### Avro as source
The following properties are supported in the copy activity **Source** section when using the Avro format.
**File:** `docs/data-factory/format-binary.md`

title: How to configure Binary format in the pipeline of Data Factory in Microsoft Fabric
description: This article explains how to configure Binary format in the pipeline of Data Factory in Microsoft Fabric.
ms.reviewer: jianleishen
ms.topic: how-to
ms.date: 04/24/2026
ms.custom:
- template-how-to
---
# Binary format in Data Factory in [!INCLUDE [product-name](../includes/product-name.md)]

Binary format copies files as-is without parsing, which is useful for moving files between storage locations without transformation. This article outlines how to configure Binary format in a copy activity pipeline in Data Factory in [!INCLUDE [product-name](../includes/product-name.md)].
## Supported capabilities
You can choose from the **None**, **bzip2**, **gzip**, **deflate**, **ZipDeflate**, and other supported compression types. The available **Compression level** options are:
- **Fastest**: The compression operation should complete as quickly as possible, even if the resulting file isn't optimally compressed.
- **Optimal**: The compression operation should be optimally compressed, even if the operation takes a longer time to complete. For more information, go to the [Compression Level](/dotnet/api/system.io.compression.compressionlevel) article.