Commit 82ff537

updated units
1 parent 4ef23fb commit 82ff537

5 files changed

Lines changed: 68 additions & 5 deletions
learn-pr/wwl-databricks/implement-lakeflow-jobs/2-create-lakeflow-job.yml

Lines changed: 2 additions & 2 deletions
````diff
@@ -4,11 +4,11 @@ title: Create job setup and configuration
 metadata:
   title: Create Job Setup and Configuration
   description: Learn how to create and configure a Lakeflow Job in Azure Databricks, including setting up tasks, selecting compute resources, organizing task dependencies, and configuring job access permissions.
-  ms.date: 01/14/2026
+  ms.date: 01/15/2026
   author: weslbo
   ms.author: wedebols
   ms.topic: unit
   ai-usage: ai-generated
-durationInMinutes: 9
+durationInMinutes: 11
 content: |
   [!include[](includes/2-create-lakeflow-job.md)]
````

learn-pr/wwl-databricks/implement-lakeflow-jobs/4-schedule-job.yml

Lines changed: 2 additions & 2 deletions
````diff
@@ -4,11 +4,11 @@ title: Schedule a job
 metadata:
   title: Schedule a Job
   description: Learn how to schedule Lakeflow Jobs in Azure Databricks using simple intervals or advanced cron expressions to automate your data pipelines.
-  ms.date: 12/07/2025
+  ms.date: 01/15/2026
   author: weslbo
   ms.author: wedebols
   ms.topic: unit
   ai-usage: ai-generated
-durationInMinutes: 6
+durationInMinutes: 9
 content: |
   [!include[](includes/4-schedule-job.md)]
````

learn-pr/wwl-databricks/implement-lakeflow-jobs/includes/2-create-lakeflow-job.md

Lines changed: 24 additions & 0 deletions
````diff
@@ -130,4 +130,28 @@ To configure permissions, navigate to **Jobs & Pipelines**, select your job, ope
 
 When a job runs, it executes with the job owner's permissions or the configured service principal's permissions—not the triggering user's permissions. For production jobs, grant `CAN MANAGE` to the pipeline team, `CAN RUN` to users who need manual execution, and `CAN VIEW` to stakeholders requiring visibility.
 
+## Configure run identity and Unity Catalog access
+
+When your job accesses Unity Catalog objects—such as tables, views, or volumes—the job's **run identity** must have the required Unity Catalog privileges. This is a critical prerequisite before configuring any job that reads from or writes to Unity Catalog-managed data.
+
+The run identity is the principal whose permissions Unity Catalog evaluates during job execution. By default, jobs run as the **job owner** (the user who created the job). For production workloads, you can configure a **service principal** as the run identity to avoid dependency on individual user accounts.
+
+Before creating your job, verify that the run identity has the necessary privileges:
+
+| Operation | Required Unity Catalog privilege |
+| --------- | -------------------------------- |
+| Read from a table | `SELECT` on the table |
+| Write to a table | `MODIFY` on the table |
+| Create tables in a schema | `CREATE TABLE` and `USE SCHEMA` on the schema |
+| Access a volume | `READ VOLUME` or `WRITE VOLUME` on the volume |
+
+To grant privileges to a service principal or user, use SQL commands like:
+
+```sql
+GRANT SELECT, MODIFY ON TABLE catalog.schema.table TO `service-principal-id`;
+GRANT USE SCHEMA ON SCHEMA catalog.schema TO `service-principal-id`;
+```
+
+If the run identity lacks the required privileges, the job fails at runtime with an authorization error—even if the job configuration itself is valid. Always verify Unity Catalog access before scheduling production jobs.
+
 With your job created, tasks configured, dependencies set, and permissions assigned, you're ready to run your workflow. The next step is understanding how to monitor job execution and handle run outcomes.
````
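The grants added in this file can be double-checked before the job's first scheduled run. A minimal verification sketch, assuming standard Unity Catalog `SHOW GRANTS` syntax and reusing the same placeholder names (`catalog.schema.table`, `service-principal-id`) from the unit — substitute your own securables and principal:

```sql
-- Placeholder names; replace with your catalog, schema, table, and principal.
-- List all privileges currently held on the target table and schema, so you
-- can confirm SELECT/MODIFY and USE SCHEMA actually reached the run identity.
SHOW GRANTS ON TABLE catalog.schema.table;
SHOW GRANTS ON SCHEMA catalog.schema;

-- Or narrow the check to a single principal:
SHOW GRANTS `service-principal-id` ON TABLE catalog.schema.table;
```

Running these as part of a pre-deployment checklist catches the "valid job, missing privilege" failure mode the unit warns about before it surfaces as a runtime authorization error.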

learn-pr/wwl-databricks/implement-lakeflow-jobs/includes/4-schedule-job.md

Lines changed: 39 additions & 0 deletions
````diff
@@ -85,6 +85,45 @@ Consider these factors when choosing a time zone:
 > [!TIP]
 > For jobs that must run at exact intervals regardless of local time changes, always use UTC.
 
+## Control concurrent job runs
+
+When scheduled jobs take longer than expected, a new run might start before the previous one finishes. This overlap can cause data corruption, duplicate processing, or resource contention. Azure Databricks provides concurrency settings to control this behavior.
+
+### Configure maximum concurrent runs
+
+The **Maximum concurrent runs** setting limits how many instances of the same job can execute simultaneously. By default, jobs allow multiple concurrent runs. For jobs that must not overlap—such as those writing to the same tables—set this value to **1**.
+
+To configure maximum concurrent runs:
+
+1. Open your job in the **Jobs & Pipelines** workspace UI.
+2. In the **Job details** panel, locate the **Maximum concurrent runs** setting.
+3. Set the value to control how many runs can execute at once.
+
+When a new run is triggered but the maximum concurrent runs limit is reached, Azure Databricks must decide what to do with the incoming run.
+
+### Configure queue behavior for overlapping runs
+
+When concurrent runs exceed your configured limit, you choose how the scheduler handles the new run:
+
+| Behavior | Description | Use case |
+|----------|-------------|----------|
+| **Queue the run** | The new run waits until a slot becomes available, then executes | Jobs that must eventually run—no triggers should be missed |
+| **Cancel the run** | The new run is immediately canceled | Jobs where stale triggers are not valuable |
+| **Skip the run** | Similar to cancel—the run doesn't execute | Jobs where missing occasional runs is acceptable |
+
+For most data pipelines, **queue the run** ensures that all scheduled executions eventually complete. This approach prevents data gaps when a job occasionally runs longer than its schedule interval.
+
+Consider a job scheduled to run every hour. If a run takes 75 minutes to complete, the next scheduled trigger arrives while the job is still running. With concurrency set to 1 and queue enabled:
+
+1. The first run continues processing.
+2. The second run enters the queue.
+3. When the first run completes, the queued run starts immediately.
+
+This pattern ensures sequential, non-overlapping execution while preserving all scheduled runs.
+
+> [!NOTE]
+> Queued runs consume no compute resources while waiting. They only start when a concurrent slot becomes available.
+
 ## Scheduling considerations for production workloads
 
 The Azure Databricks job scheduler handles most scenarios reliably, but it's not designed for low-latency requirements. Network conditions or cloud service issues can occasionally delay job starts by several minutes. When service recovers, scheduled jobs run immediately.
````
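The concurrency and queue behavior added in this file can also be set outside the UI. A hedged sketch of the relevant fields in a Databricks Jobs API 2.1 job-settings payload (`max_concurrent_runs`, `queue.enabled`, and a Quartz cron schedule are documented API fields; the job name and cron expression are illustrative):

```json
{
  "name": "hourly-ingest",
  "max_concurrent_runs": 1,
  "queue": { "enabled": true },
  "schedule": {
    "quartz_cron_expression": "0 0 * * * ?",
    "timezone_id": "UTC",
    "pause_status": "UNPAUSED"
  }
}
```

With these settings, a trigger that arrives while a run is still executing waits in the queue rather than being skipped — the sequential, no-gaps pattern described in the hourly example above.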

learn-pr/wwl-databricks/implement-lakeflow-jobs/index.yml

Lines changed: 1 addition & 1 deletion
````diff
@@ -3,7 +3,7 @@ uid: learn.wwl.implement-lakeflow-jobs
 metadata:
   title: Implement Lakeflow Jobs with Azure Databricks
   description: Learn how to create, configure, schedule, and monitor Lakeflow Jobs in Azure Databricks to automate your data pipelines.
-  ms.date: 01/14/2026
+  ms.date: 01/15/2026
   author: weslbo
   ms.author: wedebols
   ms.topic: module
````
