Skip to content

Commit f0c3482

Browse files
committed
added video
1 parent 3401489 commit f0c3482

15 files changed

Lines changed: 21 additions & 9 deletions

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/1-introduction.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Introduction
44
metadata:
55
title: Introduction
66
description: "Introduction to monitoring, troubleshooting, and optimizing workloads in Azure Databricks"
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/2-monitor-manage-cluster-consumption.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Monitor and manage cluster consumption
44
metadata:
55
title: Monitor and Manage Cluster Consumption
66
description: Learn how to monitor and manage cluster consumption in Azure Databricks to optimize performance and cost using metrics, auto-termination, autoscaling, and budgets.
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/3-troubleshoot-repair-lakeflow-jobs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Troubleshoot and repair Lakeflow Jobs
44
metadata:
55
title: Troubleshoot and Repair Lakeflow Jobs
66
description: Learn how to troubleshoot, repair, restart, and stop Lakeflow Jobs in Azure Databricks to maintain reliable data pipelines.
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/4-troubleshoot-repair-spark-jobs-notebooks.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Troubleshoot Spark jobs and notebooks
44
metadata:
55
title: Troubleshoot Spark Jobs and Notebooks
66
description: Learn how to troubleshoot and repair Apache Spark jobs and notebooks in Azure Databricks, including diagnosing performance issues, resolving resource bottlenecks, and restarting clusters.
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/5-resolve-cache-skew-spill-shuffle.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Investigate caching, skewing, spilling, shuffle
44
metadata:
55
title: Investigate Caching, Skewing, Spilling, Shuffle
66
description: Learn how to investigate and resolve caching, skewing, spilling, and shuffle issues in Azure Databricks using the DAG, Spark UI, and query profile.
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/6-implement-log-streaming-azure-analytics.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Implement log streaming with Azure Log Analytics
44
metadata:
55
title: Implement Log Streaming with Azure Log Analytics
66
description: Learn how to implement log streaming from Azure Databricks to Azure Log Analytics for centralized monitoring, querying with KQL, and creating alerts for proactive troubleshooting.
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/7-knowledge-check.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Module assessment
44
metadata:
55
title: Module assessment
66
description: "Knowledge check"
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/8-summary.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: Summary
44
metadata:
55
title: Summary
66
description: "Summary"
7-
ms.date: 12/07/2025
7+
ms.date: 03/10/2026
88
author: weslbo
99
ms.author: wedebols
1010
ms.topic: unit

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/includes/1-introduction.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,5 @@
1+
>[!VIDEO https://learn-video.azurefd.net/vod/player?id=aafb4cc3-3e5a-4168-8e10-d78366db752c]
2+
13
Running data workloads in Azure Databricks at scale requires more than setting up clusters and writing pipelines—it demands continuous attention to resource consumption, job health, and performance optimization. When compute costs climb unexpectedly, jobs fail without clear explanations, or queries that once ran quickly now take hours, you need systematic approaches to identify problems and apply targeted fixes.
24

35
Azure Databricks provides comprehensive tools for monitoring and troubleshooting across all compute types. **Cluster metrics** reveal CPU, memory, and network utilization patterns that help you right-size resources. **The Spark UI** exposes job execution details, stage-level metrics, and task distributions that pinpoint bottlenecks. **System tables** enable you to query billing data and usage patterns directly, supporting cost attribution and chargeback processes.

learn-pr/wwl-databricks/monitor-troubleshoot-optimize-workloads-azure-databricks/includes/2-monitor-manage-cluster-consumption.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,5 @@
1+
>[!VIDEO https://learn-video.azurefd.net/vod/player?id=56720b1a-acba-450c-b425-e96495ac48a9]
2+
13
When clusters run longer than needed or consume more resources than necessary, costs can escalate quickly. As a data engineer, you're responsible for **monitoring cluster usage** and taking action to balance **performance requirements** with **cost efficiency**.
24

35
## Understand the impact of cluster consumption

0 commit comments

Comments
 (0)