support/azure/service-fabric/cluster/troubleshoot-service-fabric-repair-jobs.md
Lines changed: 12 additions & 12 deletions
@@ -1,6 +1,6 @@
---
-title: Troubleshoot repair jobs that aren't approved by using Service Fabric Explorer
-description: This article provides guidance on troubleshooting repair jobs that aren't being approved in a Service Fabric cluster using Service Fabric Explorer (SFX). It explains the concepts of repair tasks and repair jobs, their states, and how to analyze them through SFX.
+title: Troubleshoot unapproved repair jobs using Service Fabric Explorer
+description: Learn how to troubleshoot stuck repair jobs in Azure Service Fabric clusters using Service Fabric Explorer. Analyze repair task states, safety checks, and health checks to resolve approval issues.
ms.topic: troubleshooting-general
ms.author: jarrettr
ms.reviewer: ashukumar, v-ryanberg
@@ -11,7 +11,7 @@ ms.date: 01/20/2026
# Customer intent: As a Service Fabric customer, I want to analyze the reason why a repair job is stuck using Service Fabric Explorer.
---

-# Troubleshoot repair jobs that aren't approved by using Service Fabric Explorer
+# Troubleshoot unapproved repair jobs using Service Fabric Explorer

## Summary
@@ -82,19 +82,19 @@ To view jobs that Service Fabric receives for approval, go to the **Infrastructu
Jobs only appear here when they're present in the received document. In addition to the **Job ID** and **Acknowledgement Status**, the **Impact Types** section displays the nature of the job’s impact. The **Current Repair Task** section shows which repair task is actively running for job approval on the Service Fabric side. By selecting **All Repair Tasks**, you can view the status of every repair task associated with the current job.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-job-view.png" alt-text="Screenshot of the Infrastructure Jobs tab in Service Fabric Explorer showing job ID, acknowledgement status, and impact types." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-job-view.png":::

### Repair Jobs and Health Check view

To view individual and all repair tasks associated with a cluster, go to the **Repair Jobs** tab. This displays pending repair tasks, completed repair tasks, or cancelled repair tasks. You can also see the state for any pending task.

If a repair task state is Created, Claimed, or Preparing, it's not yet approved by Service Fabric. Once a repair task transitions to the Approved state, it's considered approved and is then forwarded to the Repair Executor for the corresponding job.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/repair-task-view.png" alt-text="Screenshot of the Repair Jobs tab in Service Fabric Explorer showing repair task states." lightbox="media/troubleshoot-service-fabric-repair-jobs/repair-task-view.png":::
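
The approval rule described above can be expressed as a small check. The following is a minimal sketch, not part of Service Fabric Explorer or any Service Fabric SDK: the function names and the `(task_id, state)` input shape are hypothetical, and the state strings are copied by hand from the **Repair Jobs** tab.

```python
# Hypothetical helper: the state names come from the article text; the
# function names and the (task_id, state) input shape are assumptions.
PRE_APPROVAL_STATES = frozenset({"Created", "Claimed", "Preparing"})


def awaiting_approval(state: str) -> bool:
    """Return True if a repair task in this state isn't yet approved."""
    return state in PRE_APPROVAL_STATES


def pending_approval(tasks):
    """Keep only the IDs of tasks that are still waiting for approval."""
    return [task_id for task_id, state in tasks if awaiting_approval(state)]


if __name__ == "__main__":
    # Example values are made up for illustration.
    observed = [("task-a", "Created"), ("task-b", "Approved"), ("task-c", "Preparing")]
    print(pending_approval(observed))
```

A task that such a check reports as still pending is a candidate for the health check and safety check analysis in the rest of this section.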
If a repair task gets stuck in the Preparing state, it's either stuck in a health check or a safety check. An unhealthy entity in the cluster (including customer applications as well as system applications) can cause the health check to fail. To determine if the task is stuck in a health check, first verify whether **Preparing** or **Restoring Health Check** is enabled based on the state where the task is stuck. In the **Repair Task** view, expanding the task shows the health check status, indicating if it's enabled.

-:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-health-check.png" alt-text="Cluster Health Check view." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-health-check.png":::
+:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-health-check.png" alt-text="Screenshot of an expanded repair task showing health check status and preparing health check details." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-health-check.png":::

If enabled, **Repair Task History** shows that the health check started but didn't complete, confirming that the task is stuck in the Health Check phase.
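
The triage for a task stuck in the Preparing state can be written down as a decision function. This is only a sketch under stated assumptions: the three boolean inputs are hypothetical and stand for what you read manually from the expanded task and from **Repair Task History**. They aren't fields of any Service Fabric API.

```python
def classify_stuck_phase(health_check_enabled: bool,
                         health_check_started: bool,
                         health_check_completed: bool) -> str:
    """Classify where a task stuck in Preparing is most likely waiting.

    All three parameters are hypothetical inputs, filled in by hand from
    the Repair Task view and Repair Task History.
    """
    if not health_check_enabled:
        # With no health check enabled, the article leaves only the safety check.
        return "safety check"
    if health_check_started and not health_check_completed:
        # Started but never completed: stuck in the Health Check phase.
        return "health check"
    if health_check_completed:
        # The health check finished, so the remaining candidate is a safety check.
        return "safety check"
    # Enabled but not started yet: the text doesn't cover this case.
    return "undetermined"


if __name__ == "__main__":
    print(classify_stuck_phase(True, True, False))
```

The `undetermined` branch is deliberate: the article describes only the started-but-not-completed case, so the sketch doesn't guess beyond it.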
@@ -104,26 +104,26 @@ A repair task can get stuck in the Safety Check phase only if it has an impact o
For example, in the following screenshot, the repair task is stuck in the **EnsureSeedNodeQuorum** safety check.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/safety-check-view.png" alt-text="Screenshot of the Safety Check view in Service Fabric Explorer showing the specific check where the task is stuck." lightbox="media/troubleshoot-service-fabric-repair-jobs/safety-check-view.png":::

If there are no errors in **Infrastructure Service** related to a repair task and the task has entered the Executing state, it means the job’s acknowledgment status is Acknowledged for Impact Start. Similarly, if the repair task transitions to the Completed state, it indicates that the job’s acknowledgment status is Acknowledged for Impact End.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-repair-task-executing.png" alt-text="Screenshot of a repair task in the Executing state with job acknowledgment status Acknowledged for Impact Start." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-repair-task-executing.png":::
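
The relationship between repair task state and job acknowledgment status described above amounts to a two-entry lookup. The sketch below is illustrative only: the dictionary, the function name, and the `infrastructure_service_has_errors` flag are hypothetical, and the status strings are taken from the article text rather than from an API.

```python
from typing import Optional

# Mapping taken from the article text; the names here are assumptions.
ACK_STATUS_BY_TASK_STATE = {
    "Executing": "Acknowledged for Impact Start",
    "Completed": "Acknowledged for Impact End",
}


def expected_ack_status(task_state: str,
                        infrastructure_service_has_errors: bool = False) -> Optional[str]:
    """Return the acknowledgment status the article associates with a state.

    Returns None when Infrastructure Service reports errors for the task,
    or when the article doesn't map the state to an acknowledgment status.
    """
    if infrastructure_service_has_errors:
        return None
    return ACK_STATUS_BY_TASK_STATE.get(task_state)


if __name__ == "__main__":
    print(expected_ack_status("Executing"))
```

If the status shown on the **Infrastructure Jobs** tab differs from what this lookup returns for the current task state, that mismatch is worth checking against the Infrastructure Service health events.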
All completed or cancelled repair tasks for the cluster can be viewed by selecting **Completed Repair Tasks**. This provides a comprehensive list of repair tasks that have either successfully finished or were terminated.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/completed-repair-task-view.png" alt-text="Screenshot of the Completed Repair Task view in Service Fabric Explorer." lightbox="media/troubleshoot-service-fabric-repair-jobs/completed-repair-task-view.png":::

### Infrastructure Service and Repair Manager Service health check

To check the health of the Infrastructure Service or Repair Manager Service, select the service from the list and select **Health Evaluation**. This view shows whether the service is healthy, in a Warning state, or in an Error state, along with further details.

-:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-service-health.png" alt-text="Infrastructure Service Health view." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-service-health.png":::
+:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-service-health.png" alt-text="Screenshot of the Infrastructure Service Health view in Service Fabric Explorer." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-infrastructure-service-health.png":::

-:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-repairmanager-service-health.png" alt-text="Repair Manager Service Health view." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-repairmanager-service-health.png":::
+:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-repairmanager-service-health.png" alt-text="Screenshot of the Repair Manager Service Health view in Service Fabric Explorer." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-repairmanager-service-health.png":::

### Job throttling status for Infrastructure Service

To check if any job is being throttled for a specific Infrastructure Service, select the service > **Health Evaluation** > **All**. Look for health events related to job throttling. If a job is throttled, the job ID along with the reason for throttling is displayed.
:::image type="content" source="media/troubleshoot-service-fabric-repair-jobs/cluster-job-throttling-status.png" alt-text="Screenshot of the Job throttling view in Service Fabric Explorer." lightbox="media/troubleshoot-service-fabric-repair-jobs/cluster-job-throttling-status.png":::