You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: support/windows-server/backup-and-storage/troubleshoot-data-corruption-and-disk-errors.md
+27-21Lines changed: 27 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,5 +1,5 @@
1
1
---
2
-
title: Guidance for troubleshooting data corruption and disk errors
2
+
title: Guidance for Troubleshooting Data Corruption and Disk Errors
3
3
description: Provides guidance to help troubleshoot data corruption and disk errors in Windows.
4
4
ms.date: 01/15/2025
5
5
author: kaushika-msft
@@ -16,7 +16,7 @@ appliesto:
16
16
---
17
17
# Data corruption and disk errors troubleshooting guidance
18
18
19
-
Disk, file system, and storage issues in Windows Server environments can cause inaccessible drives, file or folder corruption, unexpected drive state changes, and application or backup failures. If not resolved promptly, these issues can compromise data integrity, disrupt service availability, and lead to downtime or data loss.
19
+
Disk, file system, and storage issues in Windows Server environments can cause inaccessible drives, file or folder corruption, unexpected drive state changes, and application or backup failures. If they're not resolved promptly, these issues can compromise data integrity, disrupt service availability, and cause downtime or data loss.
20
20
21
21
This article provides guidance to help you identify, diagnose, and repair data corruption and disk errors effectively.
22
22
@@ -25,18 +25,18 @@ This article provides guidance to help you identify, diagnose, and repair data c
25
25
- Before you perform any repairs or modify any disks, back up all critical data.
26
26
- Make sure that you have administrative permissions on the affected devices.
27
27
- Familiarize yourself with storage subsystem architecture and event log monitoring.
28
-
- Make sure that you use modern volume management tools. Avoid deprecated features such as dynamic disks
28
+
- Make sure that you use modern volume management tools. Avoid deprecated features, such as dynamic disks
29
29
30
30
> [!NOTE]
31
-
> This article describes commands you have to run at an administrative Windows command prompt or an administrative Windows PowerShell command prompt.
31
+
> This article discusses commands that you have to run at an administrative Windows command prompt or an administrative Windows PowerShell command prompt.
32
32
33
33
## Troubleshooting checklist
34
34
35
35
Use this checklist for systematic troubleshooting.
36
36
37
37
**Check drivers and firmware**
38
38
39
-
Make sure that the storage-related drivers and firmware are up to date. Consult your hardware vendor for the latest drivers and diagnostic tools, if necessary. This step includes drivers for components such as:
39
+
Make sure that the storage-related drivers and firmware are up to date. Consult your hardware vendor for the latest drivers and diagnostic tools, if it's necessary. This step includes drivers for components such as:
40
40
41
41
- iSCSI port
42
42
- RAID controller
@@ -46,15 +46,15 @@ Make sure that the storage-related drivers and firmware are up to date. Consult
46
46
47
47
**Scan the health of the storage system**
48
48
49
-
Use`chkdsk` in scan mode to look for potential storage system issues without making changes. Open a Windows Command Prompt window as an administrator, and then run the following command:
49
+
Run`chkdsk` in scan mode to look for potential storage system issues without making changes. Open a Windows Command Prompt window as an administrator, and then run the following command:
50
50
51
51
```
52
52
console chkdsk /scan
53
53
```
54
54
55
55
**Review the event log for relevant events**
56
56
57
-
The following Event IDs indicate that there's data corruption or a disk error:
57
+
The following event IDs indicate data corruption or a disk error:
58
58
59
59
- Event ID 55, `The file system structure on the disk is corrupt and unusable. Please run the chkdsk utility on the volume.`
60
60
- Event ID 98, `Volume C: (\Device\HarddiskVolume3) needs to be taken offline to perform a Full Chkdsk. Please run "CHKDSK /F" locally via the command line or run "REPAIR-VOLUME \<drive:>" locally or remotely via PowerShell.`
@@ -65,7 +65,7 @@ The following Event IDs indicate that there's data corruption or a disk error:
65
65
**Scan and repair NTFS volumes**
66
66
67
67
> [!NOTE]
68
-
> Resilient file system (ReFS) volumes can automatically fix corruption issues. You can still run `chkdsk` to scan them.
68
+
> Resilient file system (ReFS) volumes can automatically fix corruption issues. You can still run `chkdsk`if you want to scan the volumes.
69
69
70
70
1. To get detailed information about a volume, run the following command at a Windows command prompt:
71
71
@@ -76,7 +76,7 @@ The following Event IDs indicate that there's data corruption or a disk error:
76
76
> [!NOTE]
77
77
> In this command, \<RootPath> represents the drive letter of the drive root.
78
78
79
-
1. To verify if the volume is dirty, run the following command at a Windows command prompt:
79
+
1. To check whether the volume is dirty, run the following command at a Windows command prompt:
80
80
81
81
```console
82
82
fsutil dirty query <VolumePath>:
@@ -85,7 +85,7 @@ The following Event IDs indicate that there's data corruption or a disk error:
85
85
> [!NOTE]
86
86
> In this command, \<VolumePath> represents the drive letter.
87
87
88
-
1. If the result of the previous step indicates that the volume is dirty, schedule a maintenance window for the volume. The disk isn't accessible during the repair process. During the maintenance window, run the following command at the Windows command prompt:
88
+
1. If the results of the previous step indicate that the volume is dirty, schedule a maintenance window for the volume. The disk isn't accessible during the repair process. During the maintenance window, run the following command at the Windows command prompt:
89
89
90
90
```console
91
91
chkdsk /f /r
@@ -96,19 +96,24 @@ The following Event IDs indicate that there's data corruption or a disk error:
96
96
97
97
**Advanced troubleshooting**
98
98
99
-
If errors persist, follow these steps to further test the system and isolate the issue:
99
+
If errors persist, further test the system to isolate the issue. Follow these steps:
100
100
101
101
1. Uninstall any third-party disk management software, such as Diskeeper.
102
102
1. Remove or update filter drivers.
103
103
1. Switch to different types of drivers. For example, RAID controller drivers or monolithic drivers.
104
104
1. Check the multipath I/O configuration. For more information, see [Multipath I/O (MPIO) troubleshooting guidance](windows-server-mpio-troubleshooting.md).
105
-
1. To isolate an issue to specific hardware, remove individual disks from the cluster and then test the system.
105
+
1. To isolate an issue to specific hardware, remove individual disks from the cluster, and then test the system.
106
106
107
107
## Common issues and solutions
108
108
109
109
### Event ID 153: The IO operation at logical block address 123456 for Disk 2 was retried
110
110
111
-
Event ID 153 indicates that the storage subsystem is overloaded, which is causing requests to time out. Event ID 153 is similar to Event ID 129, but the difference is that Event ID 153 is logged when the Storport miniport driver (sometimes known as an adapter or HBA driver) times out a request, while Event ID 129 is logged when the Storport driver (*Storport.sys*) times out a request to the disk. Because of the way the Storport miniport driver and the Storport driver interact, Event ID 153 might not be accompanied by an error.
111
+
Event ID 153 indicates that the storage subsystem is overloaded. This condition causes requests to time out. Event ID 153 is similar to Event ID 129. The events differ as follows:
112
+
113
+
- Event ID 153 is logged when the Storport miniport driver (sometimes known as an adapter or HBA driver) times out a request.
114
+
- Event ID 129 is logged when the Storport driver (*Storport.sys*) times out a request to the disk.
115
+
116
+
Because of the manner in which the Storport miniport driver and the Storport driver interact, Event ID 153 might not be accompanied by an error message.
112
117
113
118
To fix this issue, you have to relieve the overload. Follow these steps:
114
119
@@ -118,16 +123,16 @@ To fix this issue, you have to relieve the overload. Follow these steps:
118
123
1. If the system traffic doesn't flow as expected, check for the following issues:
119
124
- iSCSI configuration issues, such as damaged cables, damaged network adapters, or network adapters that handle non-storage traffic and storage traffic.
120
125
- MPIO configuration issues, such as insufficient or incorrectly configured multipaths.
121
-
1. If the previous steps can't fix the timeout issue, contact your hardware vendor for information about your specific driver timeouts.
126
+
1. If the previous steps don't fix the timeout issue, contact your hardware vendor for information about your specific driver timeouts.
122
127
123
128
### Event ID 129: Reset to device, \Device\RaidPort1, was issued
124
129
125
-
Similar to Event ID 153, Event ID 129 indicates that the storage subsystem is overloaded, which is causing requests to time out. Event ID 129 is logged when the Storport driver (*Storport.sys*) times out a request to the disk. The event information includes the name of the storage adapter (HBA) driver (also known as the miniport driver) that's associated with the affected Storport driver.
130
+
Similar to Event ID 153, Event ID 129 indicates that the storage subsystem is overloaded. This condition causes requests to time out. Event ID 129 is logged when the Storport driver (*Storport.sys*) times out a request to the disk. The event information includes the name of the storage adapter (HBA) driver (also known as the miniport driver) that's associated with the affected Storport driver.
126
131
127
132
The following issues can cause this behavior:
128
133
129
134
- LUNs aren't responding.
130
-
- Hardware issues such as faulty SAN routers are causing request drops.
135
+
- Hardware issues such as faulty SAN routers cause request drops.
131
136
132
137
To fix this issue, follow these steps:
133
138
@@ -142,13 +147,14 @@ This issue most often occurs when something disrupts communication between the s
142
147
143
148
- A SAN fabric error
144
149
- A SCSI bus issue
145
-
- A disk that fails
146
-
- A user unplugs a disk while the system is running.
150
+
- A disk failure
151
+
- A user unplugging a disk while the system is running
147
152
148
153
To fix this issue, follow these steps:
149
154
150
155
1. Verify that the disk subsystem is healthy.
151
-
1. Check the state of the disk hardware, and check storage connections for disruptions.
156
+
1. Check the state of the disk hardware.
157
+
2. Check storage connections for disruptions.
152
158
153
159
### Event ID 55 and Event ID 98: Please run the chkdsk utility
154
160
@@ -158,7 +164,7 @@ NTFS events such as Event ID 55, 50, 140, and 98 indicate file system corruption
158
164
- I/O requests that the file system sends to the disk subsystem aren't completed successfully.
159
165
160
166
> [!NOTE]
161
-
> Because NTFS couldn't write data to the transaction log, this issue could affect the ability of NTFS to stop or roll back the operations in which the transaction data couldn't be written.
167
+
> Because NTFS can't write data to the transaction log, this issue could affect the ability of NTFS to stop or roll back the operations in which the transaction data couldn't be written.
162
168
163
169
To fix these issues, follow these steps:
164
170
@@ -178,6 +184,6 @@ To fix these issues, follow these steps:
178
184
179
185
1. Update third-party storage drivers or firmware.
180
186
181
-
1. If the issue persists, try switching to different types of drivers. For example, RAID controller drivers or monolithic drivers, or try rearranging hardware into various combinations.
187
+
1. If the issue persists, try switching to different kinds of drivers. For example, RAID controller drivers or monolithic drivers, or try rearranging hardware into various combinations.
0 commit comments