Skip to content

Commit 307c2ed

Browse files
authored
Update node-not-ready-then-recovers.md
Edit review per CI 2962
1 parent 163ee7d commit 307c2ed

1 file changed

Lines changed: 11 additions & 9 deletions

File tree

support/azure/azure-kubernetes/availability-performance/node-not-ready-then-recovers.md

Lines changed: 11 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,34 +1,36 @@
11
---
22
title: Node not ready but then recovers
3-
description: Troubleshoot scenarios in the status of an Azure Kubernetes Service (AKS) cluster node is Node Not Ready, but then the node recovers.
3+
description: Troubleshoot scenarios in which the status of an AKS cluster node is Node Not Ready, but then the node recovers.
44
ms.date: 12/09/2024
55
ms.reviewer: rissing, chiragpa, momajed, v-leedennis
66
ms.service: azure-kubernetes-service
7-
#Customer intent: As an Azure Kubernetes user, I want to prevent the Node Not Ready status for nodes that later recover so that I can avoid future errors within an Azure Kubernetes Service (AKS) cluster.
7+
#Customer intent: As an Azure Kubernetes user, I want to prevent the Node Not Ready status for nodes that later recover so that I can avoid future errors within an AKS cluster.
88
ms.custom: sap:Node/node pool availability and performance
99
---
1010
# Troubleshoot Node Not Ready failures that are followed by recoveries
1111

12-
This article provides a guide to troubleshoot and resolve node "Not ready" issues in AKS clusters. When a node enters a "Not Ready" state, it can disrupt the application's functionality, causing it to stop responding. Typically, the node recovers automatically after a short period. However, to prevent recurring issues and maintain a stable environment, it's important to understand the underlying causes and implementing effective resolutions.
12+
This article provides a guide to troubleshoot and resolve "Node Not Ready" issues in Azure Kubernetes Service (AKS) clusters. When a node enters a "Not Ready" state, it can disrupt the application's functionality and cause it to stop responding. Typically, the node recovers automatically after a short period. However, to prevent recurring issues and maintain a stable environment, it's important to understand the underlying causes to be able to implement effective resolutions.
1313

1414
## Cause
1515

16-
There are several scenarios that could lead to this issue:
16+
There are several scenarios that could cause a "Not Ready" state to occur:
1717

18-
- One possible reason for a node entering a "Not Ready" state is the unavailability of the API server, which causes the readiness probe to fail. This failure prevents the pod from being attached to the service, resulting in traffic not being forwarded to the pod instance.
18+
- The unavailability of the API server. This causes the readiness probe to fail. This prevents the pod from being attached to the service so that traffic is no longer forwarded to the pod instance.
1919

20-
- Virtual machine (VM) host faults occur. To determine whether VM host faults occurred, check the following information sources:
20+
- Virtual machine (VM) host faults. To determine whether VM host faults occurred, check the following information sources:
2121
- [AKS diagnostics](/azure/aks/concepts-diagnostics)
2222
- [Azure status](https://status.azure.com/)
2323
- Azure notifications (for any recent outages or maintenance periods)
2424

2525
## Resolution
2626

27-
Check the API server availability by running the following command: `kubectl get apiservices`.
27+
Check the API server availability by running the following command:
2828

29-
Ensure that the readiness probe is correctly configured in the deployment YAML file.
29+
`kubectl get apiservices`
3030

31-
For further steps check here: [Basic troubleshooting of Node Not Ready failures](node-not-ready-basic-troubleshooting.md).
31+
Make sure that the readiness probe is correctly configured in the deployment YAML file.
32+
33+
For further steps, see [Basic troubleshooting of Node Not Ready failures](node-not-ready-basic-troubleshooting.md).
3234

3335
## Prevention
3436

0 commit comments

Comments
 (0)