You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#Customer intent: As an Azure Kubernetes user, I want to prevent the Node Not Ready status for nodes that later recover so that I can avoid future errors within an Azure Kubernetes Service (AKS) cluster.
@@ -13,29 +13,30 @@ This article helps troubleshoot scenarios in which a node within a Microsoft Azu
13
13
14
14
## Symptoms
15
15
16
-
You notice that your application stops responding while the node is reporting that it has a Not Ready status. However, the node recovers automatically, and now, it's looking for a root cause analysis (RCA).
16
+
Maintaining node readiness in Azure Kubernetes Service (AKS) clusters is crucial for ensuring application availability and performance. When a node enters a "Not Ready" state, it can disrupt the application's functionality, causing it to stop responding. Although the node typically recovers automatically after a short period, understanding the underlying causes and implementing effective resolutions is essential to prevent recurring issues and maintain a stable environment. This document provides a comprehensive guide to troubleshooting and resolving node readiness issues in AKS clusters.
17
17
18
18
## Cause
19
19
20
-
Possible causes of this issue include the following scenarios:
20
+
There are several scenarios that could lead to this issue:
21
21
22
-
- The API server isn't available, and you're using a readiness probe for the deployment.
23
-
24
-
If a pod is running but isn't ready, that situation means that the readiness probe is failing. If the readiness probe fails, the pod isn't attached to the service, and traffic isn't forwarded to the pod instance.
22
+
- One possible reason for a node entering a "Not Ready" state is the unavailability of the API server, which causes the readiness probe to fail. This failure prevents the pod from being attached to the service, resulting in traffic not being forwarded to the pod instance.
25
23
26
24
- Virtual machine (VM) host faults occur. To determine whether VM host faults occurred, check the following information sources:
0 commit comments