Create blog post on AKS NAP disruption management#5685
Conversation
Added a blog post on managing disruption with AKS Node Auto-Provisioning, covering best practices for Pod Disruption Budgets and consolidation.
There was a problem hiding this comment.
Pull request overview
Adds a new AKS blog post focused on managing voluntary disruption when using Node Auto-Provisioning (NAP), with guidance on Pod Disruption Budgets (PDBs), consolidation controls, disruption budgets, and maintenance windows.
Changes:
- Added a new blog post covering NAP disruption concepts and common pitfalls.
- Included YAML examples for PDBs and NodePool disruption settings (consolidation policy, budgets, schedules).
- Added operational guidance on observability and drift/image update considerations.
Co-authored-by: Copilot <[email protected]>
Clarified descriptions of Pod Disruption Budgets and their impact on voluntary evictions. Improved wording for clarity and corrected minor typos.
sabbour
left a comment
There was a problem hiding this comment.
Content review: solid technical depth on a high-value topic. The main blockers are em dashes throughout (banned by style guide), third-person voice in the opening, a cluster of typos in Part 4, and a duplicated troubleshooting section. Nine inline comments with specific fixes.
sabbour
left a comment
There was a problem hiding this comment.
Content review: solid technical depth on a high-value topic. The main blockers are em dashes throughout (banned by style guide), third-person voice in the opening, a cluster of typos in Part 4, and a duplicated troubleshooting section. Nine inline comments with specific fixes.
Co-authored-by: Copilot <[email protected]> Co-authored-by: Ahmed Sabbour <[email protected]>
Updated formatting and clarified sections on NAP disruption best practices, including node disruption budgets and observability.
Updated guidance on managing NAP node disruptions, including operational takeaways and common pitfalls with suggested fixes.
Updated questions and best practices regarding NAP disruption, including links and clarifications.
Updated link to the NAP scheduling fundamentals blog post.
Added a note about Spot VMs and their eviction behavior.
Updated the description to include 'in production' for clarity.
Updated the blog post to enhance the description of managing node disruption with NAP, including details on spot nodes and disruption controls. Improved clarity on Kubernetes-native concepts and added examples for better understanding.
Fixed typos and improved clarity in the NAP section regarding Azure Spot VMs and Node Problem Detector.
Updated the description and corrected a typo in the blog post about AKS Node Auto-Provisioning. Adjusted the link to the NAP scheduling fundamentals blog post for accuracy.
Added a blog post on managing disruption with AKS Node Auto-Provisioning, covering best practices for Pod Disruption Budgets and consolidation.