CloudsArk
Troubleshooting Kubernetes

Fix Evicted Pod Kubernetes

Learn practical fix evicted pod kubernetes with kubectl commands, manifests, verification steps, common mistakes, and production-focused guidance.

Fix Evicted Pod Kubernetes

Introduction

This guide explains fix evicted pod kubernetes with practical kubectl commands, realistic output, and production-focused checks. Use this workflow when an application is failing and you need evidence before changing manifests.

Symptoms

You may see pods stuck in a waiting state, failed rollouts, 4xx or 5xx responses, missing endpoints, failed probes, denied API calls, or repeated events in the namespace.

Common Causes

Common causes include node conditions, taints, resource requests, affinity, topology, and scheduler events. Always confirm with events and logs before editing the workload.

Step 1: Check Current State

kubectl get nodes -o wide
kubectl describe node worker-1

Expected output:

NAME       STATUS   ROLES    AGE   VERSION
worker-1   Ready    <none>   14d   v1.30.2

Step 2: Inspect Events and Logs

kubectl describe node worker-1
kubectl get pods -A --field-selector=status.phase=Pending

Events show scheduler, kubelet, image pull, mount, and probe errors. Previous logs are critical when the container restarts quickly.

Step 3: Verify the Manifest or Runtime Setting

kubectl describe pod pending-pod -n app
kubectl get pod web-7d9f8c-abcde -n app -o yaml

Check selectors, image names, probes, resource limits, service accounts, volumes, and namespace references.

Step 4: Apply the Fix

apiVersion: apps/v1
kind: Deployment
metadata:
  name: web
  namespace: app
spec:
  replicas: 3
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
      - name: web
        image: nginx:1.27
        ports:
        - containerPort: 80

Apply only the corrected field, then let the controller reconcile the desired state.

kubectl apply -f manifest.yaml
kubectl rollout status deployment/web -n app

Step 5: Confirm Recovery

kubectl get pods -n app
kubectl get events -n app --sort-by=.lastTimestamp

Common Mistakes

  • Deleting pods before reading the events that explain why they failed.
  • Changing probes, resources, images, and RBAC at the same time.
  • Troubleshooting only the pod while ignoring the service, PVC, node, or service account.

Quick Checklist

  • Check pod status and restart count.
  • Read describe output and recent events.
  • Inspect current and previous container logs.
  • Verify dependent objects such as Secrets, ConfigMaps, PVCs, Services, and RBAC.
  • Apply one fix and watch the rollout.

Summary

Treat fix evicted pod kubernetes as an evidence-driven debugging task. Events identify the failing layer, logs explain application behavior, and rollout checks prove the fix worked.