Posted by D. *_*ard

Autopilot GKE cluster insufficient cpu/mem

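I'm trying to deploy an Autopilot cluster on GKE, but when I try to deploy a pod I run into the insufficient CPU/memory errors shown below. `kubectl get nodes` returns 3 nodes, each with only about 0.5 of available CPU and about the same (0.5 GB) of memory, so they are very small. I'm trying to run GPU-heavy jobs, so I expected GKE to scale up, but it doesn't; it just reports insufficient resources. What am I doing wrong?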

  Warning  FailedScheduling   27m (x5 over 31m)      gke.io/optimize-utilization-scheduler  0/2 nodes are available: 2 Insufficient cpu, 2 Insufficient memory.
  Warning  FailedScheduling   26m                    gke.io/optimize-utilization-scheduler  0/3 nodes are available: 1 node(s) had taint {node.kubernetes.io/not-ready: }, that the pod didn't tolerate, 2 Insufficient cpu, 2 Insufficient memory.
  Normal   TriggeredScaleUp   26m                    cluster-autoscaler                     pod triggered scale-up: [{https://www.googleapis.com/compute/v1/projects/picdmo-342711/zones/us-central1-c/instanceGroups/gk3-picdmo-nap-1wcisjk4-2ba03e97-grp 0->1 (max: 1000)}]
  Normal   NotTriggerScaleUp  25m (x6 over 30m)      cluster-autoscaler …
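
To make the scheduler events above easier to compare across retries, here is a minimal sketch that splits a `FailedScheduling` message into per-reason node counts. The function name `parse_failed_scheduling` is hypothetical, and the message layout is assumed only from the two events pasted above, not from any documented format.

```python
import re


def parse_failed_scheduling(message: str) -> dict:
    """Parse '0/N nodes are available: <count> <reason>, ...' into counts.

    Assumption: the layout matches the events quoted in this post.
    """
    head = re.match(r"(\d+)/(\d+) nodes are available: (.*)", message)
    if head is None:
        raise ValueError("not a FailedScheduling summary message")
    reasons = {}
    # Each reason is '<count> <text>'; the next reason starts at ', <count> '.
    # A reason may itself contain ', ' (see the taint message), so we only
    # split where a comma is followed by a number.
    for count, text in re.findall(r"(\d+) (.+?)(?=, \d+ |\.?$)", head.group(3)):
        reasons[text.strip()] = int(count)
    return {
        "available": int(head.group(1)),
        "total": int(head.group(2)),
        "reasons": reasons,
    }


if __name__ == "__main__":
    msg = "0/2 nodes are available: 2 Insufficient cpu, 2 Insufficient memory."
    print(parse_failed_scheduling(msg))
```

For the first event, this reports that both existing nodes were rejected for CPU and for memory, which suggests the pod's requests exceed what any current node can offer.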

google-cloud-platform google-kubernetes-engine
