无法使用自定义容器 GCP 进行部署

gkv*_*gkv 5 google-cloud-platform google-ai-platform

我的镜像是 11.4 GB。我正在尝试以下命令:

gcloud beta ai-platform versions create v1 \
  --region=$REGION \
  --model=$MODEL \
  --machine-type=n1-highmem-2 \
  --image=$REGION-docker.pkg.dev/$PROJECT_ID/$REPO_NAME/$IMAGE_NAME \
  --ports=8000 
Run Code Online (Sandbox Code Playgroud)

它失败了:

Error: model server never became ready. Please validate that your model file or container configuration are valid, Error details: model server never became ready: status: "False" last_transition_time { seconds: 1603932493 } reason: "ContainersNotReady" message: "containers with unready status: [online-prediction-be-test85b70d-v1fb1b0606]" type: "ContainersReady"
Run Code Online (Sandbox Code Playgroud)

即使我可以使用docker run. 我需要改变什么?

当我在未指定区域且详细程度设置为 DEBUG 的情况下运行时,我得到:

RecursionError: maximum recursion depth exceeded while calling a Python object
ERROR: gcloud crashed (RecursionError): maximum recursion depth exceeded while calling a Python object

The Python stack trace is massive (last couple of lines):
  File "/googlecloudsdk/core/log.py", line 484, in ShowStructuredOutput
    show_messages = properties.VALUES.core.show_structured_logs.Get()
  File "/googlecloudsdk/core/properties.py", line 2380, in Get
    value = _GetProperty(self, named_configs.ActivePropertiesFile.Load(),
  File "/googlecloudsdk/core/properties.py", line 2679, in _GetProperty
    value = _GetPropertyWithoutDefault(prop, properties_file)
  File "/google-cloud-sdk/lib/googlecloudsdk/core/properties.py", line 2711, in _GetPropertyWithoutDefault
    value = _GetPropertyWithoutCallback(prop, properties_file)
  File "/google-cloud-sdk/lib/googlecloudsdk/core/properties.py", line 2741, in _GetPropertyWithoutCallback
    for value_flags in reversed(invocation_stack):

Run Code Online (Sandbox Code Playgroud)

Tsv*_*abo 0

从您收到的错误消息中无法理解任何内容,但是,我猜 gcloud 无法对您的容器运行运行状况检查,您可以使用以下命令指定它并--health-route查看吗?

  • 这不是一个答案。请利用评论功能。 (2认同)