Python Apache Beam Dataflow worker-startup error: Failed to install packages: failed to install SDK: exit status 2

And*_*idy 5 python python-2.7 google-cloud-platform google-cloud-dataflow apache-beam

Seen earlier:

RuntimeError: IOError: [Errno 2] No such file or directory:
'/beam-temp-andrew_mini_vocab-..../......andrew_mini_vocab' [while running .....]

In my Apache Beam Python Dataflow job, I see this error logged:

A setup error was detected in __. Please refer to the worker-startup
log for detailed information.

I found the worker-startup log, and the error in its payload is:

Failed to install packages: failed to install SDK: exit status 2

The error is not specific enough for me to debug. Any insight into which SDK is failing to load? My job's imports are very basic:

from __future__ import absolute_import
from __future__ import division
import argparse
import logging
import re
import apache_beam as beam
from apache_beam.io import WriteToText
from apache_beam.options.pipeline_options import PipelineOptions
from apache_beam.options.pipeline_options import SetupOptions
from apache_beam.pvalue import AsDict

Pab*_*blo 0

Check your pip version with pip -V, and try updating it.
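If you would rather do that check from a script, here is a minimal sketch. The helper names parse_pip_version and installed_pip_version are hypothetical, made up for this illustration. It only reports which pip the current interpreter sees on the machine you run it on. It does not inspect the Dataflow workers, and it is not a confirmed fix for the exit status 2 error.

```python
import re
import subprocess
import sys


def parse_pip_version(output):
    """Return the version from `pip --version` output as a tuple of ints, or None."""
    # Typical output: "pip 9.0.1 from /path/to/site-packages (python 2.7)"
    match = re.match(r"pip (\d+(?:\.\d+)*)", output.strip())
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def installed_pip_version():
    """Ask the current interpreter's pip for its version; None if pip cannot be run."""
    try:
        raw = subprocess.check_output([sys.executable, "-m", "pip", "--version"])
    except (OSError, subprocess.CalledProcessError):
        # pip is missing or failed to start for this interpreter.
        return None
    return parse_pip_version(raw.decode("utf-8", "replace"))
```

Calling installed_pip_version() in the environment you submit the job from returns a tuple you can compare with the latest pip release. If it is old, python -m pip install --upgrade pip is the usual way to update it.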

If that doesn't help, please comment on this question :)