ApplicationError:7处理mapreduce worker写入Google云端存储时的处理

Ips*_*gle 5 python google-app-engine mapreduce mapper google-cloud-storage

刚刚完成一个大型的Appengine mapreduce任务,我的许多分片都在终点线上被卡住了.这是设置:

    filenames = yield mapreduce_pipeline.MapperPipeline(
            'example mapper name',
            'main.MyMapper',
            input_reader_spec='mapreduce.input_readers.DatastoreInputReader',
            output_writer_spec='mapreduce.output_writers.FileOutputWriter',
            params={
                'input_reader':{
                    'entity_kind':'models.MyModel'
                },
                'output_writer':{
                    'filesystem':'gs',
                    'mime_type':'text/csv',
                    'gs_bucket_name':'myBucket',
                    'output_sharding':'input'
                }
            },
            shards=DUMP_SHARDS
            )
Run Code Online (Sandbox Code Playgroud)

我正在并行运行其中的3个,每个都有16个分片.一个映射器完成没有问题,另外两个映射器在其14和9个分片上取得了成功.

剩下的碎片都完全是石墙,然后返回UnknownError: ApplicationError: 7.(本文末尾的完整堆栈跟踪.)

请注意,映射器正在尝试写入Google云端存储.执行此写操作的位发生错误.

狩猎了一段时间后,我发现,在google.appengine.runtime.apiproxy(这似乎是有问题的代理),该错误7 OTHER_ERROR.

我一直在重试这些最终任务(从任务队列中)大约3个小时,自从这些错误开始以来没有一个成功; 无论发生什么,它都完全陷入困境.我也尝试停止运行的所有实例,以防它是一些奇怪的本地状态,但没有改变......

这是完整的堆栈跟踪:

I 2012-12-13 15:40:23.909
Processing done for shard 14 of job '1582444192075C233F6AA'
E 2012-12-13 15:40:23.969
ApplicationError: 7 
Traceback (most recent call last):
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1511, in __call__
    rv = self.handle_exception(request, response, e)
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1505, in __call__
    rv = self.router.dispatch(request, response)
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1253, in default_dispatcher
    return route.handler_adapter(request, response)
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1077, in __call__
    return handler.dispatch()
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 547, in dispatch
    return self.handle_exception(e, self.app.debug)
  File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 545, in dispatch
    return method(*args, **kwargs)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/base_handler.py", line 65, in post
    self.handle()
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/handlers.py", line 231, in handle
    tstate.output_writer.finalize(ctx, shard_state.shard_number)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/output_writers.py", line 631, in finalize
    files.finalize(self._filename)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 568, in finalize
    f.close(finalize=True)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 291, in close
    self._make_rpc_call_with_retry('Close', request, response)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 427, in _make_rpc_call_with_retry
    _make_call(method, request, response)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 252, in _make_call
    _raise_app_error(e)
  File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 186, in _raise_app_error
    raise UnknownError(e)
UnknownError: ApplicationError: 7 
Run Code Online (Sandbox Code Playgroud)

cas*_*rtm 2

我刚刚遇到了类似的问题。我认为这具体是一个写入谷歌云存储的问题。

我在这里获得了一些见解:Google App Engine Issue: 8775

摘要(TLDR):

  • 可能是一次性网络问题。
  • 可能是计费问题。
  • 结果:如果问题仍未消失并且解决结算问题也不起作用,请联系 Google 支持人员。