Ips*_*gle 5 python google-app-engine mapreduce mapper google-cloud-storage
刚刚完成一个大型的Appengine mapreduce任务,我的许多分片都在终点线上被卡住了.这是设置:
filenames = yield mapreduce_pipeline.MapperPipeline(
'example mapper name',
'main.MyMapper',
input_reader_spec='mapreduce.input_readers.DatastoreInputReader',
output_writer_spec='mapreduce.output_writers.FileOutputWriter',
params={
'input_reader':{
'entity_kind':'models.MyModel'
},
'output_writer':{
'filesystem':'gs',
'mime_type':'text/csv',
'gs_bucket_name':'myBucket',
'output_sharding':'input'
}
},
shards=DUMP_SHARDS
)
Run Code Online (Sandbox Code Playgroud)
我正在并行运行其中的3个,每个都有16个分片.一个映射器完成没有问题,另外两个映射器在其14和9个分片上取得了成功.
剩下的碎片都完全是石墙,然后返回UnknownError: ApplicationError: 7.(本文末尾的完整堆栈跟踪.)
请注意,映射器正在尝试写入Google云端存储.执行此写操作的位发生错误.
狩猎了一段时间后,我发现,在google.appengine.runtime.apiproxy(这似乎是有问题的代理),该错误7 OTHER_ERROR.
我一直在重试这些最终任务(从任务队列中)大约3个小时,自从这些错误开始以来没有一个成功; 无论发生什么,它都完全陷入困境.我也尝试停止运行的所有实例,以防它是一些奇怪的本地状态,但没有改变......
这是完整的堆栈跟踪:
I 2012-12-13 15:40:23.909
Processing done for shard 14 of job '1582444192075C233F6AA'
E 2012-12-13 15:40:23.969
ApplicationError: 7
Traceback (most recent call last):
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1511, in __call__
rv = self.handle_exception(request, response, e)
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1505, in __call__
rv = self.router.dispatch(request, response)
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1253, in default_dispatcher
return route.handler_adapter(request, response)
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 1077, in __call__
return handler.dispatch()
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 547, in dispatch
return self.handle_exception(e, self.app.debug)
File "/base/python27_runtime/python27_lib/versions/third_party/webapp2-2.3/webapp2.py", line 545, in dispatch
return method(*args, **kwargs)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/base_handler.py", line 65, in post
self.handle()
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/handlers.py", line 231, in handle
tstate.output_writer.finalize(ctx, shard_state.shard_number)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/output_writers.py", line 631, in finalize
files.finalize(self._filename)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 568, in finalize
f.close(finalize=True)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 291, in close
self._make_rpc_call_with_retry('Close', request, response)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 427, in _make_rpc_call_with_retry
_make_call(method, request, response)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 252, in _make_call
_raise_app_error(e)
File "/base/data/home/apps/myserver/myinstance.363844686987482417/mapreduce/lib/files/file.py", line 186, in _raise_app_error
raise UnknownError(e)
UnknownError: ApplicationError: 7
Run Code Online (Sandbox Code Playgroud)
我刚刚遇到了类似的问题。我认为这具体是一个写入谷歌云存储的问题。
我在这里获得了一些见解:Google App Engine Issue: 8775
摘要(TLDR):
| 归档时间: |
|
| 查看次数: |
421 次 |
| 最近记录: |