I am using the following command to download a list of PDF files:
wget -i list.txt -A .pdf
Some of the PDF files download correctly, but others do not. When I checked the log, I saw the following output:
--2013-04-09 11:25:42-- http://amazon.com/111.pdf
Reusing existing connection to amazon.com:80.
HTTP request sent, awaiting response... 200 No headers, assuming HTTP/0.9
Length: unspecified
Saving to: `111.pdf'
[ <=> ] 1,045 --.-K/s in 2m 9s
2013-04-09 11:27:51 (8.11 B/s) - Read error at byte 1045 (Connection reset by peer).Retrying.
--2013-04-09 11:27:52-- (try: 2) http://amazon.com/111.pdf
Connecting to amazon.com (amazon.com)|00.00.55.888|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 2680728 (2.6M) [application/pdf]
Saving …