Resume incomplete download #11180

yichi-yang · 2022-06-10T22:18:24Z

Overview

This PR adds a feature that resumes download if the downloaded file is incomplete (e.g. when the Internet connection is poor). More specifically, if :

The initial response includes a Content-Length header,
The server sends fewer bytes than indicated in the Content-Length header,
The user enables the auto resumption feature using --incomplete-downloads=resume,

the downloader will make new requests and attempt to resume download using a Range header. If the initial response includes an ETag (preferred) or Date header, the downloader will ask the server to resume download only when it is safe (i.e., the file hasn't changed since the initial request) using an If-Range header.

If the server responds with a 200 (e.g. if the server doesn't support partial content or can't check if the file has changed), the downloader will restart the download (i.e. start from the very first byte); if the server responds with a 206 Partial Content, the downloader will resume the download from the partially downloaded file.

Note if the server always responds with 200, the downloader can potentially get stuck and waste unreasonable amounts of bandwidth downloading the first few bytes over and over again. Therefore, a retry limit is introduced to avoid this case.

If not enough bytes are received and auto resumption is disabled or the retry limit is exceeded, the downloader will clean up the incomplete file and fail with an exception.

Flags

To control the auto resumption behavior, two new flags are added:

--incomplete-downloads=resume,/discard controls whether the auto resumption feature is enabled (defaults to discard);
--incomplete-download-retries limits the maximum number of retries (defaults to 5).

Towards #4796

src/pip/_internal/network/download.py

yichi-yang · 2022-07-17T04:38:23Z

src/pip/_internal/network/download.py

+        if total_length is not None and bytes_received < total_length:
+            if self._resume_incomplete:
+                logger.critical(
+                    "Failed to download %s after %d resumption attempts.",
+                    link,
+                    self._resume_attempts,
+                )
+            else:
+                logger.critical(
+                    "Failed to download %s."
+                    " Set --incomplete-downloads=resume to automatically"
+                    "resume incomplete download.",
+                    link,
+                )
+            os.remove(filepath)
+            raise RuntimeError("Incomplete download")


Not quite sure what to do here. I don't think throwing an exception (L244) and printing a long (and useless) stack trace is user-friendly, but I can't think of a better alternative. Maybe we should just reuse the the same log messages above so it's at least helpful?
I think throwing an (subclassed) DiagnosticPipError here might be a good idea? We can let the user know that:

the download is incomplete

the incomplete file has been cleaned up

they can use --incomplete-downloads=resume to enable the feature if they haven't already

they can modify the retry limit with --incomplete-download-retries.

pip/src/pip/_internal/exceptions.py

Lines 54 to 63 in e5898ab

class DiagnosticPipError(PipError):

"""An error, that presents diagnostic information to the user.

This contains a bunch of logic, to enable pretty presentation of our error

messages. Each error gets a unique reference. Each error can also include

additional context, a hint and/or a note -- which are presented with the

main error message in a consistent style.

This is adapted from the error output styling in `sphinx-theme-builder`.

"""

Implemented the above in a separate commit: a082517.

uranusjr · 2022-07-17T19:19:04Z

I saw that a previous version also queries Accept-Ranges to determine whether to go into using Range for resuming. Was it removed because it’s considered not a good idea?

uranusjr · 2022-07-17T19:22:33Z

I wonder (discussions needed!) if we should aggressively set the default to resume. The logic seems safe enough and without much overhead (and the worse case scenario can only be caused by incorrect server implementation) that we should make the benefit opt-out instead opt-in.

yichi-yang · 2022-07-17T19:49:53Z

I saw that a previous version also queries Accept-Ranges to determine whether to go into using Range for resuming. Was it removed because it’s considered not a good idea?

As per RFC 7233 servers that don't support Range will simply ignore it and respond with 200 as if it is a normal GET request. Additionally, if the server supports Range but the document has changed since the initial request, we need to somehow figure that out and restart instead of resume (that's what the If-Range header does).

So to answer your question, checking Accept-Ranges is kind of redundant here (it's up to the server to decide if a partial response is appropriate). I don't think we should check Accept-Ranges unless there's a very good reason to do so.

yichi-yang · 2022-07-17T20:28:29Z

I think it might make more sense if we only have a single --incomplete-download-retries flag - setting it to 0 or a negative number disables the feature. This is more similar to how --retries works currently.

uranusjr · 2022-07-18T03:46:25Z

Ah yes that makes sense (perhaps with a shorter option name though)

yichi-yang · 2022-07-18T03:54:03Z

(perhaps with a shorter option name though)

Suggestions are welcome :)
Though we need to make sure users won't confuse this with --retries.

CTimmerman · 2022-07-19T20:26:12Z

Just use --retries. It's the same as this with 0 bytes.

yichi-yang · 2022-07-19T20:35:43Z

Just use --retries. It's the same as this with 0 bytes.

Could you elaborate? I don't think --retries on its own solves the issue #4796 here.

uranusjr · 2022-07-20T09:35:26Z

I think it’s “just use --retries to control resuming”, in the sense that right now a retry is conceptually the same as resuming from a previous download of zero bytes.

yichi-yang · 2022-07-20T23:25:43Z

I think it’s “just use --retries to control resuming”, in the sense that right now a retry is conceptually the same as resuming from a previous download of zero bytes.

Ah I see. Our resume function is conceptually similar to --retries, but implementation-wise --retries are handled by requests/urllib3. I don't really know a good way to combine the two. Consider the case where a user sets --retries 5, does it mean requests can make 5 retries AND pip can try resuming 5 times (kind of weird), or requests and pip can retry a total of 5 times (hard to implement, retry now means two different things)?

On a side note, part of the problem should get fixed upstream soon (psf/requests#4956, psf/requests#6092), but that doesn't change the fact that pip has to make additional partial requests to resume.

uranusjr · 2022-07-21T00:40:35Z

Yeah that’s the problem, we can’t really use the same counter between connection retries and resumes. But I’m guessing it’s not that big a problem and we could intentionally implement the wrong behaviour and find a way to fix that later… For example for retries=5 we would do 5 connection retries and 5 resume retries, and most of the time the user wouldn’t tell the difference, since it’s relatively rare to see a connection fails halfway, and when resuming starting to fail to connect. And even in that case users might be fine to do a few more retries, who knows. I think I’d prefer to get the user interface (CLI flags) right, and “fix” the relatively minor behaviour later (if ever).

yichi-yang · 2022-07-21T01:05:07Z

Yeah that’s the problem, we can’t really use the same counter between connection retries and resumes. But I’m guessing it’s not that big a problem and we could intentionally implement the wrong behaviour and find a way to fix that later… For example for retries=5 we would do 5 connection retries and 5 resume retries, and most of the time the user wouldn’t tell the difference, since it’s relatively rare to see a connection fails halfway, and when resuming starting to fail to connect. And even in that case users might be fine to do a few more retries, who knows. I think I’d prefer to get the user interface (CLI flags) right, and “fix” the relatively minor behaviour later (if ever).

Good point. But that's an easy change so I think we can wait a bit and see what others think.

CTimmerman · 2022-07-23T11:51:07Z

I think the default 5 connection retries should include resuming, and that resuming should not start from 0 by default, but would not mind reusing the user's --retries value if the actual one is not available. Resuming should be an upstream feature if retrying is.

horacehoff · 2022-09-02T22:23:52Z

Imo the download shouldn't resume by starting by 0. Let it resume using the already-partially-downloaded file.

yichi-yang · 2022-09-02T22:44:13Z

Imo the download shouldn't resume by starting by 0. Let it resume using the already-partially-downloaded file.

The implementation in this PR resumes from partially downloaded file when possible. There are cases where resuming is not possible, e.g. when the file has changed on the server after we started the download, and we have to start downloading from scratch again.

Is that the behavior you want?

horacehoff · 2022-09-03T08:53:44Z

Oh right, sorry. I misunderstood exactly what this PR does and from this new point of view I have nothing to say as it sounds much useful to many people who have poor/low bandwidth.

JD91B · 2022-12-14T15:54:13Z

I was having problems no being able to download large files with pip because of my slow internet but this helped so much. Thank you, it was really useful.

yichi-yang mentioned this pull request Jun 10, 2022

[Improvement] Pip could resume download package at halfway the connection is poor #4796

Open

yichi-yang force-pushed the feature-resume-download branch from 03859cb to c467a5c Compare June 10, 2022 22:31

q0w reviewed Jun 13, 2022

View reviewed changes

src/pip/_internal/network/download.py Outdated Show resolved Hide resolved

yichi-yang force-pushed the feature-resume-download branch 3 times, most recently from 25cab52 to c869789 Compare July 17, 2022 01:25

yichi-yang marked this pull request as ready for review July 17, 2022 04:25

yichi-yang commented Jul 17, 2022

View reviewed changes

Add support to resume incomplete download

d655669

yichi-yang force-pushed the feature-resume-download branch from c869789 to d655669 Compare July 17, 2022 17:22

Better incomplete download error message

a082517

yichi-yang force-pushed the feature-resume-download branch from 93b9d14 to a082517 Compare July 17, 2022 19:06

github-actions bot added the needs rebase or merge PR has conflicts with current master label Jan 18, 2023

zweger mentioned this pull request Aug 23, 2023

interrupted download reports as hash failure #11153

Open

1 task

njzjz mentioned this pull request Apr 19, 2024

Repeated timeouts in GitHub Actions fetching wheel for large packages astral-sh/uv#1912

Closed

ichard26 mentioned this pull request May 7, 2024

Continue downloads after network error. #12677

Closed

1 task

gmargaritis mentioned this pull request Oct 4, 2024

Introduce resumable downloads with --resume-retries #12991

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resume incomplete download #11180

Resume incomplete download #11180

yichi-yang commented Jun 10, 2022 •

edited

Loading

yichi-yang Jul 17, 2022 •

edited

Loading

yichi-yang Jul 17, 2022 •

edited

Loading

uranusjr commented Jul 17, 2022

uranusjr commented Jul 17, 2022

yichi-yang commented Jul 17, 2022 •

edited

Loading

yichi-yang commented Jul 17, 2022

uranusjr commented Jul 18, 2022

yichi-yang commented Jul 18, 2022

CTimmerman commented Jul 19, 2022 •

edited

Loading

yichi-yang commented Jul 19, 2022

uranusjr commented Jul 20, 2022

yichi-yang commented Jul 20, 2022

uranusjr commented Jul 21, 2022

yichi-yang commented Jul 21, 2022

CTimmerman commented Jul 23, 2022 •

edited

Loading

horacehoff commented Sep 2, 2022

yichi-yang commented Sep 2, 2022 •

edited

Loading

horacehoff commented Sep 3, 2022

JD91B commented Dec 14, 2022

	class DiagnosticPipError(PipError):
	"""An error, that presents diagnostic information to the user.

	This contains a bunch of logic, to enable pretty presentation of our error
	messages. Each error gets a unique reference. Each error can also include
	additional context, a hint and/or a note -- which are presented with the
	main error message in a consistent style.

	This is adapted from the error output styling in `sphinx-theme-builder`.
	"""

Resume incomplete download #11180

Are you sure you want to change the base?

Resume incomplete download #11180

Conversation

yichi-yang commented Jun 10, 2022 • edited Loading

Overview

Flags

yichi-yang Jul 17, 2022 • edited Loading

Choose a reason for hiding this comment

yichi-yang Jul 17, 2022 • edited Loading

Choose a reason for hiding this comment

uranusjr commented Jul 17, 2022

uranusjr commented Jul 17, 2022

yichi-yang commented Jul 17, 2022 • edited Loading

yichi-yang commented Jul 17, 2022

uranusjr commented Jul 18, 2022

yichi-yang commented Jul 18, 2022

CTimmerman commented Jul 19, 2022 • edited Loading

yichi-yang commented Jul 19, 2022

uranusjr commented Jul 20, 2022

yichi-yang commented Jul 20, 2022

uranusjr commented Jul 21, 2022

yichi-yang commented Jul 21, 2022

CTimmerman commented Jul 23, 2022 • edited Loading

horacehoff commented Sep 2, 2022

yichi-yang commented Sep 2, 2022 • edited Loading

horacehoff commented Sep 3, 2022

JD91B commented Dec 14, 2022

yichi-yang commented Jun 10, 2022 •

edited

Loading

yichi-yang Jul 17, 2022 •

edited

Loading

yichi-yang Jul 17, 2022 •

edited

Loading

yichi-yang commented Jul 17, 2022 •

edited

Loading

CTimmerman commented Jul 19, 2022 •

edited

Loading

CTimmerman commented Jul 23, 2022 •

edited

Loading

yichi-yang commented Sep 2, 2022 •

edited

Loading