When downloading a Blob, failed requests (HttpResponseError exceptions) are not correctly retried #22784
Comments
@tasherif-msft can you take a look?
@jalauzon-msft fyi
Hi @Neki, we are aware of this issue. These resets are triggered by the service. We are currently in discussions on handling these types of errors in the Python SDKs. We will update you once we have more information.
Hi @Neki Benoît, I took another look at this issue, and I believe this is separate from our other effort around Connection Resets that Tamer mentioned above. I spent some time trying to reproduce your Traceback where Connection Resets would not be retried during a streaming download, but I was unable to reproduce the issue. It seems that requests made in I was trying to reproduce this issue from the As a note, we are still investigating Connection Reset errors that are occurring even with retries so you may still see those, but I am more interested in if the retries are working properly in the latest versions. Thanks!
Hi @Neki. Thank you for opening this issue and giving us the opportunity to assist. We believe that this has been addressed. If you feel that further discussion is needed, please add a comment with the text “
Hi @Neki, since you haven’t asked that we “
Describe the bug
We have an integration test that's using the following code
Sometimes, this test fails with this stacktrace (one line edited to remove a private filename, otherwise it's a copy-paste):
When looking at the relevant code:
the HttpResponseError is raised by the process_content line, but is not caught here. self.client.download doesn't raise an exception, it returns an iterator instead. The exception is raised when process_content iterates on response.
To Reproduce
Steps to reproduce the behavior:
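The failure mode described above can be illustrated without the Azure SDK. The following is a minimal sketch, not the original reproduction: `HttpResponseError` is a local stand-in class, and `download`, `process_content`, and `fetch` are hypothetical functions that only mimic the shape of the code discussed (a call that returns a lazy iterator, followed by a step that consumes it outside the `try` block).

```python
class HttpResponseError(Exception):
    """Local stand-in for azure.core.exceptions.HttpResponseError."""


def download():
    """Hypothetical stand-in for self.client.download: returns a lazy iterator."""
    def chunks():
        yield b"first chunk"
        # The failure only happens once the caller starts iterating.
        raise HttpResponseError("connection reset while streaming")
    return chunks()


def process_content(response):
    """Consumes the iterator, mirroring the process_content step."""
    return b"".join(response)


def fetch():
    try:
        response = download()  # creating the iterator cannot fail here
    except HttpResponseError:
        return "retried"  # never reached for streaming failures
    # Iteration happens outside the try block, so the error escapes.
    return process_content(response)


try:
    fetch()
    outcome = "no error"
except HttpResponseError as exc:
    outcome = f"escaped: {exc}"

print(outcome)
```

If this mirrors the real code path, the `except` clause around the download call can never see errors raised while the body is being streamed.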
Expected behavior
The retry code is triggered, and the network error does not surface to the application.
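One way to get this behavior is to put the iteration itself inside the retried block and resume from the last byte received. The sketch below assumes a hypothetical ranged `download(offset)` function and a local `HttpResponseError` stand-in; it is not how the Azure SDK implements its retries, only an illustration of the expected outcome.

```python
class HttpResponseError(Exception):
    """Local stand-in for azure.core.exceptions.HttpResponseError."""


DATA = b"0123456789"
state = {"failures_left": 2}  # simulate two mid-stream connection resets


def download(offset):
    """Hypothetical ranged download: lazily yields DATA[offset:] in 4-byte chunks."""
    def chunks():
        for start in range(offset, len(DATA), 4):
            yield DATA[start:start + 4]
            if state["failures_left"] > 0:
                state["failures_left"] -= 1
                raise HttpResponseError("connection reset")
    return chunks()


def download_with_retry(max_attempts=5):
    received = bytearray()
    for _ in range(max_attempts):
        try:
            # Consuming the iterator inside the try block lets streaming
            # errors reach the retry logic.
            for chunk in download(offset=len(received)):
                received.extend(chunk)
            return bytes(received)
        except HttpResponseError:
            continue  # retry, resuming after the bytes already received
    raise HttpResponseError("retries exhausted")


result = download_with_retry()
print(result)
```

Here the two simulated resets are absorbed by the retry loop, and the caller only sees the complete payload.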
Screenshots
N/A
Additional context
Maybe I'm misinterpreting the stack trace here. The code never configures retry parameters; it uses the defaults provided by the Azure SDK. To avoid encountering this issue, should I change the retry parameters instead?