Allow fetching commit by id #436

tgr · 2015-08-08T03:26:15Z

git fetch <remote> <commit id> usually does not work in git. This is a security measure to avoid unintentionally disclosing information that is no longer reachable from any branch or tag but has not yet been garbage collected (this discussion has some details). Recently, the uploadpack.allowReachableSHA1InWant configuration flag was added to enable fetching of reachable commits, but it is off by default and generally advised against for performance reasons.
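To make the behaviour concrete, here is a minimal sketch against a throwaway local repository served over file://. The temporary directory, the demo identity and the two empty commits are all made up for illustration, and the protocol is pinned to version 0 so the result does not depend on the client's default. It is not a statement about how GitHub's servers are configured.

```shell
set -eu
work=$(mktemp -d)

# "Server" repository with two commits, so the first one is reachable
# from the branch but is not itself the tip of any ref.
git init -q "$work/server"
cd "$work/server"
git -c user.name=demo -c user.email=demo@example.invalid commit -q --allow-empty -m first
old=$(git rev-parse HEAD)
git -c user.name=demo -c user.email=demo@example.invalid commit -q --allow-empty -m second

# Empty "client" repository that fetches by commit id.
git init -q "$work/client"
cd "$work/client"
url="file://$work/server"

# Attempt 1: no server-side option set.
if git -c protocol.version=0 fetch -q "$url" "$old" 2>/dev/null; then
  before=allowed
else
  before=refused
fi

# Attempt 2: the server opts in to serving reachable, non-tip commits.
git -C "$work/server" config uploadpack.allowReachableSHA1InWant true
if git -c protocol.version=0 fetch -q "$url" "$old" 2>/dev/null; then
  after=allowed
else
  after=refused
fi

echo "before=$before after=$after"
```

With a reasonably recent Git, the first attempt should be refused and the second should succeed, leaving the requested commit in FETCH_HEAD.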

This makes sense, but GitHub already has an API to get the contents of a particular commit, so a performant way to determine whether a commit can be reached already exists. It would be nice if commits that are accessible via the API would also be accessible via git fetch.

(The context in which this came up - but I'm sure there are many other uses - was verifying that no one tampered with a file when I have its commit id but I cannot trust any information I receive from GitHub, e.g. I need to use an insecure connection. In such a case the commit object that's obtained via git fetch could be used as a cryptographically secure fingerprint of the file. It can probably be constructed from the GitHub API responses as well, but that's rather tedious.)
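As a rough illustration of the fingerprint idea, the sketch below compares the blob id recorded under a known commit with the id recomputed from the file on disk. The repository, the file name file.txt and the variable trusted (standing in for a commit id recorded somewhere reliable) are assumptions for the example; in the real scenario the objects would come from an untrusted download rather than a local commit.

```shell
set -eu
work=$(mktemp -d)
git init -q "$work/repo"
cd "$work/repo"

printf 'hello\n' > file.txt
git add file.txt
git -c user.name=demo -c user.email=demo@example.invalid commit -q -m 'add file'

# Stand-in for the commit id that was recorded in a trusted place.
trusted=$(git rev-parse HEAD)

# Blob id that the trusted commit's tree records for the file.
expected=$(git rev-parse "$trusted:file.txt")

# Blob id recomputed from the bytes currently on disk.
actual=$(git hash-object file.txt)

# Any change to the content yields a different blob id.
printf 'hello!\n' > file.txt
tampered=$(git hash-object file.txt)

echo "expected=$expected"
echo "actual=$actual"
echo "tampered=$tampered"
```

Because the commit id covers the tree id, and the tree id covers each blob id, a match on the commit id carries over to every file it references.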


cirosantilli · 2015-08-08T06:43:22Z

This should be possible. Even Git could implement it by first checking if the commit is reachable or not.
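One way such a reachability check can be expressed with stock git commands is sketched below. The helper name is_reachable and the throwaway repository are hypothetical, and this is not necessarily how a server would implement the check internally.

```shell
set -eu
work=$(mktemp -d)
git init -q "$work/repo"
cd "$work/repo"

commit() {
  git -c user.name=demo -c user.email=demo@example.invalid commit -q --allow-empty -m "$1"
}

commit first
commit second
kept=$(git rev-parse HEAD~1)     # ancestor of the branch tip
commit third
dropped=$(git rev-parse HEAD)
git reset -q --hard HEAD~1       # "third" is no longer reachable from any ref

# rev-list prints the commit only if it is NOT reachable from any ref,
# so empty output means the commit is reachable.
is_reachable() {
  if [ -z "$(git rev-list -n 1 "$1" --not --all)" ]; then
    echo reachable
  else
    echo unreachable
  fi
}

kept_status=$(is_reachable "$kept")
dropped_status=$(is_reachable "$dropped")
echo "kept=$kept_status dropped=$dropped_status"
```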

I just don't think GitHub should modify the behaviour of Git on its website; that would be confusing.

Out of curiosity, I don't fully understand your use case: you need to use an insecure connection to get the commits, but you can use a secure connection to do the fetch, is that right?

tgr · 2015-08-10T06:45:26Z

Git is a standard with multiple implementations and I doubt GitHub uses the original one. That said, as it turns out they did implement it recently. I will update the ticket.

As for the use case, all connections could be insecure (a git commit is basically its own signature, it cannot be forged in a non-noticeable way), but that's not really the point as getting a secure connection is not that hard. Someone could have stolen the project owner's keys and changed the content though, so using mutable references like tag or branch names is not safe.

Let me put it another way: at some point in the past someone reviewed the code of a GitHub-hosted project, decided that a certain version is secure, and recorded its commit id. The threat model is that the review was sound, and the place where commit ids are recorded is reliable but every other source isn't. I.e. we need to download the actual code from somewhere (that somewhere is quite likely not GitHub, for performance reasons) and the downloaded code could have been tampered with.

cirosantilli · 2015-08-10T07:06:12Z

@tgr GitHub likely uses https://github.com/libgit2/libgit2

Although there are multiple implementations, the external facing API should be the same for all git clients. But true, a small extension like that could be considered.

But as you've found out (I didn't know!) it is already possible by a "standard" server config, so it definitely could be done.

I understand the use case better, thanks. Safe source for checking, unsafe for fast download.

* in case the used git does not support shallow submodules, just log a warning instead of throwing an exception
* Javadoc texts are used from the already existing CloneCommand
* tests are running fine now and cover more cases
* file:// has to be used as protocol for local remotes, otherwise git doesn't perform shallow cloning
* a local repository has to be used for shallow clone testing, because GitHub doesn't allow fetching dedicated commits
* see isaacs/github#436
* this would result in the following error: Server does not allow request for unadvertised object
* other minor improvements

akx · 2018-11-19T16:26:40Z

This is still an issue, unfortunately.

The reason I bumped into this is git clone --depth=1 --shallow-submodules (and judging by those linked issues above, I'm not the only one); --shallow-submodules seems to do the equivalent of git clone -b COMMITHASHHERE (which also does not work against GitHub).
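For reference, the kind of request a shallow checkout of a pinned commit depends on can be approximated as a depth-1 fetch by commit id. This is only a sketch against a local file:// repository with uploadpack.allowReachableSHA1InWant enabled. The repository layout and commit messages are invented, and it does not claim to reproduce what --shallow-submodules does internally.

```shell
set -eu
work=$(mktemp -d)

# "Server" with three commits; the middle one plays the pinned commit.
git init -q "$work/server"
cd "$work/server"
for msg in one two three; do
  git -c user.name=demo -c user.email=demo@example.invalid commit -q --allow-empty -m "$msg"
done
pinned=$(git rev-parse HEAD~1)
git config uploadpack.allowReachableSHA1InWant true

# Client fetches exactly that commit, without its history.
git init -q "$work/client"
cd "$work/client"
git fetch -q --depth=1 "file://$work/server" "$pinned"

fetched=$(git rev-parse FETCH_HEAD)
count=$(git rev-list --count FETCH_HEAD)
echo "fetched=$fetched commits=$count"
```

Against a server that refuses unadvertised objects, the same fetch fails, which is the situation described above.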

Prcuvu · 2019-08-08T19:08:35Z

This is an issue for me too. I want to re-create a branch from commits that are on no branch and have no tag (the head SHA1 is known and the tree is viewable on the web page), but GitHub's server-side restriction rules out the easy way of doing that.

uri-canva · 2019-10-23T08:31:14Z

There is now an uploadpack.allowAnySHA1InWant setting that avoids the cost of calculating reachability, introduced in git/git@f8edeaa.

uri-canva · 2019-10-24T00:22:01Z

It looks like this works as long as you use protocol v2; protocol v1 won't work.
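A small local experiment along these lines is sketched below, using a throwaway file:// repository and a commit that no ref reaches. The helper try and the directory names are hypothetical. The protocol v2 attempt is only printed, not assumed, since the observation above concerns GitHub's servers and stock upload-pack may behave differently between versions.

```shell
set -eu
work=$(mktemp -d)
git init -q "$work/server"
cd "$work/server"

commit() {
  git -c user.name=demo -c user.email=demo@example.invalid commit -q --allow-empty -m "$1"
}

commit base
commit orphan
orphan=$(git rev-parse HEAD)
git reset -q --hard HEAD~1       # no ref reaches $orphan any more

url="file://$work/server"

# try <protocol version>: fetch $orphan into a fresh client repository
# and print "allowed" or "refused".
try() {
  rm -rf "$work/client"
  git init -q "$work/client"
  if git -C "$work/client" -c protocol.version="$1" fetch -q "$url" "$orphan" 2>/dev/null; then
    echo allowed
  else
    echo refused
  fi
}

v0_default=$(try 0)
v2_default=$(try 2)              # printed for information only

git config uploadpack.allowAnySHA1InWant true
v0_any=$(try 0)

echo "v0 default: $v0_default"
echo "v2 default: $v2_default"
echo "v0 with allowAnySHA1InWant: $v0_any"
```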

marc-h38 · 2019-12-10T18:46:20Z

> as a security measure to avoid unintentional disclosure of information that has been made unavailable from every branch/tag but has not been garbage collected (http://thread.gmane.org/gmane.comp.version-control.git/257807 has some details).

Very sadly, gmane is no more. @tgr do you remember the subject, date, participants, anything?

tgr · 2019-12-10T19:50:37Z

I don't. At a guess it might have been this thread.

Not super relevant though, the point is there are valid security reasons to limit what commits you expose, but GitHub has a web API for exposing commits by sha1, so it already has to deal with that and exposing the same commits via git fetch as well would not be much extra complexity.

marc-h38 · 2019-12-11T02:32:12Z

> I don't. At a guess it might have been https://public-inbox.org/git/CAPBPrnsA4KxNximtKXcC37kuwBHK0Esytdm4nsgLHkrJSg3Ufw@mail.gmail.com/

Thanks for the prompt answer, indeed this one keeps popping up. For the record this is: "Can I fetch an arbitrary commit by sha1?", 2-9 Oct 2014.

> the point is there are valid security reasons to limit what commits you expose,

Yes and while security is very often mentioned, I still couldn't find any good, official and clear git (!=github) documentation about these security aspects, which is why I asked for the link (which still doesn't provide much).

cirosantilli added the enhancement label Aug 8, 2015

srawlins mentioned this issue Jun 11, 2016

Shallow clones of submodules rust-lang/rust#34228

Closed

merryhime mentioned this issue Nov 19, 2017

lint: Fetch baseref instead of baserev dolphin-emu/sadm#93

Closed

darxriggs mentioned this issue Jul 6, 2018

[JENKINS-21248] Support shallow submodule update jenkinsci/git-client-plugin#303

Closed

MaEtUgR mentioned this issue Nov 21, 2018

[WIP] Appveyor: Shallow clone repo and submodules PX4/PX4-Autopilot#10893

Closed

irengrig mentioned this issue May 15, 2019

Refactor git_repository and new_git_repository rules implementations … bazelbuild/bazel#8264

Closed

mgoodness mentioned this issue Aug 16, 2019

Cannot reference non-HEAD or untagged commits kubernetes-sigs/kustomize#1452

Closed

ghost mentioned this issue Aug 28, 2019

Reducing the memory/bandwidth footprint of submodules with ./x.py? rust-lang/rust#63978

Closed

OpportunityLiu mentioned this issue Jan 20, 2020

improve tab complete xmake-io/xmake#673

Merged

dzolnai mentioned this issue Feb 4, 2020

submodule git clone issue eduvpn/android#226

Closed

jtnord mentioned this issue Feb 13, 2023

fix fallback URL jenkinsci/plugin-compat-tester#457

Merged
