Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KHR_draco_mesh_compression: Restrictions on accessor IDs #1267

Open
donmccurdy opened this issue Feb 26, 2018 · 24 comments
Open

KHR_draco_mesh_compression: Restrictions on accessor IDs #1267

donmccurdy opened this issue Feb 26, 2018 · 24 comments
Labels
mesh compression needs discussion Issue or PR requires working group discussion to resolve. needs pull request Issue requires spec clarification or implementation note, and is ready for someone to open a PR.

Comments

@donmccurdy
Copy link
Contributor

donmccurdy commented Feb 26, 2018

There was a proposed restriction from @JeremieA in PR #874, which I think we mostly agreed with but that didn't make it into the spec before release:

Restrictions on accessors ids

To facilitate converting between compressed and uncompressed versions, and allow loaders to implement decoding as a preprocess step, the accessor ids used for attributes and targets within the primitive must be used for a unique compressed mesh and attribute. Multiple primitives can refer to the same accessor only if it is associated with the same draco compressed geometry and draco attribute id within it. This is similar to the standard case where the data for a given accessor is unique.

(struck the portion about morph targets, which have been left for a future version or new extension)

I'm fine with adding this requirement, although the three.js implementation does not depend on it. We could also wait and see if the issue is raised by others implementing the extension.

@pjcozzi
Copy link
Member

pjcozzi commented Feb 26, 2018

CC @FrankGalligan @fanzhanggoogle

@JeremieA
Copy link

Thanks for considering my proposed change for inclusion.

One issue with this change that I did not mention, is that while this restriction would allow for greater flexibility for loaders implementation, it would make it a bit more complex for developers of "optimiser" tools such as gltf-pipeline, as it would need to be considered when de-duplicating accessors.

With this restriction in place, two accessors that have exactly the same json content would not be allowed to be replaced by a single instance if both were referenced from different Draco-compressed meshes. This is not an issue for "normal" accessors, as their json content refer to the actual data of the accessor (so their json can only be identical if they refer to the same data). This is definitely fixable in the optimiser implementation (by adding a reference to the Draco bufferView and attribute id within "internal" json attributes, such as extras._pipeline already used within gltf-pipeline, of any accessors that are referenced from this extension, such that this information is considered when looking for duplicate accessors), but it can have an unintended side-effect of requiring special support for this extension on generic optimisation passes that would otherwise not need to be aware of it.

For the proposed restriction, this is only an issue if the Draco compression extension is used as extensionRequired, not when only used as extensionUsed, as in this case the accessors still have a reference to the uncompressed data, so they can only be identical if this data is also identical. This is consistent with the mechanism that a loader (or optimiser) should bail out of processing an asset if it does not support all extensions in the extensionRequired list.

Given the choice, I would still prefer to have the restriction in place, as I guess more developers will implement loaders, which should be as easy to do as possible, than advanced optimiser tools.

Note that similar issues for optimisers are already present for de-duplicating or re-indexing other arrays such as bufferViews for instance. If the list of views is changed and ids have to be updated, the optimiser does need to know that the Draco extension includes a reference to a bufferView that also need to be updated. For this issue one possible solution would be to make bufferView a reserved json property name, that can (or must) only be used by all extensions (and extras?) for referring to ids in the bufferViews array. This way an optimiser can update all references from all extensions without having to understand them. In my current simplistic implementation I used this as an assumption (see here). If this is something that is worth considering to add in the standard, and that is not already covered somewhere else, we could create a separate ticket.

@FrankGalligan
Copy link
Contributor

Hi @JeremieA,
I'm trying to clarify the issue.

For example you have two primitives that have the exact same compressed data and they are referencing the same bufferView. Then you want to make sure that the "attributes" and "indices" in the two primitives must contain the same accessor ids?

I attached two gltf files. One that I think would be valid according to this restriction and one that would be invalid according to the restriction.

clarify_issue_1267.zip

@donmccurdy
Copy link
Contributor Author

donmccurdy commented May 5, 2018

Then you want to make sure that the "attributes" and "indices" in the two primitives must contain the same accessor ids?

^If so, my proposed wording in the original comment should mention indices. That was brought up in #1333, and to my understanding this restriction would be pointless if we are not also restricting index accessor IDs.

@donmccurdy donmccurdy added mesh compression needs discussion Issue or PR requires working group discussion to resolve. labels Oct 10, 2018
@donmccurdy
Copy link
Contributor Author

donmccurdy commented Oct 12, 2018

I think we are going to need to decide on this and (one way or another) add further detail to the extension specification. Another case that is not handled well by my proposed wording above would be this: suppose two primitives have the same POSITION/NORMAL data and share the same Draco buffer for these attributes. Now suppose each primitive also has a COLOR attribute stored in an uncompressed accessor. I think that would be allowed, but it isn't clear. If so we should amend the restriction to:

Restrictions on accessors reuse

To facilitate converting between compressed and uncompressed versions, and to allow loaders to implement decoding as a preprocess step, a restriction is imposed on accessor reuse. If an attribute or index accessor's data is stored in a compressed bufferView (either solely or with uncompressed fallback), that accessor cannot be referenced anywhere else in the asset, except by another primitive that also uses the same compressed bufferView.

@MiiBond
Copy link
Contributor

MiiBond commented Oct 18, 2018

I'm unclear on what this restriction means when the Draco extension is required. Currently, when Adobe Dimension outputs Draco, I assign dummy data to the "regular" attributes and indices to satisfy the validator but I expect that this data is never used. When multiple, different meshes are present, the bufferViews are, of course, different but the dummy data is the same.

This works fine in Babylon but three.js thinks the meshes are the same and just uses a copy of the same mesh rather than loading a new one. My assumption was that this was a bug in the Three.js loader but, if this thread implies that regular attributes must be different for different meshes, even if they aren't used, I should change the exporter.

@donmccurdy
Copy link
Contributor Author

donmccurdy commented Oct 18, 2018

My assumption was that this was a bug in the Three.js loader but, if this thread implies that regular attributes must be different for different meshes, even if they aren't used, I should change the exporter.

That is what the thread implies, yes — if the Draco extension is required and two meshes have dummy POSITION attributes pointing at accessor 0, but the meshes' compressed data is in different bufferviews, it would be invalid with this restriction. In other words, loaders should be able to pre-process an asset replacing dummy accessors with decompressed Draco data, without needing to create any new accessors in the process.

But, I thought we'd managed to avoid relying on this restriction in the three.js loader — do you mind sharing an example that reproduces the issue so I can debug? I'm still open to merging the restriction or omitting it, either way.

@MiiBond
Copy link
Contributor

MiiBond commented Oct 18, 2018

Sure. I'll send you a sample file.

@donmccurdy
Copy link
Contributor Author

Thanks, looked into this and it's a bug in the threejs loader. We cache on the accessor indices to determine when a mesh is reused (because glTF meshes cannot be instantiated apart independently of their materials), and didn't account for extensions in that caching. We'll fix that, but I'm still undecided on this restriction..

@donmccurdy
Copy link
Contributor Author

Should be fixed soon: mrdoob/three.js#15084.

@donmccurdy
Copy link
Contributor Author

To summarize from this week's call —

(1) There has been some misunderstanding about the meaning of attribute accessors when the draco extension is required, and there is no uncompressed fallback data. The intention was that accessors must still contain valid count, min, max, componentType, and type properties. Only bufferView and byteOffset are meaningless in this case, and can be omitted. I had thought this was in the extension spec already, but after re-reading it is not so clear. The nearest language we have is this:

When loading each accessor, you must ignore the bufferView and byteOffset of the accessor and go to the previously decoded Draco geometry in the primitive to get the data of indices and attributes. A loader must use the decompressed data to fill the accessors or render the decompressed Draco geometry directly

To satisfy the requirement that accessors properties be correct for both compressed and uncompressed data, generators should create uncompressed attributes and indices using data that has been decompressed from the Draco buffer, rather than the original source data.

The second quote is non-normative. I believe we should include normative requirements that these accessor properties correctly describe the compressed data ... the other interpretation is not implementable, because the Draco data does not (to my knowledge) include type or component type information.

@MiiBond any concerns with this?

(2) The other issue, and the original topic of this thread, was whether two accessors that really have the same properties (given the requirements above) could be referenced by two different primitives whose compressed data is stored in two different draco buffers.

I don't have a strong opinion on (2). We could add the restriction as a "bug fix" but the issue has not come up in practice.

@donmccurdy
Copy link
Contributor Author

@lexaknyazev did you have a preference on (2)? I didn't fully catch your comment on the call.

@lexaknyazev
Copy link
Member

On (1):
Draco data does contain number of vertices and type/componentType information in some form. Accessors are expected to be used to build VAOs ahead of time, before decompression ends. There was a discussion on that somewhere on GitHub with Microsoft folks.

(2):
We haven't discussed "accessor equality" on a high level. Probably, we should. Following (1), I'd say that meshes with different data can reuse accessor objects as long as bufferView is omitted.

@donmccurdy
Copy link
Contributor Author

Draco data does contain number of vertices and type/componentType information in some form.

Good to know. I'm having trouble finding the right enum to get that information back out at decode time, but it appears that the data exists anyway. Do you agree we should clarify that the accessor data still has to be correct, or are you suggesting we loosen this?

@lexaknyazev
Copy link
Member

I think that #1343 covers accessors' reqs.

@donmccurdy
Copy link
Contributor Author

Probably wouldn't hurt to specify all accessor properties except bufferView and byteOffset apply, but agreed.

@pjcozzi
Copy link
Member

pjcozzi commented Nov 6, 2018

@lexaknyazev @donmccurdy what are next steps here?

@donmccurdy
Copy link
Contributor Author

donmccurdy commented Nov 6, 2018

  • (1) Open PR to clarify that count, min, max, componentType, and type must reflect compressed data accurately.
  • Open PR to (2a) clarify that accessors without bufferView can be used with different compressed data, OR (2b) clarify that this is disallowed.

(1) is implied by the spec already, but could be clearer. The second point is not so clear, and it seems like we're leaning toward (2a) rather than (2b).

@emackey emackey added the needs pull request Issue requires spec clarification or implementation note, and is ready for someone to open a PR. label Nov 14, 2018
@emackey
Copy link
Member

emackey commented Nov 14, 2018

I think that once (1) is solved, (2) becomes less of an issue, as different meshes will typically have different stats for (1), requiring separate accessors.

For the rare case where two different Draco streams happen to have exactly matching metadata for (1), I don't have a strong opinion on (2a) vs (2b). Do implementations have a strong need for one or the other choice?

@MiiBond
Copy link
Contributor

MiiBond commented Nov 14, 2018

I'm just catching up on this thread. I've updated the Draco logic in the Adobe Dimension exporter but I'm wondering if I've misunderstood this again. I've set the accessor information to match the uncompressed stream, not the compressed one.
i.e. I encode the geometry and store the buffers but then I uncompress a temporary copy just so I can get the correct info for the accessor.
Are we now saying that the accessor info should be for the compressed stream, not the uncompressed one?

@donmccurdy
Copy link
Contributor Author

donmccurdy commented Nov 15, 2018

For the rare case where two different Draco streams happen to have exactly matching metadata for (1), I don't have a strong opinion on (2a) vs (2b). Do implementations have a strong need for one or the other choice?

I'm happy with either, just so long as something is decided. :)

I encode the geometry and store the buffers but then I uncompress a temporary copy just so I can get the correct info for the accessor... Are we now saying that the accessor info should be for the compressed stream, not the uncompressed one?

I'll make up some definitions for the sake of discussion:

  • uncompressed: accessor input data in an original non-draco glTF file.
  • compressed: accessor data compressed in draco buffer
  • decompressed: accessor output data that has been read and decompressed from draco buffer

The only way to compute accessor info for compressed data is by reading the decompressed data. If someone were writing fallback (non-draco) data into accessors, they must use the decompressed data. In no case should uncompressed data be used as a fallback, or to populate accessor info.

@MiiBond that sounds like what you've described, but I'm not sure?

@MiiBond
Copy link
Contributor

MiiBond commented Nov 15, 2018

Thanks, @donmccurdy. Yeah, I'm using decompressed data now.

I just wanted to make sure because you mentioned "Open PR to clarify that count, min, max, componentType, and type must reflect compressed data accurately".

@emackey
Copy link
Member

emackey commented Nov 16, 2018

For the rare case where two different Draco streams happen to have exactly matching metadata for (1), I don't have a strong opinion on (2a) vs (2b). Do implementations have a strong need for one or the other choice?

I'm happy with either, just so long as something is decided. :)

I'll solidify my vote to (2b), disallow sharing of the same accessor between different Draco streams.

My thinking is: The typical case is that different Draco streams within a model should not be expected to share bounding boxes and other such details, so, typically (1) will require a separate accessor for each unique Draco stream. It would be a rare case where two unrelated Draco streams are similar enough that there could be a shared accessor. This would be a micro-optimization, given that such an accessor doesn't contain any data, it's just a few bytes of JSON metadata. Thus it potentially places a burden on our readers to look for this micro-optimization and un-do it when necessary. So, my vote is to disallow that. Thoughts?

@donmccurdy
Copy link
Contributor Author

That works for me. 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
mesh compression needs discussion Issue or PR requires working group discussion to resolve. needs pull request Issue requires spec clarification or implementation note, and is ready for someone to open a PR.
Projects
None yet
Development

No branches or pull requests

8 participants