Refactor hashing #143

sdboyer · 2017-01-14T05:43:26Z

#139 was prematurely merged.

Refactor tests to pass with new input set
Cover combination cases with new tests
Separate tests for rootdata ops
Introduce logic for typed constraint string dumps
~~Dump Source unconditionally from constraints, normalizing via method~~

We DON'T actually want to dump the source unconditionally, because an empty and a populated ProjectIdentifier.Source are not semantically equivalent: an empty one allows for the possibility of having the source switched on the fly by something that comes along later in the algorithm.

Writing hashing inputs through an io.Writer provides a convenient way of letting the debugging func inject a newline after each write (for readability in debugging).
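A minimal sketch of the newline-injecting writer idea, for illustration only. The names `nlWriter`, `writeInputs`, and `debugDump`, and the input values, are hypothetical and not taken from the PR; only the standard library `io.Writer` interface and `crypto/sha256` are real.

```go
package main

import (
	"bytes"
	"crypto/sha256"
	"fmt"
	"io"
)

// nlWriter is a hypothetical debugging wrapper: it forwards each write to the
// underlying writer and then appends a newline, so every hashing input ends
// up on its own line.
type nlWriter struct {
	w io.Writer
}

func (n nlWriter) Write(p []byte) (int, error) {
	c, err := n.w.Write(p)
	if err != nil {
		return c, err
	}
	if _, err := n.w.Write([]byte{'\n'}); err != nil {
		return c, err
	}
	// Report only the caller's bytes, as the io.Writer contract expects.
	return c, nil
}

// writeInputs stands in for the real input-writing code; the values are
// made up for illustration.
func writeInputs(w io.Writer) {
	for _, s := range []string{"root", "github.com/foo/bar", "1.0.0"} {
		io.WriteString(w, s)
	}
}

// debugDump returns the same inputs the hasher sees, one per line.
func debugDump() string {
	var buf bytes.Buffer
	writeInputs(nlWriter{w: &buf})
	return buf.String()
}

func main() {
	// Hashing path: inputs go straight into the hash with no separators.
	h := sha256.New()
	writeInputs(h)
	fmt.Printf("%x\n", h.Sum(nil))

	// Debugging path: identical writes, but readable.
	fmt.Print(debugDump())
}
```

Because both paths call the same `writeInputs`, the debug dump cannot drift from what is actually hashed; only the destination writer changes.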

This separates much of the static state, rules, and information that comes from the root project and input parameters into a discrete subsystem. The only real benefit is that the state tracked by the solver is now focused on the actual solving algorithm rather than on these static rules, which should make it a bit easier for other people to grok.

All changes are geared towards making "default"-type values explicit, as that increases the likelihood that equivalent inputs will produce identical hash digests.
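One way to picture "making defaults explicit", as a sketch under stated assumptions: the `dep` type, the `dump` function, and the `*` marker are all hypothetical, and the sketch assumes that an unset constraint and `*` mean the same thing. Nothing here reflects the library's actual types or encoding.

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

// dep is a hypothetical, pared-down dependency declaration.
type dep struct {
	name       string
	constraint string // empty means "no constraint given"
}

// dump writes the constraint explicitly, substituting a made-up "*" marker
// for the empty default so that both spellings produce the same bytes.
func dump(d dep) string {
	c := d.constraint
	if c == "" {
		c = "*"
	}
	return d.name + "\n" + c + "\n"
}

func main() {
	implicit := dep{name: "github.com/foo/bar"}
	explicit := dep{name: "github.com/foo/bar", constraint: "*"}

	// Both print the same digest because the default was made explicit
	// before hashing.
	fmt.Printf("%x\n", sha256.Sum256([]byte(dump(implicit))))
	fmt.Printf("%x\n", sha256.Sum256([]byte(dump(explicit))))
}
```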

Hashing functions are exquisitely sensitive to inputs - that's why they're useful. But it makes them a PITA to work with. Having an easy-to-scan visualization of hashing inputs in tests frees up cognitive capacity to focus on the algorithm.

codecov-io · 2017-01-15T01:28:43Z

Current coverage is 77.68% (diff: 90.29%)

Merging #143 into master will increase coverage by 0.39%

@@             master       #143   diff @@
==========================================
  Files            24         25     +1   
  Lines          3694       3759    +65   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits           2855       2920    +65   
- Misses          633        635     +2   
+ Partials        206        204     -2

Powered by Codecov. Last update 65939b4...85a0fc3

To further improve debugging of issues with the input hashing, this adds "section headers" - strings that are output prior to each type of data that's present in the cache. Also partially switched to progressive mutation table-based tests for input hashing, and added test cases that cover salient combinations of overrides, imports, and constraints.
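A rough sketch of what section headers buy, assuming a simplified model where every input is a plain string. The header text and the `dumpInputs` function are made up for illustration and are not the strings or API used in the PR.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// dumpInputs writes each category of input under its own header line.
// Headers are written even for empty categories, so the layout of the dump
// is always the same.
func dumpInputs(constraints, imports, overrides []string) string {
	var sb strings.Builder
	section := func(header string, items []string) {
		// Sort a copy so caller ordering does not affect the dump.
		sorted := append([]string(nil), items...)
		sort.Strings(sorted)
		sb.WriteString(header + "\n")
		for _, it := range sorted {
			sb.WriteString(it + "\n")
		}
	}
	section("-CONSTRAINTS-", constraints)
	section("-IMPORTS-", imports)
	section("-OVERRIDES-", overrides)
	return sb.String()
}

func main() {
	fmt.Print(dumpInputs(
		[]string{"github.com/foo/bar"},
		[]string{"github.com/foo/bar", "github.com/baz/qux"},
		nil,
	))
}
```

Without the headers, the same string supplied as a constraint or as an override would produce identical bytes; with them, a reader scanning a test failure can also tell which section diverged.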

The diff-ish indicators in the hash diff output make it easier to see problem spots on a quick scan.
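A sketch of the kind of output this describes, assuming a simple position-by-position comparison of expected and actual input lines. The `diffLines` function and its `-`/`+` prefixes are hypothetical, and a real diff algorithm would align insertions and deletions rather than compare by index.

```go
package main

import (
	"fmt"
	"strings"
)

// diffLines compares want and got line by line at the same index.
// Matching lines get a two-space prefix; mismatches show the expected line
// with "- " and the actual line with "+ ".
func diffLines(want, got []string) string {
	var sb strings.Builder
	n := len(want)
	if len(got) > n {
		n = len(got)
	}
	for i := 0; i < n; i++ {
		hasW, hasG := i < len(want), i < len(got)
		if hasW && hasG && want[i] == got[i] {
			sb.WriteString("  " + want[i] + "\n")
			continue
		}
		if hasW {
			sb.WriteString("- " + want[i] + "\n")
		}
		if hasG {
			sb.WriteString("+ " + got[i] + "\n")
		}
	}
	return sb.String()
}

func main() {
	want := []string{"-CONSTRAINTS-", "a", "b-foo"}
	got := []string{"-CONSTRAINTS-", "a", "pv-foo"}
	fmt.Print(diffLines(want, got))
}
```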

The typed constraint string dumps solve the problem, at least in the hasher, of strings representing different types of versions colliding. For example, prior to this change, a branch constraint named "foo" and a version constraint named "foo" could cause the hasher to produce the same hash, even though the two inputs would not have admitted the same solution set.
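The collision is easy to reproduce in miniature. The sketch below assumes a made-up `constraint` type with made-up kind tags (`b`, `pv`); the actual prefixes and types in the PR may differ.

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

// constraint is a hypothetical, simplified stand-in for the library's
// constraint types: a kind tag plus the string the user wrote.
type constraint struct {
	kind string // made-up tags: "b" for branch, "pv" for plain version
	body string
}

// untyped dumps only the body, which is what allows collisions.
func (c constraint) untyped() string { return c.body }

// typed prefixes the body with its kind, so a branch and a version with the
// same name produce different dumps.
func (c constraint) typed() string { return c.kind + "-" + c.body }

// digest hashes a dump string.
func digest(s string) [sha256.Size]byte { return sha256.Sum256([]byte(s)) }

func main() {
	branch := constraint{kind: "b", body: "foo"}
	version := constraint{kind: "pv", body: "foo"}

	fmt.Println("untyped digests equal:",
		digest(branch.untyped()) == digest(version.untyped()))
	fmt.Println("typed digests equal:",
		digest(branch.typed()) == digest(version.typed()))
}
```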

sdboyer added 5 commits January 11, 2017 20:50

Use an io.Writer to write hashing inputs

6109ef1

This provides a convenient way of letting the debugging func inject a newline after each write (for readability in debugging).

Remove blank/newlines from hashing tests

2488c3e

Remove pointless ifs

95c24a4

Comprehensive refactor of input hashing rules

28ed699

All changes are geared towards making "default"-type values explicit, as that increases the likelihood that equivalent inputs will produce identical hash digests.

sdboyer added this to the v0.14.0 milestone Jan 14, 2017

sdboyer self-assigned this Jan 14, 2017

sdboyer mentioned this pull request Jan 14, 2017

Do not include ineffectual constraints in hash #125

Closed

sdboyer mentioned this pull request Jan 14, 2017

This is the new hash on my machine golang/dep#86

Closed

sdboyer added 6 commits January 15, 2017 14:11

Add diff-ish indicators to hash diff output

0a9c6c6

Makes it easier to see problem spots on a quick scan.

Ensure hashing string inputs eq if bytes eq

d441d82

Add hashing test case for required AND imported

85a0fc3

Add rootdata-specific tests

366fea2

sdboyer merged commit 69fdac2 into master Jan 15, 2017

sdboyer deleted the refactor-hashing branch April 5, 2017 18:33

krisnova pushed a commit to krisnova/dep that referenced this pull request Apr 21, 2017

Merge pull request sdboyer/gps#143 from sdboyer/refactor-hashing

5cc9f45

Refactor hashing
