Reintroduce case sensitive comparison optimization for FrozenDictionary in some cases #95232

andrewjsaid · 2023-11-25T21:38:00Z

In #94667 we fixed a bug where FrozenDictionary<string, T> was incorrectly using case sensitive comparison. The solution chosen minimized the diff as the PR would be ported to .NET 8. Upon reflection, alongside fixing the bug, it did de-optimize some cases where the optimization was in fact valid. The examples below give a quick summary.

// PR94667 correctly targeted this scenario. Case sensitivity matters here even if only hashing last character.
keys1 = ["abc1", "abc2", "abc3", "abc4", "abc5", "abc6"];

// PR94667 left this scenario unchanged. Case sensitivity doesn't matter but the hash is against the entire string.
keys2 = ["0", "1", "2", "3", "4", "5", "6", "7", "8", "9"];

// PR94667 unnecessarily de-optimized this scenario. Case sensitivity doesn't matter even if only hashing last character.
keys3 = ["001", "002", "003", "004", "005", "006"];

This PR I am submitting re-introduces the optimization for scenarios like keys3 above i.e. where:

The KeyAnalyzer has found a substring
The entirety of all keys are ASCII
There are no letters in any of the keys - not only checking the substring.

I am not submitting any benchmarks up-front as it is a re-introduction of an optimization in some cases and there are no benchmarks for case insensitive Frozen Dictionaries. Please let me know if there's any relevant benchmarks I should run.

…ry in some cases

ghost · 2023-11-25T21:38:14Z

Tagging subscribers to this area: @dotnet/area-system-collections
See info in area-owners.md if you want to be subscribed.

Issue Details

In #94667 we fixed a bug where FrozenDictionary<string, T> was incorrectly using case sensitive comparison. The solution chosen minimized the diff as the PR would be ported to .NET 8. Upon reflection, alongside fixing the bug, it did de-optimize some cases where the optimization was in fact valid. The examples below give a quick summary.

// PR94667 correctly targeted this scenario. Case sensitivity matters here even if only hashing last character.
keys1 = ["abc1", "abc2", "abc3", "abc4", "abc5", "abc6"];

// PR94667 left this scenario unchanged. Case sensitivity doesn't matter but the hash is against the entire string.
keys2 = ["0", "1", "2", "3", "4", "5", "6", "7", "8", "9"];

// PR94667 unnecessarily de-optimized this scenario. Case sensitivity doesn't matter even if only hashing last character.
keys3 = ["001", "002", "003", "004", "005", "006"];

This PR I am submitting re-introduces the optimization for scenarios like keys3 above i.e. where:

The KeyAnalyzer has found a substring
The entirety of all keys are ASCII
There are no letters in any of the keys - not only checking the substring.

I am not submitting any benchmarks up-front as it is a re-introduction of an optimization in some cases and there are no benchmarks for case insensitive Frozen Dictionaries. Please let me know if there's any relevant benchmarks I should run.

Author:	andrewjsaid
Assignees:	-
Labels:	`area-System.Collections`
Milestone:	-

src/libraries/System.Collections.Immutable/src/System/Collections/Frozen/String/KeyAnalyzer.cs

…reintroduce-optimization

andrewjsaid · 2024-01-28T23:40:36Z

@stephentoub I have addressed your comments. Thanks

danmoseley · 2024-01-29T00:18:54Z

Is it worth adding benchmarks, you mention they don't exist?

andrewjsaid · 2024-01-29T13:17:08Z

@danmoseley I would recommend that at least a few benchmarks are added to dotnet/performance with case insensitive comparison, yes.

I can try to find the time to do so but I can not reliably make that commitment at this moment.

stephentoub

Thanks!

andrewjsaid · 2024-02-15T17:54:07Z

Sorry to pester but as it's approved, could it also be merged, please?

Reintroduce case sensitive comparison optimization for FrozenDictiona…

7475242

…ry in some cases

dotnet-issue-labeler bot added the area-System.Collections label Nov 25, 2023

ghost added the community-contribution Indicates that the PR has been added by a community member label Nov 25, 2023

stephentoub reviewed Jan 2, 2024

View reviewed changes

src/libraries/System.Collections.Immutable/src/System/Collections/Frozen/String/KeyAnalyzer.cs Outdated Show resolved Hide resolved

stephentoub reviewed Jan 2, 2024

View reviewed changes

src/libraries/System.Collections.Immutable/src/System/Collections/Frozen/String/KeyAnalyzer.cs Outdated Show resolved Hide resolved

andrewjsaid added 2 commits January 2, 2024 17:37

Merge remote-tracking branch 'upstream/main' into frozen-collections-…

153319b

…reintroduce-optimization

Renamed a few symbols and added comments for additional clarity

2e84d9c

andrewjsaid requested a review from stephentoub January 2, 2024 18:30

Merge branch 'main' into frozen-collections-reintroduce-optimization

2ea45a6

build-analysis bot mentioned this pull request Jan 5, 2024

[wasm] Failing test System.Runtime.InteropServices.JavaScript.Tests.WebWorkerTest.ManagedDelay_ContinueWith #96493

Closed

eiriktsarpalis requested review from stephentoub and removed request for stephentoub January 8, 2024 15:49

andrewjsaid added 2 commits January 15, 2024 00:43

Merge branch 'main' into frozen-collections-reintroduce-optimization

5297ea3

Merge branch 'main' into frozen-collections-reintroduce-optimization

e4e2a53

This was referenced Jan 19, 2024

Checkout failure: "Git fetch failed with exit code 128" dotnet/arcade#9009

Open

Tracking issue for CI build timeouts #76454

Closed

Merge branch 'main' into frozen-collections-reintroduce-optimization

f24b06f

stephentoub approved these changes Feb 13, 2024

View reviewed changes

Merge branch 'main' into frozen-collections-reintroduce-optimization

a202f4b

stephentoub merged commit bbb97a7 into dotnet:main Feb 15, 2024
111 checks passed

github-actions bot locked and limited conversation to collaborators Mar 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reintroduce case sensitive comparison optimization for FrozenDictionary in some cases #95232

Reintroduce case sensitive comparison optimization for FrozenDictionary in some cases #95232

andrewjsaid commented Nov 25, 2023

ghost commented Nov 25, 2023

andrewjsaid commented Jan 28, 2024

danmoseley commented Jan 29, 2024

andrewjsaid commented Jan 29, 2024

stephentoub left a comment

andrewjsaid commented Feb 15, 2024

Reintroduce case sensitive comparison optimization for FrozenDictionary in some cases #95232

Reintroduce case sensitive comparison optimization for FrozenDictionary in some cases #95232

Conversation

andrewjsaid commented Nov 25, 2023

ghost commented Nov 25, 2023

andrewjsaid commented Jan 28, 2024

danmoseley commented Jan 29, 2024

andrewjsaid commented Jan 29, 2024

stephentoub left a comment

Choose a reason for hiding this comment

andrewjsaid commented Feb 15, 2024