-
Notifications
You must be signed in to change notification settings - Fork 182
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make NFKD and UTS 46 data store only the difference form NFD #1984
Conversation
…without ignoring default ignorables Default ignorables are not ignored, because doing so would violate the fundamental assumption of the normalizes that every input character produces non-empty output. The expectation is that real NFKC_CaseFold will be implemented by first filtering out default ignorables and then plugging the NFKD_CaseFold data into the upcoming `ComposingNormalizer` code that will turn NFD into NFC and NFKD into NFKC.
Saves 7332 bytes in data size.
…and allow dynamic further shortening in tailorings This makes the action of turning a value read from the table into a `CollationElement` super-simple (and branchless).
Makes testdata.postcard 25.8 KB smaller.
Codecov Report
@@ Coverage Diff @@
## main #1984 +/- ##
==========================================
- Coverage 78.47% 74.84% -3.64%
==========================================
Files 375 474 +99
Lines 27643 29089 +1446
==========================================
+ Hits 21694 21772 +78
- Misses 5949 7317 +1368
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Makes the postcard form of the data 25.8 KB smaller. Draft only because changesets continue on top of #1978.