Commit
* Fix two minor bugs from the Ruby code

  First, `category` rather than `code` was used in constructing the `control_boundary` property as related to the characters U+200C and U+200D. This seemed incorrect and should be fixed. This could be an observable bugfix for any C code which inspects the `control_boundary` property.

  Second, when reading composition exclusions, Ruby's `String#hex` method produces zero rather than nil if no number is found. For example:

      $ ruby -e 'puts "# blah".hex'
      0

  This led to the character `'\0'` being included in the `exclusions` and `excl_versions` sets, which is incorrect. However, this seems asymptomatic because `'\0'` is never part of a composition. (In terms of the C code, the use of `comp_exclusion` is guarded by the `comb_index` property, which is `UINT16_MAX` for `'\0'`.)

* Cleanup: Remove sequence ordering hack

  This hack changed the ordering of sequences encoded in the sequences table and was added so we could easily prove equivalence to the Ruby data generator code. However, it's no longer needed and removing it shouldn't result in any functional change.
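The `String#hex` pitfall described in the second bug fix can be reproduced with a short Ruby sketch. This is only an illustration: `parse_exclusion_code` is a hypothetical helper and the sample lines are made up in the style of a composition exclusions file. Neither is taken from the actual data generator.

```ruby
# Hypothetical helper, not from the real generator: returns the code point
# at the start of a line, or nil when the line does not begin with a
# hexadecimal number.
def parse_exclusion_code(line)
  match = line.match(/\A\h+/)   # \h matches a single hexadecimal digit
  match && match[0].hex
end

# Made-up sample input: a comment line, a data line, and an empty line.
lines = ["# blah", "0958    # DEVANAGARI LETTER QA", ""]

# Naive parsing: String#hex returns 0 when no number is found, so comment
# and empty lines silently contribute code point 0.
naive = lines.map { |line| line.hex }

# Guarded parsing: lines without a leading hex number are dropped.
guarded = lines.map { |line| parse_exclusion_code(line) }.compact

puts naive.inspect
puts guarded.inspect
```

With the naive approach, code point 0 ends up in the result for every comment or blank line, which matches the spurious `'\0'` entry the commit message describes. The guarded variant keeps only lines that actually start with a hex number.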