-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Normative: List new Unicode v14 Script
/Script_Extensions
values
#2515
Conversation
@mathiasbynens I believe last time we said we were going to ask for consensus for these. Do you want to present this or would you like the editors to take care of it? I imagine it will be an extremely short agenda item either way. |
My memory is that we would indeed ask for consensus for any new properties being added, but not for any new values to already-supported properties. Otherwise we’d be in this weird state where we claim to support the latest Unicode, and we claim to support IMHO, asking for consensus on new values for already-supported properties would be equivalent to asking for TC39 consensus on the set of new The meeting notes from when this was last discussed are here: https://github.com/tc39/notes/blob/master/meetings/2020-06/june-2.md#introducing-unicode-support But I agree it doesn’t clearly capture a decision on this specific case. My memory and intuition goes back to when I first proposed |
@mathiasbynens That sounds reasonable, but I don't think we explicitly got consensus for that at the last meeting. I want to bring this PR to the next meeting and more explicitly get consensus on
On the other hand, it's not totally clear to the editors what the purpose of the list of values in General_Category and Script/Script_Extensions serves, if the editors are always going to fast-track any updates to them without plenary approval, vs just having a normative reference to Unicode in place of those two tables. |
This sounds like re-establishing consensus on something that was already part of the approved As part of the If relitigated, it’d be problematic if this would not re-gain consensus for the reasons I described. |
@mathiasbynens any reason not to remove the tables that enumerate those values then? |
We discussed that, no? https://github.com/tc39/notes/blob/master/meetings/2020-06/june-2.md#introducing-unicode-support They were originally added to reduce the risk of interoperability issues, and IMHO we shouldn’t change that. |
@mathiasbynens It was clear that we didn't want to remove the table that lists properties. It was not clear that we didn't want to remove the table that lists value. |
Besides interop, listing the values + aliases explicitly is useful since we decided not to support loose matching in ECMAScript’s |
Added a topic for next plenary: tc39/agendas#1084 |
Per the 2021-12-14 TC39 meeting, patches that add new upstream Unicode values & value aliases to already-supported-in-ECMAScript properties do not require explicit consensus. Removing the label. Thanks for presenting this, @michaelficarra! |
@mathiasbynens One of the things we said we'd do as part of this is to document exactly how the names of the entries in this table are chosen, either in the spec text itself or in some metadata somewhere. Can you help us figure out the right way of saying it? Would "these names correspond to the first spelling (including casing) used for each value in Scripts.txt" be accurate? |
It was also suggested that we include that editorial process in the spec itself, so I plan to discuss the merits of that in the next editor call. |
What you were suggesting might lead to the same results for the Note that this logic unfortunately doesn’t cover ALL properties listed in the spec — in particular |
Right, these are the files to look at:
Formally speaking, the particular use of case/space/underscore is unlikely to change but not (I think) guaranteed to not change. |
f13634e
to
6f6c232
Compare
I've pushed up a commit with the following paragraph just before the
@mathiasbynens PTAL. If this does not seem correct to you, can you suggest an alternative? |
spec.html
Outdated
@@ -35141,6 +35141,9 @@ <h1> | |||
<emu-note> | |||
<p>This algorithm differs from <a href="https://unicode.org/reports/tr44/#Matching_Symbolic">the matching rules for symbolic values listed in UAX44</a>: case, <emu-xref href="#sec-white-space">white space</emu-xref>, U+002D (HYPHEN-MINUS), and U+005F (LOW LINE) are not ignored, and the `Is` prefix is not supported.</p> | |||
</emu-note> | |||
<emu-note> | |||
<p>The spellings of entries in these tables (including casing) were chosen to match the first occurrence of each property in the files PropertyAliases.txt and PropertyValueAliases.txt in the Unicode Character Database at the time each entry was added to this specification. However, because the precise spellings in those files are not guaranteed to be stable, implementations are required to follow this table rather than those files.</p> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍 Some nits/thoughts:
- Let’s add links to these files? https://unicode.org/Public/UCD/latest/ucd/PropertyAliases.txt and https://unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt are stable URLs.
- Should the file names be wrapped in
<code>
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So like this?
<p>The spellings of entries in these tables (including casing) were chosen to match the first occurrence of each property in the files PropertyAliases.txt and PropertyValueAliases.txt in the Unicode Character Database at the time each entry was added to this specification. However, because the precise spellings in those files are not guaranteed to be stable, implementations are required to follow this table rather than those files.</p> | |
<p>The spellings of entries in these tables (including casing) were chosen to match the first occurrence of each property in the files <a href="https://unicode.org/Public/UCD/latest/ucd/PropertyAliases.txt"><code>PropertyAliases.txt</code></a> and <a href="https://unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt"><code>PropertyValueAliases.txt</code></a> in the Unicode Character Database at the time each entry was added to this specification. However, because the precise spellings in those files are not guaranteed to be stable, implementations are required to follow this table rather than those files.</p> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would that also apply to the other UCD file references (cf. #2594)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@gibson042 Yes, please do.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
6f6c232
to
63bef6d
Compare
…c39#2515) The following new values for the already-supported properties `Script` and `Script_Extensions` are added: - Cypro_Minoan (Cpmn) - Old_Uyghur (Ougr) - Tangsa (Tnsa) - Toto (Toto) - Vithkuqi (Vith) Issue: tc39#2514 Tests: tc39/test262#3199
63bef6d
to
2ee71d8
Compare
The following new values for the already-supported properties
Script
andScript_Extensions
are added:Issue: #2514
Tests: tc39/test262#3199