-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ben Kaduk's Transport Comment 35 #4647
Comments
I hope not. How can we qualify this to avoid any implication that it does? |
BCP 18 says:
That really does sound like it covers this field. Would it be enough to say that the field SHOULD begin with a language tag if it is not empty? |
This is a great example of red-tape. In any case, I think we can explicitly disavow anything here and say that the content of the field is not-defined and while it might be a good idea to make it something that can be rendered as text, it is for diagnostic purposes and does not need to be comprehensible to anyone other than the entity that generated it. |
see #187 (comment) (As an individual, I don't see any action required to fix something last spoken about, with little interest, ~4 years ago) |
For a more substantial discussion, please see #1990 |
In case my intention isn't clear (again as an individual), the WG discussed this previously and found a resolution reflected in the current text. I'm not seeing any new information in the meantime to warrant revisiting that decision. I am concerned that a change at this stage is disruptive to existing implementations and deployments of QUIC. |
Though we recommend the use of text in UTF-8, we don't provide all of the machinery necessary for true internationalization, like multiple translations and all that mess. Be very explicit about that. (This is a recurring problem; we had the same problem with HTTP/2 and it might be worth updating BCP 18 for this.) Closes #4647.
For clarity, I do not plan to take a position on the question since I am not much of an authority on the matter; I have asked the ART ADs if they can weigh in. |
I think @martinthomson 's fix in #4722 makes it so that this is not necessarily "human readable". I think that fixes this issue by defining the field differently (though still accurately), so I'm going to merge that and close this issue. |
Sorry to come in later, after Jana already closed this, but I think the result is fine. I sent the following reply to Ben K: It's generally true that we don't worry about language tagging for strings that aren't expected to be directly handled by end users. The failures caused by not knowing the intended language involve semantic issues (trying to do translation or other semantic interpretation of the strings), case mapping (such as handling "i" to "I" in English, vs Turkish), sorting ("ä" sorts after "a" in German and after "z" in Swedish, and Spanish actually eliminated "ch" as a single letter in 1994 to avoid such trouble), and comparisons. The strings in question here have none of those issues. So I would let this go, under the general assumption, which we make in very many other places in our protocol suites, that making it UTF-8 allows implementations to swap in German or Swedish or Turkish or Chinese versions of the messages, that installations will choose which language to operate in, and that language tagging really won't be needed. |
Thanks for the additional information Barry. |
@kaduk said:
The text was updated successfully, but these errors were encountered: