Skip to content
This repository has been archived by the owner on Dec 8, 2017. It is now read-only.

Add support for rendering characters from unicode supplementary planes #29

Closed
lucaswoj opened this issue Oct 26, 2016 · 4 comments
Closed

Comments

@lucaswoj
Copy link

We currently only support rendering characters from the Basic Multilingual Plane. We may need to support supplementary planes as we expand into markets that use non-Latin alphabets.

https://en.wikipedia.org/wiki/CJK_Unified_Ideographs#CJK_Unified_Ideographs_Extension_E notes that 98 of those characters come from “Chinese Academy of Surveying and Mapping ideographs”, so they probably will be cropping up in OSM eventually. -- @1ec5

@1ec5
Copy link
Contributor

1ec5 commented Oct 26, 2016

I’m satisfied that the CJK Unified Ideographs Extension E characters will take awhile to make their way into OpenStreetMap, given that the block was only introduced to Unicode last year. However, note that the same work that goes into CJK E would also enable (colorless) emoji. Hopefully that’ll give this issue a bit more traction. 😉

@1ec5
Copy link
Contributor

1ec5 commented Oct 26, 2016

My mistake: CJK E isn’t the only CJK block that’s in the Supplementary Ideographic Plane; CJK Unified Ideographs Extension B–D are also up there. Besides historical and Vietnamese characters, CJK B includes 1,702 characters from the Hong Kong Supplementary Character Set, which apparently includes a lot of Cantonese characters used in official Hong Kong place names.

@1ec5
Copy link
Contributor

1ec5 commented Nov 14, 2016

As of Unicode 9.0, the following astral-plane blocks allow ideographic breaking:

  • Meroitic Hieroglyphs
  • Egyptian Hieroglyphs
  • Anatolian Hieroglyphs
  • Ideographic Symbols and Punctuation
  • Tangut
  • Tangut Components
  • Kana Supplement
  • Tai Xuan Jing Symbols
  • Counting Rod Numerals
  • Mahjong Tiles
  • Domino Tiles
  • Playing Cards
  • Enclosed Alphanumeric Supplement
  • Enclosed Ideographic Supplement
  • Miscellaneous Symbols and Pictographs
  • Emoticons
  • Ornamental Dingbats
  • Transport and Map Symbols
  • Alchemical Symbols
  • Geometric Shapes Extended
  • Supplemental Symbols and Pictographs
  • CJK Unified Ideographs Extension B
  • CJK Unified Ideographs Extension C
  • CJK Unified Ideographs Extension D
  • CJK Unified Ideographs Extension E
  • CJK Compatibility Ideographs Supplement

As of Unicode 9.0 and revision 16 of UTR #50, the following astral-plane blocks have upright vertical orientation:

  • Meroitic Hieroglyphs
  • Siddham
  • Egyptian Hieroglyphs
  • Anatolian Hieroglyphs
  • Ideographic Symbols and Punctuation
  • Tangut
  • Tangut Components
  • Kana Supplement
  • Byzantine Musical Symbols
  • Musical Symbols
  • Tai Xuan Jing Symbols
  • Counting Rod Numerals
  • Sutton SignWriting
  • Mahjong Tiles
  • Domino Tiles
  • Playing Cards
  • Enclosed Alphanumeric Supplement
  • Enclosed Ideographic Supplement
  • Miscellaneous Symbols and Pictographs
  • Emoticons
  • Ornamental Dingbats
  • Transport and Map Symbols
  • Alchemical Symbols
  • Geometric Shapes Extended
  • Supplemental Symbols and Pictographs
  • CJK Unified Ideographs Extension B
  • CJK Unified Ideographs Extension C
  • CJK Unified Ideographs Extension D
  • CJK Unified Ideographs Extension E
  • CJK Compatibility Ideographs Supplement

The following astral-plane blocks have neutral vertical orientation:

  • Supplementary Private Use Area-A
  • Supplementary Private Use Area-B

1ec5 added a commit to mapbox/mapbox-gl-js that referenced this issue Nov 14, 2016
1ec5 added a commit to mapbox/mapbox-gl-js that referenced this issue Nov 14, 2016
* Refined vertical punctuation logic

Base the Unicode character blocks off of the official Unicode 9.0 character database.

Refined the logic distinguishing upright, rotated, and neutral characters based on Unicode Technical Report 50 (with some simplifications). In particular, not everything in the General Punctuation block is treated as having neutral orientation; instead, the vertical punctuation table is consulted.

* Removed commented-out supplementary plane entries

See mapbox/DEPRECATED-mapbox-gl#29 (comment) for an updated list.
@lucaswoj
Copy link
Author

Migrated to mapbox/mapbox-gl-js#4001

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants