Add full support for traditional and simplified Chinese labels #1955

peitili · 2021-06-14T22:13:50Z

Tilezen remaps localized names into 2-char languages codes from OpenStreetMap, Natural Earth, and OpenStreetMap – which each has their own way of representing name localizations.

In the case of Chinese (and possibly other languages), this "spoken" language has multiple "written" character sets (Traditional and Simplified) and is spoken and written in multiple countries using different configs.

But in Tilezen we only export a generic and ambiguous name:zh value. In UX design generally it's best practice to target each language as a combination of language + country code to allow for local colloquialisms. But for mapping sometimes less is better / mostly we're dealing in proper nouns - so another alternative is to say zh-hans (Chinese simplified irresepctive of country) and zh-hant (Chinese traditiional irrespective of country). Let's pick one and stick with it, and make it work with the point-of-view / worldview being introduced in v5.

For example:

Locale	Description
zh-CN	Chinese (Simplified, PRC)
zh-SG	Chinese (Simplified, Singapore)
zh-TW	Chinese (Traditional, Taiwan)
zh-HK	Chinese (Traditional, Hong Kong S.A.R.)

vector-datasource/vectordatasource/transform.py

Lines 523 to 558 in 024909e

    
           def _convert_wof_l10n_name(x): 
        
               lang_str_iso_639_3 = x[:3] 
        
               if len(lang_str_iso_639_3) != 3: 
        
                   return None 
        
               try: 
        
                   lang = pycountry.languages.get(alpha_3=lang_str_iso_639_3) 
        
               except KeyError: 
        
                   return None 
        
               return LangResult(code=_alpha_2_code_of(lang), priority=0) 
        
           def _convert_ne_l10n_name(x): 
        
               if len(x) != 2: 
        
                   return None 
        
               try: 
        
                   lang = pycountry.languages.get(alpha_2=x) 
        
               except KeyError: 
        
                   return None 
        
               return LangResult(code=_alpha_2_code_of(lang), priority=0) 
        
           def _normalize_osm_lang_code(x): 
        
               # first try an alpha-2 code 
        
               try: 
        
                   lang = pycountry.languages.get(alpha_2=x) 
        
               except KeyError: 
        
                   # next, try an alpha-3 code 
        
                   try: 
        
                       lang = pycountry.languages.get(alpha_3=x) 
        
                   except KeyError: 
        
                       # finally, try a "bibliographic" code 
        
                       try: 
        
                           lang = pycountry.languages.get(bibliographic=x) 
        
                       except KeyError: 
        
                           return None 
        
               return _alpha_2_code_of(lang)

The text was updated successfully, but these errors were encountered:

nvkelso · 2021-07-20T05:50:04Z

New preview files:

https://github.com/nvkelso/natural-earth-vector/files/6845759/v5.0.0-pre8-boundaries-pov-chinese.zip

This would need to be updated here:

https://github.com/tilezen/vector-datasource/blob/master/data/assets.yaml#L121

nvkelso · 2021-07-20T05:52:20Z

I think we take country names from OSM, so not clear why these are in the import statement:

https://github.com/tilezen/vector-datasource/blob/master/data/assets.yaml#L189-L199

This was referenced Jun 15, 2021

Chinese parser for OSM #1956

Merged

Add Chinese parser for WOF #1957

Merged

nvkelso changed the title ~~Add Chinese language parser~~ Add full support for traditional and simplified Chinese labels Jun 17, 2021

This was referenced Jun 17, 2021

Add full support for traditional and simplified Chinese labels #1952

Closed

Support both traditional and simplified Chinese name localizations nvkelso/natural-earth-vector#533

Closed

peitili mentioned this issue Jul 30, 2021

Add Chinese parser for NE and fix a few edge cases #1961

Merged

2 tasks

nvkelso mentioned this issue Jul 30, 2021

Remove name:zh for v2 #1962

Open

peitili self-assigned this Jan 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add full support for traditional and simplified Chinese labels #1955

Add full support for traditional and simplified Chinese labels #1955

peitili commented Jun 14, 2021 •

edited by nvkelso

Loading

nvkelso commented Jul 20, 2021

nvkelso commented Jul 20, 2021

Add full support for traditional and simplified Chinese labels #1955

Add full support for traditional and simplified Chinese labels #1955

Comments

peitili commented Jun 14, 2021 • edited by nvkelso Loading

nvkelso commented Jul 20, 2021

nvkelso commented Jul 20, 2021

peitili commented Jun 14, 2021 •

edited by nvkelso

Loading