Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLDR-10478 Add latest data from Macau census #4063

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

conradarcturus
Copy link
Contributor

This updates the Macau population language data with up-to-date census information.

CLDR-10478

  • This PR completes the ticket.

In order to build the auto-generated files, use mvn package -DskipTests=true && java -jar tools/cldr-code/target/cldr-code.jar ConvertLanguageData && java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelySubtags

ALLOW_MANY_COMMITS=true

@conradarcturus
Copy link
Contributor Author

The only error affects the whole repo " (TestAll.java:165) Error: (TestAll.java:165) java.lang.IllegalArgumentException: Unknown system zone id: Etc/Unknown" -- not related to this change

@jira-pull-request-webhook
Copy link

Notice: the branch changed across the force-push!

  • common/supplemental/supplementalData.xml is different

View Diff Across Force-Push

~ Your Friendly Jira-GitHub PR Checker Bot

@srl295 srl295 deleted the branch unicode-org:main October 25, 2024 16:35
@srl295 srl295 closed this Oct 25, 2024
@srl295 srl295 reopened this Oct 25, 2024
@srl295 srl295 added the ddl DDL-SC specific work label Oct 25, 2024
@@ -5818,5 +5819,6 @@ XXX Code for transations where no currency is involved
<reference type="R1334" uri="https://en.wikipedia.org/wiki/Laz_people#cite_note-ethnologue-1">[missing]</reference>
<reference type="R1335" uri="https://en.wikipedia.org/wiki/Greeks_in_Russia_and_Ukraine#cite_ref-15">Greek population in Russia -- most ancestrally used Pontic Greek -- modern usage almost certainly has dropped off but we don't have clear statistics on current usage.</reference>
<reference type="R1336" uri="https://joshuaproject.net/people_groups/14444/TU">[missing]</reference>
<reference type="R1337" uri="https://stat.gov.pl/spisy-powszechne/nsp-2021/nsp-2021-wyniki-ostateczne/">Languages spoken at home in the 2021 Polish Census</reference>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the Polish census covers Macao?

Macao SAR China MO "682,070" 96% "71,820,000,000" official Portuguese pt 2.3% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language
Macao SAR China MO "682,070" 96% "71,820,000,000" de_facto_official Cantonese yue 86.2% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language
Macao SAR China MO "682,070" 96% "71,820,000,000" Filipino fil "20,879" https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language
Macao SAR China MO "682,070" 96% "71,820,000,000" Hokkien nan 3.7% https://www.dsec.gov.mo/getAttachment/7a3b17c2-22cc-4197-9bd5-ccc6eec388a2/E_CEN_PUB_2011_Y.aspx 2011 Census -- the language is not distinguished in the 2021 census
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this the reference intended?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes -- do you want me to re-write it?

Hokkien is a distinguished language in the 2011 Census but it does not appear in the 2021 Census for Macao.

@srl295 srl295 changed the base branch from _ddl/v47 to main October 25, 2024 17:14
@jira-pull-request-webhook
Copy link

Hooray! The files in the branch are the same across the force-push. 😃

~ Your Friendly Jira-GitHub PR Checker Bot

@jira-pull-request-webhook
Copy link

Notice: the branch changed across the force-push!

  • common/supplemental/likelySubtags.xml is different
  • common/supplemental/supplementalData.xml is different
  • common/testData/localeIdentifiers/likelySubtags.txt is different
  • common/testData/localeIdentifiers/localeDisplayName.txt is now changed in the branch
  • tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv is different

View Diff Across Force-Push

~ Your Friendly Jira-GitHub PR Checker Bot

@jira-pull-request-webhook
Copy link

Notice: the branch changed across the force-push!

  • common/supplemental/likelySubtags.xml is different
  • common/supplemental/supplementalData.xml is different
  • common/testData/localeIdentifiers/likelySubtags.txt is different
  • common/testData/localeIdentifiers/localeDisplayName.txt is no longer changed in the branch
  • tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv is different

View Diff Across Force-Push

~ Your Friendly Jira-GitHub PR Checker Bot

This updates the Macau population language data with up-to-date census information.

CLDR-10478 Update GenerateLikelyTestData

 java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelyTestData

CLDR-10478 Fix official languages for Macau

CLDR-10478 Remove `cmn` from Macau because of overlap with `zh`

Even though `cmn` knowledge is at 45% of Macau, since `zh` is implied to be `cmn` it ends up being double counted. Potentially we can separate `zh` from `cmn` -- but that's a whole new discussion that's best saved for later.

CLDR-10478 Add Cantonese (Macau) locale xml

Since I added a new locale that has "de_facto_official" status I need to add a new xml -- easy enough, I'll just have it inherit from root for now.

I also re-generated the test data with `java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelyTestData`

CLDR-10478 Add latest data from Macau census

This updates the Macau population language data with up-to-date census information.

CLDR-10478 Fix official languages for Macau

CLDR-10478 Remove `cmn` from Macau because of overlap with `zh`

Even though `cmn` knowledge is at 45% of Macau, since `zh` is implied to be `cmn` it ends up being double counted. Potentially we can separate `zh` from `cmn` -- but that's a whole new discussion that's best saved for later.
@jira-pull-request-webhook
Copy link

Hooray! The files in the branch are the same across the force-push. 😃

~ Your Friendly Jira-GitHub PR Checker Bot

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ddl DDL-SC specific work
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants