Use ISO 639-3 (3-letter) language codes in the database #73

MattBlissett · 2018-10-05T13:09:44Z

We have checklists containing names with less-spoken languages, which only have ISO 639-3 3-letter language codes.

Our API exposes 3-letter codes, but languages are parsed to two-letter codes and stored in the database as two-letter codes.

We should change to use three-letter codes throughout (though still accepting 2-letter codes, of course).

This checklist has many vernacular names with less-spoken three-letter languages: https://www.gbif.org/species/search?dataset_key=a0b06e2e-287a-4687-8a6c-2c0cfb31c16d&origin=SOURCE&issue=VERNACULAR_NAME_INVALID&advanced=1

mdoering · 2018-10-08T10:46:56Z

The problem boils down to our Language enumeration which only tracks 2 letter codes:
https://github.com/gbif/gbif-api/blob/master/src/main/java/org/gbif/api/vocabulary/Language.java#L36

The API does not use strings but this enumeration

mdoering · 2018-10-08T10:51:07Z

changing the db would not be a big thing, but pimping the enum and the LanguageParser a bit more.
Wikipedia claims there are currently 7776 3 letter codes. Do we want to manage them in an enumeration still?

mdoering mentioned this issue Oct 8, 2018

Extend Language to ISO 639-3 codes gbif/gbif-api#29

Open

MattBlissett mentioned this issue Mar 7, 2023

Create Data Package metadata editor gbif/ipt#1829

Closed

48 tasks

peterdesmet mentioned this issue May 23, 2023

Use 3 letter codes for vernacular name language? tdwg/camtrap-dp#310

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use ISO 639-3 (3-letter) language codes in the database #73

Use ISO 639-3 (3-letter) language codes in the database #73

MattBlissett commented Oct 5, 2018

mdoering commented Oct 8, 2018

mdoering commented Oct 8, 2018

Use ISO 639-3 (3-letter) language codes in the database #73

Use ISO 639-3 (3-letter) language codes in the database #73

Comments

MattBlissett commented Oct 5, 2018

mdoering commented Oct 8, 2018

mdoering commented Oct 8, 2018