LCCN incorrectly imported when containing alpha characters #2851
Labels
Affects: Data
Issues that affect book/author metadata or user/account data. [managed]
Lead: @hornc
Issues overseen by Charles (Staff: Data Engineering Lead) [managed]
Module: Import
Issues related to the configuration or use of importbot and other bulk import systems. [managed]
Priority: 2
Important, as time permits. [managed]
Theme: MARC records
Type: Bug
Something isn't working. [managed]
Significant non-numeric characters are being dropped from LCCNs on import from MARC records. This is a long-standing bug, so this may be a duplicate report, but I can't find it in the current bug tracker. Perhaps it's in one of the previous trackers and didn't get transferred.
As an example, this MARC 010$a field
010 $aa 47003377
got imported here as 47003377 instead of a47003377.
These leading alpha prefixes are significant for LCCNs, so must be preserved.
Note that https://lccn.loc.gov/a%2047003377 doesn't resolve, so we also need to strip spaces when constructing the URL.
The full structure of the LCCN is described here: https://www.loc.gov/marc/lccn_structure.html
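A minimal sketch of the behaviour I'd expect, not the importer's actual code. The helper names `normalize_lccn` and `lccn_url` are hypothetical, and the steps (remove blanks, drop anything after a slash, zero-pad the serial after a hyphen) are my reading of the Library of Congress normalization rules, so they should be checked against the structure document before use.

```python
import re


def normalize_lccn(raw: str) -> str:
    """Hypothetical helper: normalize an LCCN while keeping any alpha prefix."""
    # Remove all whitespace, including the blank between prefix and number.
    lccn = re.sub(r"\s+", "", raw)
    # Drop a revision suffix such as "/r86", if present.
    lccn = lccn.split("/", 1)[0]
    # Expand the hyphenated form by zero-padding the serial to six digits,
    # e.g. "85-2" becomes "85000002".
    if "-" in lccn:
        head, tail = lccn.split("-", 1)
        lccn = head + tail.zfill(6)
    return lccn


def lccn_url(raw: str) -> str:
    """Hypothetical helper: build a permalink from the normalized form."""
    return "https://lccn.loc.gov/" + normalize_lccn(raw)


print(normalize_lccn("a 47003377"))
print(lccn_url("a 47003377"))
```

With this, the 010$a value from the example keeps its leading "a", and the generated URL contains no encoded space.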
Stakeholders
@hornc
Not a showstopper, but should be fixed before doing any more MARC imports.