MARC21 to CSV converter

Python script for converting MARC21 files to a saner format (CSV), originally designed for the freely available Harvard Library Bibliographic Dataset.

This script only conserves the following subset of MARC21 records (with additional simple post-processing on them):

Field	Description
020a	ISBN
245a/245b	Title
100/110/111	Author
260b	Publisher
260a	Publisher Place
300a	Extent (eg. pages)
300c	Physical dimensions
650c	Subject
All 5xx	Notes
988a	Inclusion date
906a	Governing source
690 5	Harvard Library

The last three fields are Harvard-specific (per Harvard's documentation) and can be easily excluded. Additional fields can also be easily added. See the Library of Congress' official documentation on MARC21 for standard field codes.

Usage

This script expects source files will be in data/mrc/ and will output the converted files to data/csv/. Simply run:

python convert.py

The script requries pymarc. Install it with:

pip install pymarc

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert.py		convert.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MARC21 to CSV converter

Usage

About

Releases

Packages

License

DavidChouinard/mrc_to_csv

Folders and files

Latest commit

History

Repository files navigation

MARC21 to CSV converter

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages