Optical Music Recognition in Manuscripts from the Ricordi Archive

Important

If you like our work and use it in your work, cite us:

Simonetta F., Mondal R., Ludovico L. A., Ntalampiras S. "Optical Music Recognition in Manuscripts from the Ricordi Archive", AudioMostly 2024, Milan, Italy. DOI: https://doi.org/10.1145/3678299.3678324

Setup

clone all submodules (git clone --recursive)
install pyenv: curl https://pyenv.run | bash

Prepare staff-line-removal

install miniconda2 PYTHON_CONFIGURE_OPTS="--enable-shared" pyenv install miniconda2-4.7.12
PYENV_VERSION=miniconda2-4.7.12 conda env create -f staffline.yml
PYENV_VERSION=miniconda2-4.7.12 conda init $(basename $SHELL) (you may need to fix the command substitution syntax)
exec $SHELL

Prepare our project

install python >= 3.9: pyenv install 3.9.16 (recommended is 3.9.16)
install pdm: pipx install pdm, other info on the website
install dependencies: pdm install
for Jupyter, you will also need NodeJS available on your system

This will use the PyPI CUDA libraries. For using different CUDA libraries, you have to change the pyproject.toml file using the extra options provided by pytorch (see docs).

Pre-process

setup the relative section in config.toml
remove staff-lines: ./preprocess.sh; if it stops, you can restart it
pdm preprocess

After this, you will find two files ending with _nostaff.jpg and _nostaff.json for each file in the archive, containing the image without staff and the data about that image (original file name, list of blobs, etc.). You will also find a directory ending with _nostaff for each file in the archive, containing the list of blobs, each in png format and accompanied by a .json file containing the position, the parent image path and the annotations (after the Data Entry step).

Data Entry

pdm data_entry

This will start the Flask app listening on all hosts requests to your machine on port 1992. You can configure the port in config.toml, as well as other options (documented in config.toml).

The server will create a 2 files:

__annotator.json:
- keys are annotators (see config.toml);
- value of each key is a list of lists where the i-th list represents the annotations given for the i-th control blob
__control.json:
- keys are normal and control
- values are lists of string with the path of the blobs in each split

Dataset analysis

pdm dataset_analysis

This command will run the rater agreement analysis using papermill and the notebook ./Confusion_Matrix_Annotation.ipynb

pdm dataset_creation (first, fix the dataset_path variable)

This command will create the dataset from the annotated files using papermill and the notebook ./Create_Dataset.ipynb. The Datasets will be in the directory ./data.

Model runs

First, check the value of the dataset_path variable.

pdm binary
pdm multiclass

These command will use papermill to run the notebooks OMR_Binary.ipynb and OMR_Multiclass.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
omr		omr
staff-lines-removal @ 4c8855e		staff-lines-removal @ 4c8855e
static		static
.gitignore		.gitignore
.gitmodules		.gitmodules
Confusion_Matrix_Annotation.ipynb		Confusion_Matrix_Annotation.ipynb
Create_Dataset.ipynb		Create_Dataset.ipynb
OMR_Binary.ipynb		OMR_Binary.ipynb
OMR_Multiclass.ipynb		OMR_Multiclass.ipynb
README.md		README.md
__annotator.json		__annotator.json
__control.json.xz		__control.json.xz
config.toml		config.toml
exploring_blob_detection.ipynb		exploring_blob_detection.ipynb
pdm.lock		pdm.lock
preprocess.sh		preprocess.sh
pyproject.toml		pyproject.toml
staffline.yml		staffline.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optical Music Recognition in Manuscripts from the Ricordi Archive

Setup

Prepare staff-line-removal

Prepare our project

Pre-process

Data Entry

Dataset analysis

Model runs

About

Releases

Packages

Languages

LIMUNIMI/RicordiArchiveOMR

Folders and files

Latest commit

History

Repository files navigation

Optical Music Recognition in Manuscripts from the Ricordi Archive

Setup

Prepare staff-line-removal

Prepare our project

Pre-process

Data Entry

Dataset analysis

Model runs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages