Repositories list
17 repositories
Annif_API (Public): Instructions and pretrained models for using Annif (https://annif.org/) software for automatic subject indexing as a local service.
.github (Public)
Table_segmentation (Public): Code for segmenting table structures and detecting text content in document images.
Train_BERT_NER (Public): Code for training a Finnish named entity recognition (NER) model based on BERT.
Document_segmentation (Public)
Arkkiivi_UI (Public)
FaultyImageAPI (Public): API that combines empty page, post-it, folded corner and writing type detection models.
EmptyAPI (Public): API for detecting empty document images.
CornerAPI (Public): API for a machine learning model trained to detect folded or torn corners and edges from scanned document images.
PostitAPI (Public): API for a machine learning model trained to detect post-it/sticky notes from scanned document images.
NER_API (Public): API for performing named entity recognition from text input in Finnish.
WritingtypeAPI (Public): Repo for writingtype classifier API - Code that can be used for training a neural network model to classify input documents into distinct classes.
Train_fault_detection (Public): Code that can be used for training a neural network model to detect faults (sticky notes, folded corners etc.) in input documents.
Train_writing_type (Public): Training code for a deep learning model that detects document writing type from images.
Document-analysis_API (Public): This API uses Annif as a local server and includes an NER component. It also includes Tesseract and uses Apache Tika for language detection. It has limited multilingual support.
Empty_training (Public): Training code for a deep learning model that detects empty documents from images.