Skip to content
@TurkuNLP

TurkuNLP Group - IT Department - University of Turku

Popular repositories Loading

  1. Turku-neural-parser-pipeline Turku-neural-parser-pipeline Public

    A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more than 50 languages. Top ranker in the CoNLL-18 Shared Task.

    Python 112 31

  2. FinBERT FinBERT Public

    BERT model trained from scratch on Finnish

    Shell 95 7

  3. Finnish-dep-parser Finnish-dep-parser Public

    The Finnish dependency parsing pipeline being developed by the Turku NLP group. Documentation:

    Python 49 10

  4. wikibert wikibert Public

    BERT models for many languages created from Wikipedia texts

    34 1

  5. Text_Mining_Course Text_Mining_Course Public

    Stuff for the Text Mining course

    Jupyter Notebook 28 9

  6. ocr-correction ocr-correction Public

    Post-processing OCR errors with seq2seq models

    Python 28 2

Repositories

Showing 10 of 126 repositories
  • TurkuNLP/LLM_document_descriptors’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Dec 13, 2024
  • list-of-publications Public

    Turku NLP list of publications

    TurkuNLP/list-of-publications’s past year of commit activity
    TeX 0 2 0 0 Updated Dec 12, 2024
  • htr-table-pipeline Public

    Handwritten text recognition pipeline for table data

    TurkuNLP/htr-table-pipeline’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated Dec 11, 2024
  • htr-annotations Public

    Handwritten text recognition annotations

    TurkuNLP/htr-annotations’s past year of commit activity
    0 0 0 0 Updated Dec 11, 2024
  • ATP_kurssi Public
    TurkuNLP/ATP_kurssi’s past year of commit activity
    Jupyter Notebook 4 4 0 0 Updated Dec 9, 2024
  • ocr-postcorrection-lm Public

    Code to try out ocr postcorrection with language models

    TurkuNLP/ocr-postcorrection-lm’s past year of commit activity
    Jupyter Notebook 0 0 1 0 Updated Dec 4, 2024
  • RAG-web-app Public
    TurkuNLP/RAG-web-app’s past year of commit activity
    HTML 3 0 6 0 Updated Dec 3, 2024
  • Keyword-embeddings-clusters Public

    Clusters with keywords grouped based on their word embeddings

    TurkuNLP/Keyword-embeddings-clusters’s past year of commit activity
    0 0 0 0 Updated Nov 29, 2024
  • ecco-ocr-ec Public
    TurkuNLP/ecco-ocr-ec’s past year of commit activity
    Python 0 0 0 0 Updated Nov 28, 2024
  • TurkuNLP/pytorch-registerlabeling’s past year of commit activity
    Python 1 1 0 0 Updated Nov 27, 2024