    Repositories list

    • This is a corpus of written Zyrian Komi, built from Public Domain materials in the Fenno-Ugrica collection
      Python
      0000 Updated Oct 29, 2024
    • This is a digital edition of the Old Komi text corpus
      Jupyter Notebook
      0000 Updated Mar 11, 2024
    • Jupyter Notebook
      MIT License
      0010 Updated May 4, 2023
    • 1000 Updated Apr 11, 2023
    • PSDP-open

      Public
      Repository for open data and resources relating to Pite Saami
      TeX
      GNU General Public License v3.0
      0100 Updated Dec 9, 2022
    • IKDP-2

      Public
      Project website and documentation for the continuation project of IKDP
      0010 Updated Aug 3, 2021
    • Repository for aligned transcriptions of Erik Vászolyi's recordings.
      Jupyter Notebook
      0000 Updated Apr 13, 2021
    • This is a small Flask application that runs the pipeline from a web page.
      Python
      GNU General Public License v3.0
      0100 Updated Mar 26, 2021
    • DeepSpeech for ELAN users
      Python
      GNU General Public License v3.0
      0300 Updated Feb 1, 2021
    • langdoc

      Public
      Wider project description comes here
      0000 Updated Jul 15, 2020
    • elan-fst

      Public
      Script for workflow to add morphological analysis into ELAN files
      Python
      GNU General Public License v3.0
      11320 Updated May 15, 2020
    • Constraint Grammar-based pseudonymization method for the IKDP Spoken Komi corpus.
      MIT License
      0100 Updated Apr 7, 2020
    • berozovo

      Public
      Webpage for fieldwork trip related materials
      Python
      0000 Updated Mar 11, 2020
    • This is an open comparative database for different features in the Permic languages and dialects. New contributions are welcome.
      Creative Commons Zero v1.0 Universal
      0010 Updated Jan 6, 2020
    • eaf2korp

      Public
      This is a script that converts an ELAN file into VRT format used in Korp
      Jupyter Notebook
      Apache License 2.0
      0100 Updated Aug 5, 2019
    • nio

      Public
      FST Morphology for Nganasan
      Batchfile
      1100 Updated Mar 31, 2019
    • OCR Ground Truth for Unified Northern Alphabet
      HTML
      The Unlicense
      0300 Updated Mar 20, 2019
    • uralic

      Public
      Public Domain data from Uralic languages
      R
      Other
      3410 Updated Feb 28, 2019
    • 0100 Updated Feb 27, 2019
    • Pipeline for writing annotations from Giellatekno infrastructure to ELAN
      Python
      0400 Updated Feb 26, 2019
    • Database of settlements where Komi-Zyrian is spoken
      HTML
      0000 Updated Feb 18, 2019
    • kpv-lit

      Public
      Collection of Public Domain data in Komi-Zyrian
      Other
      0000 Updated Jan 5, 2019
    • R
      0000 Updated Jan 4, 2019
    • ocropy

      Public
      Python-based tools for document analysis and OCR
      Jupyter Notebook
      Apache License 2.0
      591000 Updated Jan 4, 2019
    • sjd-gold

      Public
      Open access, gold standard subsample of Kildin Saami corpus data for evaluation, testing and training
      Python
      0040 Updated Nov 12, 2018
    • FRechdoc

      Public
      Technical documentation about tools used in Freiburg. ELAN, XeLaTeX and RMarkdown templates.
      HTML
      1370 Updated Nov 3, 2018
    • Digital edition of the Syrjaenische Texte series
      HTML
      0000 Updated Oct 22, 2018
    • Python
      0100 Updated Aug 21, 2018
    • Web-based JavaScript GUI library for proofreading/editing hOCR
      JavaScript
      MIT License
      25300 Updated Jun 2, 2018
    • castren

      Public
      0000 Updated Jun 1, 2018