    Repositories list

    • ColBERT-X

      Public
      CLIR version of ColBERT
      Python
      MIT License
      Updated Sep 26, 2024
    • HC3

      Public
      HLTCOE CLIR Conversation Collection
      Python
      Other
      Updated Sep 2, 2024
    • turkle

      Public
      Django-based clone of Amazon's Mechanical Turk service running in your local environment.
      Python
      Other
      Updated Jun 26, 2024
    • PSQ

      Public
      Python
      Other
      Updated Apr 29, 2024
    • Implementation of the measure Probability of Equal Expected Rank
      Python
      MIT License
      Updated Apr 23, 2024
    • SIGIR 2023 tutorial on cross language information retrieval.
      Jupyter Notebook
      Creative Commons Zero v1.0 Universal
      Updated Feb 28, 2024
    • JavaScript library for working with Concrete, a data serialization format for NLP
      JavaScript
      Other
      Updated Oct 26, 2023
    • sandle

      Public
      Run a large language modeling SANDbox in your Local Environment
      Python
      Other
      Updated Oct 21, 2023
    • Python modules and scripts for working with Concrete, a data serialization format for NLP
      Python
      Other
      Updated Oct 20, 2023
    • JavaScript
      Updated Oct 12, 2023
    • BLADE

      Public
      Python
      Updated Aug 10, 2023
    • tasa

      Public
      TASA - Translation And Structural Alignment
      JavaScript
      Other
      Updated Jul 14, 2023
    • concrete

      Public
      Thrift definitions, making HLT data specifications concrete
      Thrift
      Other
      Updated Jul 10, 2023
    • patapsco

      Public
      Cross language information retrieval pipeline
      Python
      Other
      Updated Jun 9, 2023
    • HTML
      Updated Apr 14, 2023
    • client for the Turkle annotation platform
      Python
      Other
      Updated Dec 13, 2022
    • Python
      Updated Nov 29, 2022
    • HC4

      Public
      HLTCOE CLIR Common-Crawl Collection
      Python
      Updated Jul 18, 2022
    • Concrete-Stanford: Wraps Stanford NLP with utilities to fit it into a concrete compliant workflow
      Java
      GNU General Public License v3.0
      Updated Feb 26, 2022
    • Java wrappers and utilities for reading the Annotated NYT corpus
      Java
      Apache License 2.0
      Updated Jan 4, 2022
    • prototurk

      Public
      Simple server for rapidly prototyping Mechanical Turk interfaces
      Python
      Other
      Updated Sep 22, 2021
    • lid

      Public
      Python
      Updated Aug 21, 2021
    • stretcher

      Public
      Concrete file server
      Java
      Other
      Updated Aug 2, 2021
    • VBx

      Public
      Variational Bayes HMM over x-vectors diarization
      Python
      Updated Apr 28, 2021
    • xvectors

      Public
      Python
      Other
      Updated Apr 23, 2021
    • PredPatt

      Public
      PredPatt: Predicate-Argument Extraction from Universal Dependencies
      Python
      BSD 3-Clause "New" or "Revised" License
      Updated Feb 24, 2021
    • Dataset for exploring the uses of named entity recognition in information retrieval
      Python
      Updated Sep 1, 2020
    • Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper.
      Python
      Updated Jun 9, 2020
    • Jupyter Notebook
      Updated Mar 31, 2020
    • NER annotations of the Chinese Newspaper Renmin
      Python
      Updated Mar 24, 2020