- Release date: December 16, 2022
- Anserini dependency: v0.16.2
- Added bindings for Huggingface tokenizer.
- Added option to configure efSearch parameter in `pyserini.search.faiss`.
- Added initial implementation of `LuceneIndexer` that provides bindings to `SimpleIndexer` for on-the-fly indexing (i.e., no need to write jsonl output first).
- Refactored code to match Arg classes refactoring in Anserini.
- Improved parsing of key type (`str` or `int`) in query sets.
- Installed pre-built BEIR indexes (Lucene 9).
Contributors to this release, sorted by number of commits:
- Jimmy Lin (lintool)
- Chuan Meng (ChuanMeng)
- Xinyu (Crystina) Zhang (crystina-z)
- Ogundepo Odunayo (ToluClassics)
- Xueguang Ma (MXueguang)
- j762liu (ljatca)
- minconszhang (minconszhang)
All contributors with five or more commits, sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Xueguang Ma (MXueguang)
- Yuqi Liu (yuki617)
- Xinyu (Crystina) Zhang (crystina-z)
- Johnson Han (x65han)
- Stephanie Hu (stephaniewhoo)
- Manveer Tamber (manveertamber)
- Arthur Chen (ArthurChen189)
- Jack Lin (jacklin64)
- Hang Li (hanglics)
- Ronak Pradeep (ronakice)
- Matt J. H. Yang (justram)
- Ogundepo Odunayo (ToluClassics)
- Chris Kamphuis (Chriskamphuis)
- Habeeb Shopeju (HAKSOAT)
- Shengyao Zhuang (ArvinZhuang)
- Sailesh Nankani (saileshnankani)
- Xinyu Mavis Liu (x389liu)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Pepijn Boers (PepijnBoers)