Release date: May 6, 2020
- Integrated metadata from CSV with JSON of full-text articles in CORD-19.
- Renamed
Covid*
toCord19*
to more accurately name of corpus. - Updated support for CORD-19, up through data drop of 2020/05/01.
- Added manual blacklist to skip outlier articles in CORD-19.
- Added query generator (and output queries) from University of Delaware for TREC-COVID (round 1).
- Added instructions for generating baseline runs for TREC-COVID (round 1).
- Added topics for TREC-COVID (round 2).
- Added collection support for 20Newsgroups.
- Added support for taking stopwords from an external file.
- Added ability to compute document frequency for phrases.
- Added support for MS MARCO documents in Elasticsearch
- Improved support for multiple vectors with same id in nearest neighbor search.
- Fixed bug in Solrini regression for MS MARCO document.
- Fixed out-of-date documentation for MS MARCO regressions.
Sorted by number of commits:
- Jimmy Lin (lintool)
- Hang Cui (HangCui0510)
- Yuqi Liu (yuki617)
- Chris Kamphuis (Chriskamphuis)
- Edwin Zhang (edwinzhng)
- Eiston Wei (eiston)
- Johnson Han (x65han)
- Kuang Lu (lukuang)
- Tommaso Teofili (tteofili)
- Vera Lin (y276lin)
- niazarak (niazarak)
- Stephanie Hu (stephaniewhoo)
- Wei Pang (weipang142857)
Sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ryan Clancy (r-clancy)
- Ahmet Arslan (iorixxx)
- Royal Sequiera (rosequ)
- Emily Wang (emmileaf)
- Edwin Zhang (edwinzhng)
- Victor Yang (Victor0118)
- Tommaso Teofili (tteofili)
- Boris Lin (borislin)
- Chris Kamphuis (Chriskamphuis)
- Nikhil Gupta (nikhilro)
- Yuhao Xie (Kytabyte)
- Rodrigo Nogueira (rodrigonogueira4)
- Salman Mohammed (salman1993)
- Luchen Tan (LuchenTan)
- Xinyu Mavis Liu (x389liu)
- Zhiying Jiang (bazingagin)
- Michael Tu (tuzhucheng)
- Dayang Shi (dyshi)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Yuqi Liu (yuki617)
- Kuang Lu (lukuang)
- Peng Shi (Impavidity)
- Xin Qian (xeniaqian94)
- Adam Roegiest (aroegies)
- Weihua Li (w329li)
- Toke Eskildsen (tokee)
- Zhaohao Zeng (matthew-z)
- Hang Cui (HangCui0510)
- Xing Niu (xingniu)
- Ronak Pradeep (ronakice)
- Mina Farid (minafarid)
- Mengfei Liu (meng-f)
- Maik Fröbe (mam10eks)
- Adrien Grand (jpountz)
- Gaurav Baruah (gauravbaruah)
- Edward Lu (edwardhdlu)
- Adrien Pouyet (Ricocotam)
- Joel Mackenzie (JMMackenzie)
- Vera Lin (y276lin)
- Johnson Han (x65han)
- Wei Pang (weipang142857)
- Ruifan Yu (tiddler)
- Stephanie Hu (stephaniewhoo)
- Leonid Boytsov (searchivarius)
- Petek Yıldız (ptkyldz)
- niazarak (niazarak)
- Kevin Xu (kevinxyc1)
- Matt Yang (justram)
- Kelvin Jiang (kelvin-jiang)
- Guy Rosin (guyrosin)
- Eiston Wei (eiston)
- Charles Wu (charW)
- Matteo Catena (catenamatteo)
- Andrew Yates (andrewyates)
- Alireza Mirzaeiyan (amirzaeiyan)
- Antonio Mallia (amallia)
- Horatiu Lazu (MathBunny)
- Edward Li (LuKuuu)