Code and data for the paper "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers".
We present in-context re-ranking (ICR), an efficient re-ranking method that directly leverages the attention patterns of LLMs for zero-shot re-ranking. By reading the LLM's "mind" through its attention rather than generating text, ICR dramatically reduces the computational cost of re-ranking.
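For intuition only, here is a minimal sketch of attention-based re-ranking with Hugging Face Transformers. This is not the repository's implementation: the model name, the prompt layout, and the aggregation scheme (averaging attention over layers and heads, then summing the attention that query tokens place on each document's span) are all assumptions made for the example.

# Illustrative sketch of attention-based re-ranking, NOT the ICR implementation.
# Model name, prompt layout, and the layer/head-averaged aggregation are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "meta-llama/Llama-3.1-8B-Instruct"  # placeholder: any decoder-only LLM
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_NAME, torch_dtype=torch.bfloat16, attn_implementation="eager"
)
model.eval()

def attention_scores(query, documents):
    """Score each document by the attention mass the query tokens place on its span."""
    doc_ids, spans, offset = [], [], 0
    for doc in documents:
        ids = tokenizer(doc + "\n", add_special_tokens=False)["input_ids"]
        spans.append((offset, offset + len(ids)))
        doc_ids.extend(ids)
        offset += len(ids)
    query_ids = tokenizer("Query: " + query, add_special_tokens=False)["input_ids"]
    input_ids = torch.tensor([doc_ids + query_ids])

    with torch.no_grad():
        out = model(input_ids, output_attentions=True)

    # out.attentions: one (batch, heads, seq, seq) tensor per layer.
    attn = torch.stack(out.attentions).mean(dim=(0, 2))[0]  # (seq, seq), layer/head average
    query_rows = attn[offset:]  # attention rows of the query tokens
    return [query_rows[:, s:e].sum().item() for s, e in spans]

# Higher score = higher rank, e.g.:
# scores = attention_scores("who wrote hamlet",
#                           ["Hamlet was written by Shakespeare.", "Pizza is an Italian dish."])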
Prepare BM25 retrieval results for the BEIR datasets with src/bm25_retrieval.ipynb (you will need to set up Pyserini). The retrieval results will be stored in retriever_outpout/.
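The notebook is the authoritative reference; the snippet below is only a rough sketch of the kind of Pyserini BM25 call involved, using one of Pyserini's prebuilt BEIR indexes (the SciFact index name and the query are placeholders).

# Illustrative only; see src/bm25_retrieval.ipynb for the actual pipeline.
from pyserini.search.lucene import LuceneSearcher

searcher = LuceneSearcher.from_prebuilt_index("beir-v1.0.0-scifact.flat")
hits = searcher.search("what causes lung cancer", k=20)
for rank, hit in enumerate(hits, start=1):
    print(rank, hit.docid, round(hit.score, 3))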
Download ColBERTv2 top-20 retrieval results for multi-hop datasets here and put them in retriever_outpout/.
Process your own data into the following JSON format:
[
    {
        "idx": "idx used to retrieve qrel records",
        "question": "query for retrieval or QA",
        "paragraphs": [
            {
                "idx": "idx of the document",
                "title": "title of the document",
                "paragraph_text": "text of the document",
                "is_supporting": "true/false, whether the document is a target for retrieval"
            },
            ...
        ]
    },
    ...
]
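As a sanity check, here is a minimal example of producing a file in this format; all field values and the output filename are placeholders.

# Minimal example of writing data in the expected format; values and path are placeholders.
import json

data = [
    {
        "idx": "q-0",
        "question": "Who wrote Hamlet?",
        "paragraphs": [
            {
                "idx": "d-0",
                "title": "Hamlet",
                "paragraph_text": "Hamlet is a tragedy written by William Shakespeare.",
                "is_supporting": True,
            },
            {
                "idx": "d-1",
                "title": "Pizza",
                "paragraph_text": "Pizza is an Italian dish.",
                "is_supporting": False,
            },
        ],
    }
]

with open("my_dataset.json", "w") as f:
    json.dump(data, f, indent=2)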
We provide the scripts for reproducing our experiments:
bash run_icr_beir.sh
bash run_icr_multihop.sh
If you find this work helpful, please consider citing our paper:
@misc{chen2024attentionlargelanguagemodels,
title={Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers},
author={Shijie Chen and Bernal Jiménez Gutiérrez and Yu Su},
year={2024},
eprint={2410.02642},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.02642},
}