This repository provides tools and scripts to run retrieval and train a model to retrieve passage combinations that helps downstream retrieval-augmented QA the most. A baseline is also provided by simply retrieving and taking the top k passages.
-
Run Retrieval:
- Use the provided scripts indicated in retrieval README to run the retrieval process. This step involves generating representation vectors for the static documents dataset and retrieving the best matching passages given the query vectors.
-
Train Model:
- Train a model to retrieve passage combinations using the provided
.sh
files.
bash qa_passage_combination/train_and_evaluate.sh
- Train a model to retrieve passage combinations using the provided
-
Evaluate Model:
- Evaluate the trained model using the provided
.sh
files.
bash qa_passage_combination/test.sh
- Evaluate the trained model using the provided
-
Baseline:
- A baseline is provided by simply retrieving and taking the top k passages. This can be used as a reference to compare the performance of your trained model.
bash qa_baseline/rerank.sh bash qa_baseline/run_qa_all.sh
This project is based on and inspired by the work and code from the following repositories:
We thank the authors of these repositories for their contributions to the community.
This project is licensed under the MIT License.