Skip to content

Commit

Permalink
fix: GermanDPR Dataset Causes Cross-Encoder Failure Due to Unexpected…
Browse files Browse the repository at this point in the history
… dict (#1621)

Fixes #1609
  • Loading branch information
KennethEnevoldsen authored Dec 22, 2024
1 parent e1b74f2 commit 748033e
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions mteb/tasks/Retrieval/deu/GermanDPRRetrieval.py
Original file line number Diff line number Diff line change
Expand Up @@ -83,6 +83,7 @@ def load_data(self, **kwargs):
)
corpus.update(neg_docs)
relevant_docs[q_id] = {k: 1 for k in pos_docs}
corpus = {doc["id"]: doc.get("title", "") + " " + doc["text"] for doc in corpus}
self.queries = {self._EVAL_SPLIT: queries}
self.corpus = {self._EVAL_SPLIT: corpus}
self.relevant_docs = {self._EVAL_SPLIT: relevant_docs}
Expand Down

0 comments on commit 748033e

Please sign in to comment.