Pointwise ABC, MonoT5, New Unit Tests #128

xpbowler · 2024-08-03T15:34:19Z

Summary of Changes

Bug fix in extract_kwargs function
Bug fix where IdentityReranker not returning top_k_rerank (only top_k_retrieve)
Add unit tests for API and retrieve_and_rerank
Added pointwise and pairwise ABC
Bug fix in Rank_LLM ABC parameters (missing kwargs)
...more changes below (@IR3KT4FUNZ)

…filename into the init function in PointwiseRankLLM, add PromptMode.MONOT5 = monot5 as prompt_mode

Pointwise changes

IR3KT4FUNZ · 2024-08-05T21:37:39Z

Changes:

add monot5 as a promptmode, and MonoT5 subclass of PointwiseRankLLM
write implementations for the abstract functions for MonoT5 and PointwiseRankLLM classes
update Reranker class to add support for the MonoT5 agent

test/api/output.txt

…nt to retrieve_and_rerank

…fix

…rting docs to prompts in pointwise by using the listwise functions, fix sorting order of monot5, add batching for run_llm_batched in pointwise_rankllm

…g to pointwise_rankllm class instead of individual pointwise model classes

ronakice

LGTM!

ronakice · 2024-08-12T02:51:53Z

src/rank_llm/rerank/pointwise/pointwise_rankllm.py

+try:
+    from vllm import LLM, SamplingParams
+except:
+    LLM = None
+    SamplingParams = None
+
+logger = logging.getLogger(__name__)


not needed?

ronakice · 2024-08-17T09:59:29Z

src/rank_llm/rerank/pointwise/monot5.py

+        self,
+        model: str,
+        prompt_mode: str = "monot5",
+        context_size: int = 512,


Create a new issue s.t., we support longer sequences eventually, nothing is limiting T5 usage with longer context (we've done it before)

ronakice · 2024-08-17T10:00:29Z

src/rank_llm/rerank/pointwise/monot5.py

+            truth_logit = logit_tensor[1176]
+            false_logit = logit_tensor[6136]
+            score = math.exp(truth_logit) / (


this is hardcoded and won't scale in the future to other T5 base models, note this as a T5 todo

ronakice · 2024-08-17T10:01:18Z

src/rank_llm/rerank/pointwise/monot5.py

+        outputs = self._tokenizer.decode(
+            output_ids, skip_special_tokens=True, spaces_between_special_tokens=False
+        )
+        truth_logit = logits[0][0][1176]


pairwise/pointwise ABC

92f201e

xpbowler marked this pull request as draft August 4, 2024 00:31

xpbowler marked this pull request as ready for review August 4, 2024 03:59

xpbowler marked this pull request as draft August 4, 2024 03:59

xpbowler and others added 8 commits August 4, 2024 16:55

add API, retrieve_and_rerank test cases

13c6ffd

Merge remote-tracking branch 'upstream/main'

bbf415a

run new unit tests in gh ci/cd

0723dda

add API + retrieve_and_rerank unit tests; update pr-format.yml

23a3e8d

modifying pointwise_rankllm ABC, and setting up monot5.py

4fc59cf

delete unnecesary abstract method passdown in PointwiseRankLLM, move …

6eae729

…filename into the init function in PointwiseRankLLM, add PromptMode.MONOT5 = monot5 as prompt_mode

fix unused imports for pull request check

37dab05

Merge pull request #1 from xpbowler/pointwise_changes

0dcc60b

Pointwise changes

xpbowler marked this pull request as ready for review August 5, 2024 21:58

xpbowler changed the title ~~Pairwise/Pointwise~~ Pointwise ABC, MonoT5, New Unit Tests Aug 5, 2024

ronakice reviewed Aug 6, 2024

View reviewed changes

test/api/output.txt Outdated Show resolved Hide resolved

IR3KT4FUNZ and others added 6 commits August 6, 2024 18:01

add progress bar, and fix run_rank_llm.py to pass in the query argume…

8206780

…nt to retrieve_and_rerank

remove pairwise (will be included in future PR)

f039f43

lint

4e03f49

change java version in workflow

ba70608

workflow

0a34638

add kwargs to run_llm_batched and run_llm

8490873

xpbowler mentioned this pull request Aug 7, 2024

Merge lit5 to rankllm #127

Merged

XKTZ added a commit to XKTZ/rank_llm that referenced this pull request Aug 7, 2024

Modified rankllm and reranker to avoid conflict with the castorini#128 …

382e2d2

…fix

IR3KT4FUNZ and others added 6 commits August 7, 2024 20:05

fixed window_size appearing in pointwise output filename, fixed conve…

aacf3cc

…rting docs to prompts in pointwise by using the listwise functions, fix sorting order of monot5, add batching for run_llm_batched in pointwise_rankllm

merge conflict resolution

7516ccd

add missing dependency

a5f1476

remove changes meant for mt5

9c96195

make progress bar go up to total number of (q, d) pairs, move batchin…

2b55bf5

…g to pointwise_rankllm class instead of individual pointwise model classes

merge main

754a2e0

xpbowler added 3 commits August 12, 2024 02:32

lint

531ebfa

small fix

d0128f3

removed double batch size in run_rank_llm

501a9a9

ronakice approved these changes Aug 17, 2024

View reviewed changes

ronakice merged commit 5fe9343 into castorini:main Aug 17, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pointwise ABC, MonoT5, New Unit Tests #128

Pointwise ABC, MonoT5, New Unit Tests #128

xpbowler commented Aug 3, 2024 •

edited

Loading

IR3KT4FUNZ commented Aug 5, 2024

ronakice left a comment

ronakice Aug 12, 2024

ronakice Aug 17, 2024

ronakice Aug 17, 2024

ronakice Aug 17, 2024

Pointwise ABC, MonoT5, New Unit Tests #128

Pointwise ABC, MonoT5, New Unit Tests #128

Conversation

xpbowler commented Aug 3, 2024 • edited Loading

Summary of Changes

IR3KT4FUNZ commented Aug 5, 2024

ronakice left a comment

Choose a reason for hiding this comment

ronakice Aug 12, 2024

Choose a reason for hiding this comment

ronakice Aug 17, 2024

Choose a reason for hiding this comment

ronakice Aug 17, 2024

Choose a reason for hiding this comment

ronakice Aug 17, 2024

Choose a reason for hiding this comment

xpbowler commented Aug 3, 2024 •

edited

Loading