
Enable ROUGEScore to evaluate hypotheses against multiple references. #667

Closed
stancld opened this issue Dec 7, 2021 · 8 comments · Fixed by #680
Labels: enhancement (New feature or request), topic: Text
Milestone: v0.7

Comments

@stancld
Contributor

stancld commented Dec 7, 2021

🚀 Feature

Enable ROUGEScore to evaluate hypotheses against multiple references.

Motivation

In the original paper, Lin (2004) proposes evaluating a hypothesis against multiple references, with the maximum pairwise score eventually used.
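
For concreteness, a minimal sketch of the max-pairwise aggregation described above. `single_reference_rouge` is a hypothetical stand-in for the existing single-reference scorer, not the torchmetrics implementation:

```python
from typing import Callable, Dict, Sequence


def max_pairwise_rouge(
    hypothesis: str,
    references: Sequence[str],
    single_reference_rouge: Callable[[str, str], Dict[str, float]],
) -> Dict[str, float]:
    """Score the hypothesis against each reference and keep the best value per key."""
    best: Dict[str, float] = {}
    for reference in references:
        for key, value in single_reference_rouge(hypothesis, reference).items():
            best[key] = max(best.get(key, float("-inf")), value)
    return best
```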

Pitch

Enable ROUGEScore to evaluate hypotheses against multiple references, as is the case for other text metrics.

Alternatives

Leave it as it is.

Additional context

Ideal for #new-contributors

stancld added the enhancement label on Dec 7, 2021
Borda added this to the v0.7 milestone on Dec 8, 2021
@ashutoshml
Contributor

I can take a look at this

@ashutoshml
Contributor

ashutoshml commented Dec 9, 2021

I started working on it. The requirements say: "maximum pairwise score is used". Depending on rouge_types (1, 2, L, ...), there are different highest values, each with its own precision, recall, and fmeasure. Which rouge_type's highest value should be used (1, 2, L), and which measure (precision, recall, or fmeasure)?

Also, should we have an avg. version (instead of just maximum pairwise)? It could be passed as an argument during init.
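
One possible reading, sketched below, is to take the maximum (or average) independently for every rouge_type and every measure, with the aggregator chosen at init time. The dict layout here is only an illustration of the question, not the final API:

```python
from statistics import mean
from typing import Dict, List


def aggregate(
    per_reference: List[Dict[str, Dict[str, float]]], accumulate: str = "max"
) -> Dict[str, Dict[str, float]]:
    """per_reference[i]["rouge1"]["fmeasure"] is the score against reference i."""
    reducer = max if accumulate == "max" else mean
    measures = ("precision", "recall", "fmeasure")
    return {
        rouge_type: {m: reducer(scores[rouge_type][m] for scores in per_reference) for m in measures}
        for rouge_type in per_reference[0]
    }
```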


@ashutoshml
Contributor

@stancld I have written the code for maximum pairwise.
I wanted to know if we can quickly run just test_rouge.py using pytest; make test seems to take a lot of time.
I'll run the full test once test_rouge.py works.

@stancld
Contributor Author

stancld commented Dec 9, 2021

Hi @ashutoshml, thanks a lot for your effort! O:] I'll have a look tomorrow, but you can run:

pytest tests/text/test_rouge.py

in the project directory to run ROUGE test only :]

@ashutoshml
Contributor

Currently, we have

def update(self, preds: Union[str, List[str]], targets: Union[str, List[str]]) -> None:

Should we convert it into

def update(self, preds: Union[str, List[str]], targets: Union[str, List[str], List[List[str]]]) -> None:

?

This would handle cases where we have a list of predictions and a list of lists of references.
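
A minimal sketch of how the broadened signature could normalize its inputs. `_normalize_inputs` is a hypothetical helper for illustration; the merged implementation may look different:

```python
from typing import List, Tuple, Union


def _normalize_inputs(
    preds: Union[str, List[str]],
    targets: Union[str, List[str], List[List[str]]],
) -> Tuple[List[str], List[List[str]]]:
    """Coerce every accepted input shape to (list of preds, list of reference lists)."""
    if isinstance(preds, str):
        preds = [preds]
    if isinstance(targets, str):
        targets = [[targets]]
    elif targets and isinstance(targets[0], str):
        # assume one reference per prediction; wrap each in its own list
        targets = [[target] for target in targets]
    return preds, targets
```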

@stancld
Contributor Author

stancld commented Dec 10, 2021

Hi @ashutoshml, yes, I think we may start with something like this. Feel free to open a draft pull request once ready and we'll have a proper look :]

@ashutoshml
Contributor

@stancld During testing, we compare our score against the scores given by the rouge-score==0.0.4 package, which does not have a multiple-reference version. How do we write the test cases for this, i.e., what would be the baseline?
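
One option (not necessarily what the PR ends up doing) is to build the baseline by looping the single-reference rouge-score scorer over the references and keeping the best score per sample. `reference_baseline` is a hypothetical test helper:

```python
from typing import Dict, List

from rouge_score import rouge_scorer


def reference_baseline(pred: str, references: List[str], rouge_types: List[str]) -> Dict[str, float]:
    """Best fmeasure per rouge type across all references, using the single-reference scorer."""
    scorer = rouge_scorer.RougeScorer(rouge_types, use_stemmer=False)
    best = {rouge_type: float("-inf") for rouge_type in rouge_types}
    for ref in references:
        scores = scorer.score(ref, pred)  # rouge-score expects (target, prediction)
        for rouge_type in rouge_types:
            best[rouge_type] = max(best[rouge_type], scores[rouge_type].fmeasure)
    return best
```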

@ashutoshml
Contributor

@stancld Opened a draft pull request. Kindly check.
