
Useless files created when running BertQA.predict() #119

Closed
andrelmfarias opened this issue May 2, 2019 · 13 comments

@andrelmfarias
Collaborator

When we run the predict() method of BertQA, two files are also created in the directory where we run the code:

  1. nbest_predictions.json: an empty json file
  2. predictions.json: a json file with predictions for the paragraphs in the SQuAD-like dictionary fed as input to predict()

I suggest removing the creation of these files, as they are not needed. Alternatively, we could add a boolean parameter to keep the option of saving predictions.json, but set it to False by default.
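As an illustration, here is a minimal sketch of what such an opt-in flag could look like; the parameter name output_prediction_files and the surrounding code are hypothetical, not the current BertQA API:

```python
import json
import os

def predict(model, X, output_prediction_files=False, output_dir="."):
    """Hypothetical sketch, not the actual cdQA API: return predictions and only
    write predictions.json when the caller explicitly asks for it."""
    predictions = model.predict(X)  # assumed to return a dict of {question_id: answer}
    if output_prediction_files:
        with open(os.path.join(output_dir, "predictions.json"), "w") as f:
            json.dump(predictions, f, indent=2)
    return predictions
```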

andrelmfarias self-assigned this May 6, 2019
@fmikaelian
Collaborator

Just realized we need to keep predictions.json for evaluation (see #104)

@andrelmfarias
Collaborator Author

But not in the structure in which the current predictions.json is created. When we run predict() with BertQA (or QAPipeline) on the list of paragraphs sent by the retriever, it generates a json file with several answers for the same question.

The evaluation script for SQuAD compares only one answer per question.

By the way, the json file we will create for evaluation with the annotator will have only one answer per question. We therefore cannot compare (evaluate) the predictions in predictions.json against the json file of the annotated BNP dataset.
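For reference, the official SQuAD evaluation script expects predictions.json to map each question id to a single answer string, so a file with several candidate answers per question would have to be collapsed first. A minimal sketch, assuming the current file stores a list of scored candidates per question id (the "text" and "score" keys are assumptions about that format):

```python
import json

# Collapse multiple candidate answers per question into the SQuAD-style
# {question_id: single_answer_string} format expected by the evaluation script.
with open("predictions.json") as f:
    multi = json.load(f)  # assumed shape: {qid: [{"text": ..., "score": ...}, ...]}

collapsed = {qid: max(candidates, key=lambda c: c["score"])["text"]
             for qid, candidates in multi.items()}

with open("predictions_single.json", "w") as f:
    json.dump(collapsed, f, indent=2)
```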

@andrelmfarias
Collaborator Author

predictions.json should have only one answer per question sent to the model.

By the way, we should make sure that our sklearn version of the model can also receive a list of questions and generate a list of answers, as well as a json file with the questions and answers. We will need this for proper evaluation. Our current sklearn wrapper does not work with a list of questions.
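A rough sketch of what a list-friendly sklearn-style predict() could look like; the class, method, and attribute names here are illustrative, not the current wrapper:

```python
# Illustrative sketch only; not the actual cdQA sklearn wrapper.
class ListFriendlyQA:
    def __init__(self, retriever, reader):
        self.retriever = retriever
        self.reader = reader

    def predict(self, questions):
        """Accept a single question or a list of questions; return one answer per question."""
        if isinstance(questions, str):
            questions = [questions]
        answers = []
        for question in questions:
            paragraphs = self.retriever.retrieve(question)          # assumed retriever API
            candidates = self.reader.predict(question, paragraphs)  # assumed: one candidate per paragraph
            answers.append(max(candidates, key=lambda c: c["score"])["text"])
        return answers
```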

@fmikaelian
Collaborator

> Our current sklearn wrapper does not work with a list of questions.

I thought you had already checked the sklearn version? See #70

@andrelmfarias
Collaborator Author

I am talking about QAPipeline.predict().

It still works for only one question: it sends BertQA a list of question-paragraph pairs (the same question paired with different paragraphs), and BertQA makes a prediction for each of these pairs.

We still have to make it able to apply the retriever to several questions, send those questions with their paragraphs to the reader, and obtain a predictions.json with only one prediction per question.

@fmikaelian
Collaborator

> I am talking about QAPipeline.predict().
>
> It still works for only one question: it sends BertQA a list of question-paragraph pairs (the same question paired with different paragraphs), and BertQA makes a prediction for each of these pairs.
>
> We still have to make it able to apply the retriever to several questions, send those questions with their paragraphs to the reader, and obtain a predictions.json with only one prediction per question.

I think I got it. Is it an easy change, like a for loop over X in predict()?

@andrelmfarias
Collaborator Author

Yes, it is.

But we still have to handle predictions.json. It is currently generated by BertQA.predict().

For our needs, it should be generated by QAPipeline.predict().
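Along those lines, a hedged sketch of how the pipeline, rather than the reader, could own the predictions.json export; the function and the retriever/reader calls below are assumptions, not the current API:

```python
import json

# Sketch: loop over questions in the pipeline and write one answer per question
# to predictions.json from the pipeline itself, not from BertQA.
def pipeline_predict(retriever, reader, questions, output_file=None):
    predictions = {}
    for qid, question in enumerate(questions):
        paragraphs = retriever.retrieve(question)          # top-k paragraphs for this question
        candidates = reader.predict(question, paragraphs)  # one candidate answer per paragraph
        predictions[str(qid)] = max(candidates, key=lambda c: c["score"])["text"]
    if output_file is not None:
        with open(output_file, "w") as f:
            json.dump(predictions, f, indent=2)
    return predictions
```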

@fmikaelian
Collaborator

fmikaelian commented May 6, 2019

Actually there are 2 kinds of evaluations:

  1. "Reader only"
  2. "QAPipeline" (= Retriever + Reader)

I think the "reader only" evaluation is already working, because you can do a "multiple predict". But we probably cannot evaluate with "QAPipeline", since it can only do a "single predict".

Is that correct?

@andrelmfarias
Collaborator Author

Yes, exactly, there are 2 kinds of evaluations.

I always thought that what interests us in our work is the evaluation of the whole pipeline, which measures the effectiveness of the app. I was also aware that this evaluation is not comparable to the evaluation of the model on SQuAD.

Now that you mention the reader-only evaluation (which is comparable to the evaluation on SQuAD), I think we can do both.

@fmikaelian
Collaborator

So I propose implementing "multiple predict" for QAPipeline(), so that we can do the "QAPipeline evaluation" as well?

Then we'll report both evaluations in the paper?

@andrelmfarias
Collaborator Author

I agree.

@fmikaelian
Collaborator

fmikaelian commented May 15, 2019

@andrelmfarias

Would you like to add a boolean parameter, equal to False by default, for the export of these files?

@andrelmfarias
Collaborator Author

Yes, I think that is a good solution for this issue.
