Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorrect P@5 Values #6

Closed
ryan-clancy opened this issue Jun 18, 2019 · 4 comments
Closed

Incorrect P@5 Values #6

ryan-clancy opened this issue Jun 18, 2019 · 4 comments

Comments

@ryan-clancy
Copy link
Member

When doing the official run, I get the following values from trec_eval:

Starting container from saved image...
Logs for search in container with ID eade5686b363406fb16c3d8c51b37436cd1879429c53ee11ba0cb60cae33f752...
CREATE DATABASE
CREATE CONNECTION
SCORING TOPICS
Evaluating results using trec_eval...
###
# /tmp/output/olddog/run.bm25.robust04
###
map                     all     0.1771
P_5                     all     0.4385

MAP looks good, but the P@5 value isn't what's expected.

Log and run files available at https://github.com/osirrc/osirrc2019-runs/tree/master/olddog/robust04/2019-06-17

@arjenpdevries
Copy link
Member

I am checking right now, but I think that the measure in the Table is P@30 rather than P@5, so our mistake in which number to report.

@arjenpdevries
Copy link
Member

Confirmed - the number we put in the table is P@30.

Will be fixed!

@arjenpdevries
Copy link
Member

@r-clancy and @lintool a question:
I computed P@5 using trec_eval and get the value @r-clancy reports above (Phew);
but if I use trec_eval -c, I get P@5 = 0.4297.

I think people should report the latter, not the former - i.e., a system has to be evaluated on all topics, not on all topics that they returned answers for. (Otherwise, I could get a very high MAP by being smart about selecting topics to include - even if that is difficult to do, it is not unimaginable.)

@lintool
Copy link
Member

lintool commented Jun 18, 2019

@arjenpdevries filed issue here: osirrc/jig#105

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants