Add index statistics function #1218

Merged
merged 2 commits into from
May 23, 2020

Conversation

PepijnBoers
Contributor

Adds a function that returns a selection of index statistics as a map (these were previously only printed in IndexUtils).

codecov bot commented May 23, 2020

Codecov Report

Merging #1218 into master will increase coverage by 0.06%.
The diff coverage is 86.66%.

@@             Coverage Diff              @@
##             master    #1218      +/-   ##
============================================
+ Coverage     48.33%   48.39%   +0.06%     
- Complexity      739      741       +2     
============================================
  Files           147      147              
  Lines          8537     8552      +15     
  Branches       1212     1213       +1     
============================================
+ Hits           4126     4139      +13     
- Misses         4072     4074       +2     
  Partials        339      339              

| Impacted Files | Coverage Δ | Complexity Δ |
|---|---|---|
| .../main/java/io/anserini/index/IndexReaderUtils.java | 53.27% <86.66%> (+2.34%) | 31.00 <2.00> (+2.00) |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 67285c9...760bc85.

Terms terms = MultiTerms.getTerms(reader, IndexArgs.CONTENTS);

indexStats.put("documents", reader.numDocs());
indexStats.put("documents (non-empty)", reader.getDocCount(IndexArgs.CONTENTS));
Member

Shall we make these into Pythonic strings like "non_empty_documents"?
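
For illustration, here is a minimal sketch of what the suggested key naming might look like. It is not the code merged in this PR: the class `IndexStatsSketch` and the method `buildStats` are hypothetical names, and the counts are passed in as plain numbers so the example runs without Lucene. In the diff under review the values come from `reader.numDocs()` and `reader.getDocCount(IndexArgs.CONTENTS)`.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch only; names here are hypothetical, not part of Anserini.
class IndexStatsSketch {

  // Builds a statistics map that uses snake_case keys, as suggested in the review.
  // The counts are plain parameters here; in the PR they are read from the index.
  static Map<String, Object> buildStats(int numDocs, int nonEmptyDocs) {
    // LinkedHashMap keeps insertion order, so the keys print in a stable order.
    Map<String, Object> indexStats = new LinkedHashMap<>();
    indexStats.put("documents", numDocs);
    indexStats.put("non_empty_documents", nonEmptyDocs);
    return indexStats;
  }

  public static void main(String[] args) {
    // Made-up counts, purely for demonstration.
    System.out.println(buildStats(10, 8));
  }
}
```

Keys without spaces or parentheses are easier to use from Python callers, for example as `stats["non_empty_documents"]`, which seems to be the motivation behind the suggestion.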

@lintool lintool merged commit b541d1e into castorini:master May 23, 2020
crystina-z pushed a commit to crystina-z/anserini that referenced this pull request Oct 28, 2022
* add negative rocchio reranker in pyserini

* add test for negative feedback in Rocchio