Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REVIEW]: pocoMC: A Python package for accelerated Bayesian inference in astronomy and cosmology #4634

Closed
editorialbot opened this issue Aug 1, 2022 · 54 comments
Assignees
Labels
accepted published Papers published in JOSS Python recommend-accept Papers recommended for acceptance in JOSS. review TeX Track: 1 (AASS) Astronomy, Astrophysics, and Space Sciences

Comments

@editorialbot
Copy link
Collaborator

editorialbot commented Aug 1, 2022

Submitting author: @minaskar (Minas Karamanis)
Repository: https://github.com/minaskar/pocomc
Branch with paper.md (empty if default branch):
Version: 0.2.4
Editor: @dfm
Reviewers: @kazewong, @marylou-gabrie
Archive: 10.5281/zenodo.7308533

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/e7d10f594f5c8eb682d29dd84aaf71be"><img src="https://joss.theoj.org/papers/e7d10f594f5c8eb682d29dd84aaf71be/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/e7d10f594f5c8eb682d29dd84aaf71be/status.svg)](https://joss.theoj.org/papers/e7d10f594f5c8eb682d29dd84aaf71be)

Reviewers and authors:

Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) by leaving comments in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)

Reviewer instructions & questions

@kazewong & @marylou-gabrie, your review will be checklist based. Each of you will have a separate checklist that you should update when carrying out your review.
First of all you need to run this command in a separate comment to create the checklist:

@editorialbot generate my checklist

The reviewer guidelines are available here: https://joss.readthedocs.io/en/latest/reviewer_guidelines.html. Any questions/concerns please let @dfm know.

Please start on your review when you are able, and be sure to complete your review in the next six weeks, at the very latest

Checklists

📝 Checklist for @kazewong

📝 Checklist for @marylou-gabrie

@editorialbot
Copy link
Collaborator Author

Hello humans, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

@editorialbot
Copy link
Collaborator Author

Software report:

github.com/AlDanial/cloc v 1.88  T=0.20 s (723.0 files/s, 229491.3 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
JavaScript                      12           2426           2523           9374
SVG                              6              0             13           9319
HTML                            17           1199            251           5692
CSS                             10            398            138           3169
PO File                         45           1087              0           2444
Python                          20            667           1528           2163
reStructuredText                14            286            146            503
Jupyter Notebook                13              0           2499            445
Markdown                         4             95              0            311
YAML                             4             22             17            132
TeX                              1              8              0             66
DOS Batch                        1              8              1             26
make                             1              4              7              9
-------------------------------------------------------------------------------
SUM:                           148           6200           7123          33653
-------------------------------------------------------------------------------


gitinspector failed to run statistical information for the repository

@editorialbot
Copy link
Collaborator Author

Wordcount for paper.md is 1254

@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- None

MISSING DOIs

- 10.1214/06-ba127 may be a valid DOI for title: Nested sampling for general Bayesian computation

INVALID DOIs

- None

@editorialbot
Copy link
Collaborator Author

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

@dfm
Copy link

dfm commented Aug 1, 2022

@kazewong, @marylou-gabrie — This is the review thread for the paper. All of our communications will happen here from now on. Thanks again for agreeing to participate!

Please read the "Reviewer instructions & questions" in the first comment above, and generate your checklists by commenting @editorialbot generate my checklist on this issue ASAP. As you go over the submission, please check any items that you feel have been satisfied. There are also links to the JOSS reviewer guidelines.

The JOSS review is different from most other journals. Our goal is to work with the authors to help them meet our criteria instead of merely passing judgment on the submission. As such, the reviewers are encouraged to submit issues and pull requests on the software repository. When doing so, please mention openjournals/joss-reviews#4634 so that a link is created to this thread (and I can keep an eye on what is happening). Please also feel free to comment and ask questions on this thread. In my experience, it is better to post comments/questions/suggestions as you come across them instead of waiting until you've reviewed the entire package.

We aim for the review process to be completed within about 4-6 weeks but please try to make a start ahead of this as JOSS reviews are by their nature iterative and any early feedback you may be able to provide to the author will be very helpful in meeting this schedule.

@kazewong
Copy link

kazewong commented Aug 1, 2022

Review checklist for @kazewong

Conflict of interest

  • I confirm that I have read the JOSS conflict of interest (COI) policy and that: I have no COIs with reviewing this work or that any perceived COIs have been waived by JOSS for the purpose of this review.

Code of Conduct

General checks

  • Repository: Is the source code for this software available at the https://github.com/minaskar/pocomc?
  • License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
  • Contribution and authorship: Has the submitting author (@minaskar) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?
  • Substantial scholarly effort: Does this submission meet the scope eligibility described in the JOSS guidelines

Functionality

  • Installation: Does installation proceed as outlined in the documentation?
  • Functionality: Have the functional claims of the software been confirmed?
  • Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

  • A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
  • Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
  • Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
  • Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
  • Automated tests: Are there automated tests or manual steps described so that the functionality of the software can be verified?
  • Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

  • Summary: Has a clear description of the high-level functionality and purpose of the software for a diverse, non-specialist audience been provided?
  • A statement of need: Does the paper have a section titled 'Statement of need' that clearly states what problems the software is designed to solve, who the target audience is, and its relation to other work?
  • State of the field: Do the authors describe how this software compares to other commonly-used packages?
  • Quality of writing: Is the paper well written (i.e., it does not require editing for structure, language, or writing quality)?
  • References: Is the list of references complete, and is everything cited appropriately that should be cited (e.g., papers, datasets, software)? Do references in the text use the proper citation syntax?

@marylou-gabrie
Copy link

marylou-gabrie commented Aug 2, 2022

Review checklist for @marylou-gabrie

Conflict of interest

  • I confirm that I have read the JOSS conflict of interest (COI) policy and that: I have no COIs with reviewing this work or that any perceived COIs have been waived by JOSS for the purpose of this review.

Code of Conduct

General checks

  • Repository: Is the source code for this software available at the https://github.com/minaskar/pocomc?
  • License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
  • Contribution and authorship: Has the submitting author (@minaskar) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?
  • Substantial scholarly effort: Does this submission meet the scope eligibility described in the JOSS guidelines

Functionality

  • Installation: Does installation proceed as outlined in the documentation?
  • Functionality: Have the functional claims of the software been confirmed?
  • Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

  • A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
  • Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
  • Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
  • Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
  • Automated tests: Are there automated tests or manual steps described so that the functionality of the software can be verified?
  • Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

  • Summary: Has a clear description of the high-level functionality and purpose of the software for a diverse, non-specialist audience been provided?
  • A statement of need: Does the paper have a section titled 'Statement of need' that clearly states what problems the software is designed to solve, who the target audience is, and its relation to other work?
  • State of the field: Do the authors describe how this software compares to other commonly-used packages?
  • Quality of writing: Is the paper well written (i.e., it does not require editing for structure, language, or writing quality)?
  • References: Is the list of references complete, and is everything cited appropriately that should be cited (e.g., papers, datasets, software)? Do references in the text use the proper citation syntax?

@editorialbot editorialbot added the Track: 1 (AASS) Astronomy, Astrophysics, and Space Sciences label Aug 12, 2022
@xuanxu xuanxu added Track: 1 (AASS) Astronomy, Astrophysics, and Space Sciences and removed Track: 1 (AASS) Astronomy, Astrophysics, and Space Sciences labels Aug 12, 2022
@dfm
Copy link

dfm commented Aug 25, 2022

@kazewong, @marylou-gabrie — Just a quick check in here to keep this on your radar. Let us know if you run into any issues!

@marylou-gabrie
Copy link

marylou-gabrie commented Aug 29, 2022

A few questions:

  • What does the scale quantity logged in the results refer to? I did not find an explanation in the documentation.

  • Is there a way to access the trained flow (or flows)? This is definitely for advanced usage, but maybe it would be good to have a way to assess the quality of training of the flow model. Then, one could assess whether it is necessary to play with the training/config parameters to speedup the sampling.

  • What is the training loss function for the flow? I might have missed something but I did not find this info, neither in the JOSS submission nor in the accompanying paper.

  • Are the authors aware of typical cases in which the method fails? If yes it would be interesting to add a concise mention to limitations.

Overall comment:

Overall the documentation is well detailed and the effort to automate the sequential scheme is remarkable. The paper gives a concise overview of the method, directed to a wise audience. I do note that the specifics of the algorithms and many related references are instead in the accompanying paper (submitted to a different venue I imagine).

Minor:

  • The correlation coefficient threshold changes notation between the "Background"section of the docs (CC) and the "Advanced Guide"(gamma).

@kazewong
Copy link

kazewong commented Aug 30, 2022

Questions/comments related to unchecked items

  • Performance/Summary: See this thread Re: Summary checkpoint in Joss review minaskar/pocomc#17 (comment)
  • Community Guideline: I don't see any comments for seeking support. Per the checklist, I think the Authors should add some guideline to seeking support.
  • State-of-the-field/references: I think there are missing references which @marylou-gabrie have also flagged. For state-of-the-field, specifically related to code, I don't think it is any big issue, but the authors should add a couple references related to previous studies that didn't come with open-source codes. Once the authors added the reference suggested by @marylou-gabrie, I am happy to check both boxes out.

@minaskar
Copy link

minaskar commented Sep 7, 2022

Thank you both for your comments!

@marylou-gabrie

  • The scale variable refers to the ratio of the Metropolis proposal scale (in latent space) to the optimal proposal scale 2.38/Sqrt(D). It is a metric of the preconditioner's performance (i.e. 1 corresponds to perfect preconditioning). I'll make sure we include an explanation in the docs.
  • The trained flow can be accessed using the Flow class method of the sampler.
  • The flow loss function is the usual forward KL divergence $D_{KL}[p|q]$.
  • Cases in which PMC (and thus pocoMC) might not be the best choice are discussed in the Discussion section of the accompanying paper and include high-dimensional targets and/or cases in which the likelihood evaluation is cheap compared to the flow training cost. Of course, one can construct artificial cases in which the multimodality is so strong that the flow is not expressive enough to capture it. Since the code is intended for astronomical/cosmological research we do not expect any such difficulties.

@kazewong

@marylou-gabrie
Copy link

Thanks @minaskar for your helpful answers.

My only remaining question is how do you evaluate the forward KL: using which samples (and potential reweighting) to approximate the expectation over $p$ (where $p$ is the succesively annealed distribution if I am not mistaken)?

Please let us know when the missing references have been added to the paper.

After that, all good for me.

@kazewong
Copy link

kazewong commented Sep 19, 2022

I have added comments in the discussion thread in the code repo. Once that is addressed, I will be happy with the submission.

@dfm
Copy link

dfm commented Oct 8, 2022

@minaskar — I wanted to check in here since I think we're waiting on your responses to @kazewong, @marylou-gabrie's final small comments. Let us know if anything isn't clear or if you've addressed these issues. Thanks!

@minaskar
Copy link

minaskar commented Oct 9, 2022

@marylou-gabrie We're currently using the samples from the current annealed distribution in order to train the flow that will serve as the preconditioner for the next annealed distribution. Since the beta-spacings are small enough we found no benefit in reweighting the samples.

@dfm Thanks for the reminder and apologies for the delayed response, we will address the remaining comments during the next few days.

@minaskar
Copy link

Hi @kazewong and @marylou-gabrie,

I've added the missing references and fixed the minor issues. Let me know if there's anything else missing.

@dfm
Copy link

dfm commented Oct 26, 2022

@kazewong, @marylou-gabrie — Can you both take a look at @minaskar's responses to your feedback and let us know if there are any remaining issues? Thanks!!

@minaskar
Copy link

minaskar commented Nov 9, 2022

@dfm I merged the PR (actually made a new PR and merged it to the dev branch first, as the main is protected).

Version number is 0.2.3 and DOI is 10.5281/zenodo.7306240

@dfm
Copy link

dfm commented Nov 9, 2022

@minaskar — Great! Can you make an updated archive using a clean checkout or directly from this zip file: https://github.com/minaskar/pocomc/archive/refs/tags/0.2.3.zip ? The version at 10.5281/zenodo.7306240 contains a lot of temporary files and build products. Thanks!

@minaskar
Copy link

minaskar commented Nov 9, 2022

@dfm Done, the new DOI is 10.5281/zenodo.7308366

@minaskar
Copy link

minaskar commented Nov 9, 2022

@dfm DOI is 10.5281/zenodo.7308533 and version number is 0.2.4

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot set 0.2.4 as version

@editorialbot
Copy link
Collaborator Author

Done! version is now 0.2.4

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot set 10.5281/zenodo.7308533 as archive

@editorialbot
Copy link
Collaborator Author

Done! Archive is now 10.5281/zenodo.7308533

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot check references

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot generate pdf

@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.1093/mnras/stac2272 is OK
- 10.1146/annurev-astro-082214-122339 is OK
- 10.1214/06-ba127 is OK
- 10.1111/j.1467-9868.2006.00553.x is OK
- 10.1073/pnas.2109420119 is OK
- 10.1073/pnas.2109420119 is OK
- 10.1093/mnras/staa1469 is OK

MISSING DOIs

- None

INVALID DOIs

- None

@editorialbot
Copy link
Collaborator Author

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot recommend-accept

@editorialbot
Copy link
Collaborator Author

Attempting dry run of processing paper acceptance...

@editorialbot
Copy link
Collaborator Author

👋 @openjournals/aass-eics, this paper is ready to be accepted and published.

Check final proof 👉📄 Download article

If the paper PDF and the deposit XML files look good in openjournals/joss-papers#3702, then you can now move forward with accepting the submission by compiling again with the command @editorialbot accept

@editorialbot editorialbot added the recommend-accept Papers recommended for acceptance in JOSS. label Nov 9, 2022
@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.1093/mnras/stac2272 is OK
- 10.1146/annurev-astro-082214-122339 is OK
- 10.1214/06-ba127 is OK
- 10.1111/j.1467-9868.2006.00553.x is OK
- 10.1073/pnas.2109420119 is OK
- 10.1073/pnas.2109420119 is OK
- 10.1093/mnras/staa1469 is OK

MISSING DOIs

- None

INVALID DOIs

- None

@dfm
Copy link

dfm commented Nov 9, 2022

@editorialbot accept

@editorialbot
Copy link
Collaborator Author

Doing it live! Attempting automated processing of paper acceptance...

@editorialbot
Copy link
Collaborator Author

🐦🐦🐦 👉 Tweet for this paper 👈 🐦🐦🐦

@editorialbot
Copy link
Collaborator Author

🚨🚨🚨 THIS IS NOT A DRILL, YOU HAVE JUST ACCEPTED A PAPER INTO JOSS! 🚨🚨🚨

Here's what you must now do:

  1. Check final PDF and Crossref metadata that was deposited 👉 Creating pull request for 10.21105.joss.04634 joss-papers#3703
  2. Wait a couple of minutes, then verify that the paper DOI resolves https://doi.org/10.21105/joss.04634
  3. If everything looks good, then close this review issue.
  4. Party like you just published a paper! 🎉🌈🦄💃👻🤘

Any issues? Notify your editorial technical team...

@editorialbot editorialbot added accepted published Papers published in JOSS labels Nov 9, 2022
@dfm
Copy link

dfm commented Nov 9, 2022

@kazewong, @marylou-gabrie — many thanks for your reviews here! JOSS relies upon the volunteer effort of people like you and we simply wouldn't be able to do this without you ✨

@minaskar — your paper is now accepted and published in JOSS ⚡🚀💥

@dfm dfm closed this as completed Nov 9, 2022
@editorialbot
Copy link
Collaborator Author

🎉🎉🎉 Congratulations on your paper acceptance! 🎉🎉🎉

If you would like to include a link to your paper from your README use the following code snippets:

Markdown:
[![DOI](https://joss.theoj.org/papers/10.21105/joss.04634/status.svg)](https://doi.org/10.21105/joss.04634)

HTML:
<a style="border-width:0" href="https://doi.org/10.21105/joss.04634">
  <img src="https://joss.theoj.org/papers/10.21105/joss.04634/status.svg" alt="DOI badge" >
</a>

reStructuredText:
.. image:: https://joss.theoj.org/papers/10.21105/joss.04634/status.svg
   :target: https://doi.org/10.21105/joss.04634

This is how it will look in your documentation:

DOI

We need your help!

The Journal of Open Source Software is a community-run journal and relies upon volunteer effort. If you'd like to support us please consider doing either one (or both) of the the following:

@minaskar
Copy link

minaskar commented Nov 9, 2022

Thank you so much @dfm @marylou-gabrie @kazewong

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
accepted published Papers published in JOSS Python recommend-accept Papers recommended for acceptance in JOSS. review TeX Track: 1 (AASS) Astronomy, Astrophysics, and Space Sciences
Projects
None yet
Development

No branches or pull requests

6 participants