Unexpected behaviour when a notebook is not valid JSON #374

pbugnion · 2020-01-09T16:49:24Z

Thanks for nbsphinx. It's great!

When a notebook is not valid JSON, the JSON source gets directly rendered to HTML.

As far as I can tell, this behaviour arises here: if the default converter fails for any reason, nbsphinx falls back silently to the RST Parser. The RST Parser seems to just convert the content, broken JSON and all, to HTML.

Unless I'm missing something, I'd expect nbsphinx to fail if it fails to parse the notebook?

I've created a minimal repository demonstrating this here. With this, the rendered HTML looks like:

This tripped us up in ipywidgets when a notebook became invalid JSON as a result of a merge conflict. Reported in this issue.

pbugnion · 2020-01-09T16:51:19Z

I'm very happy to implement a change to nbsphinx myself, if need be, once the maintainers have decided on the right course.

mgeier · 2020-01-10T12:14:32Z

Thanks for the report!

This is actually known behavior but I'm aware that it's a bit unfortunate.

The reason why this is not an error is that the NotebookParser.parse() method has two different tasks to do: (1) parse a whole input document, (2) parse a single translated sentence/paragraph.

See the docstring:

nbsphinx/src/nbsphinx.py

Lines 822 to 832 in aad12aa

    
                   *inputstring* is either the JSON representation of a notebook, 
        
                   or a paragraph of text coming from the Sphinx translation 
        
                   machinery. 
        
                   Note: For now, the translation strings use reST formatting, 
        
                   because the NotebookParser uses reST as intermediate 
        
                   representation. 
        
                   However, there are plans to remove this intermediate step 
        
                   (https://github.com/spatialaudio/nbsphinx/issues/36), and after 
        
                   that, the translated strings will most likely be parsed as 
        
                   CommonMark.

... and this issue + PR: #154 + #156

I'm very happy to implement a change to nbsphinx myself

If you find a reliable way to detect that a given string is not intended as a reST (or in the future probably a Markdown) string, we could probably raise a combined NotebookError to get a proper error instead of a nonsensical output HTML.

There has also been a bit of a discussion on the Sphinx-dev mailing list: https://groups.google.com/d/topic/sphinx-dev/PvAQfZcDeHw/discussion

I think ideally Sphinx would create a separate API for translating paragraphs, then this problem would immediately go away. But I don't know whether that will happen anytime soon (or at all).

mgeier · 2020-01-10T12:27:34Z

As a work-around, you could turn Sphinx warnings into errors with -W: https://www.sphinx-doc.org/en/master/man/sphinx-build.html#id6

It is quite likely that a garbled JSON file will cause quite a few warnings. It most definitely does in this case.

I'm normally using -W in CI, which helps finding problems early.

pbugnion · 2020-01-10T16:23:51Z

Ah thanks for the explanations. That makes sense. I'd read that part of the docstring, but I hadn't quite figured how it applied to my issue.

Thanks for the suggestion of using -W. You're definitely right that there were lots of warnings in that case.

pbugnion mentioned this issue Jan 9, 2020

Jupyter widget list document page is bugged jupyter-widgets/ipywidgets#2702

Closed

pbugnion closed this as completed Jan 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unexpected behaviour when a notebook is not valid JSON #374

Unexpected behaviour when a notebook is not valid JSON #374

pbugnion commented Jan 9, 2020 •

edited

Loading

pbugnion commented Jan 9, 2020 •

edited

Loading

mgeier commented Jan 10, 2020

mgeier commented Jan 10, 2020

pbugnion commented Jan 10, 2020

Unexpected behaviour when a notebook is not valid JSON #374

Unexpected behaviour when a notebook is not valid JSON #374

Comments

pbugnion commented Jan 9, 2020 • edited Loading

pbugnion commented Jan 9, 2020 • edited Loading

mgeier commented Jan 10, 2020

mgeier commented Jan 10, 2020

pbugnion commented Jan 10, 2020

pbugnion commented Jan 9, 2020 •

edited

Loading

pbugnion commented Jan 9, 2020 •

edited

Loading