Content ignored for RSS feed #80

Open
captn3m0 opened this issue Jun 5, 2023 · 6 comments
Labels
bug (Something isn't working), help wanted (Extra attention is needed), storyprovider

Comments

captn3m0 commented Jun 5, 2023

I have the feed hosted locally, but here's the RSS feed:

beatroot.atom.zip

The output isn't what I'd expect:

image

j6k4m8 (Owner) commented Aug 7, 2023

I have a feeling this may be fixed in #82; let me know if you still experience the issue.

j6k4m8 (Owner) commented Jun 7, 2024

Hey @captn3m0 — still experiencing this, or can we close this issue? Happy to brainstorm if it's still happening!

j6k4m8 added the bug and storyprovider labels Jun 7, 2024

captn3m0 (Author) commented Jun 25, 2024

Still broken, but I have a guess as to why. The code relies on summary, while the Atom feed I have publishes a content element instead.

I'm also unclear whether Goosepaper supports the RSS standard, the Atom standard, or both. feedparser seems to normalize content across all of these standards, so perhaps it should be used: https://feedparser.readthedocs.io/en/latest/content-normalization.html

I tried to parse the feed manually with feedparser, and it returns decent HTML, no matter which method I use:

>>> e.summary
'<div><img src="https://beatrootnews.com/uploads/2023/06-June/05-Mon/sm/Antilla-building_647dc0f96050d.png" width="350" /></div>The Supreme Court today granted three weeks interim bail to former police officer Pradeep Sharma, who was arrested in connection with the Antilia bomb scare case and the killing of businessman Mansukh Hiran.<br /><br />A vacation bench of Justices Aniruddha Bose and Rajesh Bindal granted relief to Sharma after noting that he had sought interim bail on humanitarian grounds since his wife had developed serious complications after a surgery.On February 25, 2021, an explosives-laden SUV was found near Ambani\'s residence \'Antilia\' in south Mumbai. Businessman Hiran, who was in possession of the SUV, was found dead in a creek in neighbouring Thane on March 5, 2021.<br /><br />The allegation against Sharma, who belonged to the Mumbai Police\'s encounter squad that killed over 300 criminals in numerous encounters, was that he had helped his former colleague Waze in eliminating Hiran.<br /><br />Sharma was arrested in the case in June 2021 and is currently in judicial custody.'
>>> e['description']
'<div><img src="https://beatrootnews.com/uploads/2023/06-June/05-Mon/sm/Antilla-building_647dc0f96050d.png" width="350" /></div>The Supreme Court today granted three weeks interim bail to former police officer Pradeep Sharma, who was arrested in connection with the Antilia bomb scare case and the killing of businessman Mansukh Hiran.<br /><br />A vacation bench of Justices Aniruddha Bose and Rajesh Bindal granted relief to Sharma after noting that he had sought interim bail on humanitarian grounds since his wife had developed serious complications after a surgery.On February 25, 2021, an explosives-laden SUV was found near Ambani\'s residence \'Antilia\' in south Mumbai. Businessman Hiran, who was in possession of the SUV, was found dead in a creek in neighbouring Thane on March 5, 2021.<br /><br />The allegation against Sharma, who belonged to the Mumbai Police\'s encounter squad that killed over 300 criminals in numerous encounters, was that he had helped his former colleague Waze in eliminating Hiran.<br /><br />Sharma was arrested in the case in June 2021 and is currently in judicial custody.'
>>> e['summary']
'<div><img src="https://beatrootnews.com/uploads/2023/06-June/05-Mon/sm/Antilla-building_647dc0f96050d.png" width="350" /></div>The Supreme Court today granted three weeks interim bail to former police officer Pradeep Sharma, who was arrested in connection with the Antilia bomb scare case and the killing of businessman Mansukh Hiran.<br /><br />A vacation bench of Justices Aniruddha Bose and Rajesh Bindal granted relief to Sharma after noting that he had sought interim bail on humanitarian grounds since his wife had developed serious complications after a surgery.On February 25, 2021, an explosives-laden SUV was found near Ambani\'s residence \'Antilia\' in south Mumbai. Businessman Hiran, who was in possession of the SUV, was found dead in a creek in neighbouring Thane on March 5, 2021.<br /><br />The allegation against Sharma, who belonged to the Mumbai Police\'s encounter squad that killed over 300 criminals in numerous encounters, was that he had helped his former colleague Waze in eliminating Hiran.<br /><br />Sharma was arrested in the case in June 2021 and is currently in judicial custody.'

I tested against both 0.6.0 and 0.7.1. I also tried other formats (JSON/MRSS/SFeed); MRSS was parsed, but it resulted in the same output.

Here are the generated outputs for both 0.6.0 and 0.7.1:

j6k4m8 (Owner) commented Jun 25, 2024

Ah, thank you @captn3m0 for the thorough report! Feedparser looks like the right answer here, I wonder why we didn't originally do that...
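
For anyone picking this up, here is a rough sketch of the fallback discussed above: prefer an entry's Atom content element and only then fall back to summary or description. This is not Goosepaper's actual code. The helper names `entry_html` and `load_entries` are hypothetical, and the sketch assumes entries shaped the way feedparser exposes them (a `content` list of dicts with a `value` key, plus `summary` / `description` strings).

```python
from typing import Any, Mapping


def entry_html(entry: Mapping[str, Any]) -> str:
    """Return the richest HTML body found on a feedparser-style entry.

    Hypothetical helper, not part of Goosepaper or feedparser.
    """
    # feedparser exposes Atom <content> elements as a list of dicts under
    # "content"; each dict carries the body text under "value".
    for item in entry.get("content") or []:
        value = item.get("value")
        if value:
            return value
    # No usable content element: fall back to summary, then description.
    return entry.get("summary") or entry.get("description") or ""


def load_entries(source: str) -> list:
    """Parse a feed URL, file path, or raw XML string and return its entries."""
    # Imported lazily so entry_html() can be used (and tested) on plain dicts
    # without feedparser installed. Third-party: pip install feedparser
    import feedparser

    return feedparser.parse(source).entries
```

Something like `[entry_html(e) for e in load_entries("beatroot.atom")]` (the path here is only a placeholder) could then feed the story body, regardless of whether the feed is RSS or Atom.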

Kernald commented Sep 10, 2024

An easy public reproducer is xkcd's feed - no content shows up either.

j6k4m8 (Owner) commented Sep 10, 2024

Thanks for the report @Kernald! I'll address this as soon as I can get a few hours to tinker :)

j6k4m8 added the help wanted label Sep 12, 2024