feature request: pdf support #28

asg0451 · 2024-03-27T16:55:22Z

I'd like to be able to capture a pdf url, eg https://gavinadair.files.wordpress.com/2017/03/baker-changes-of-mind.pdf

currently, it is captured but no tags or added nor is text extracted

logs:

hoarder-workers 2024-03-27T16:53:27.624Z info: [Crawler][9] Will crawl "https://gavinadair.files.wordpress.com/2017/03/baker-changes-of-mind.pdf" for link with id "h03n4dihn2gp0kn8giwiyir7"                                                                                           hoarder-workers 2024-03-27T16:53:27.813Z info: [search][30] Completed successfully                                                                                                                                                                                                      hoarder-workers 2024-03-27T16:53:27.822Z error: [Crawler][9] Crawling job failed: {}

The text was updated successfully, but these errors were encountered:

MohamedBassem · 2024-03-27T16:56:56Z

Yeah, only html Content-Type currently works. PDF support is a reasonable feature request though. Will add it to the backlog. Thanks!

MarkLuk · 2024-06-08T18:27:45Z

Exactly my use-case! I research & bookmark a lot of PDF files. Would like to have support to view their content in the preview.

extended the database to allow storing pdf assets alongside links added downloading of pdfs added aiinference for pdfs updated the UI to display the same as for asset bookmarks

Added a new sourceUrl column to the asset bookmarks Added transforming a link bookmark pointing at a pdf to an asset bookmark made sure the "View Original" link is also shown for asset bookmarks that have a sourceURL updated gitignore for IDEA

MohamedBassem added the feature request New feature or request label Mar 27, 2024

mutschler mentioned this issue Jun 20, 2024

[Feature Request] Support for adding Images by URL #246

Closed

MohamedBassem closed this as completed in be1b7f7 Jun 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature request: pdf support #28

feature request: pdf support #28

asg0451 commented Mar 27, 2024 •

edited

Loading

MohamedBassem commented Mar 27, 2024

MarkLuk commented Jun 8, 2024

feature request: pdf support #28

feature request: pdf support #28

Comments

asg0451 commented Mar 27, 2024 • edited Loading

MohamedBassem commented Mar 27, 2024

MarkLuk commented Jun 8, 2024

asg0451 commented Mar 27, 2024 •

edited

Loading