-
-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix bookmarking of PDFs #826
Conversation
@BPerlakiH You should not need a PDF parser if you don't need an HTML parser (for the bookmarks). This is also not what is requested in the issue. I really don't get it. |
@kelson42 The bookmarks are currently containing these fields we do display:
So far we were getting these fields only via an HTML parser. In case of a PDF file it was not working as expected. Therefore I thought we might use a PDF parser to get the same fields if possible. |
It could be simplified then with: in case of a non html (text) document:
|
@BPerlakiH Indeed, but not sure what you mean? |
yes, this is what is requested in the issue and this is appropriate to me. So, as far as I understand you wanted to get a better bookmark by allowing to get more details about the PDF. Can you please:
|
As discussed for nautilus and zimit, I think it makes much more sense to enhance scrapers so that they populate properly the title and provide proper indexing data for search. This would allow all readers to benefit from such an enhancement at once. I don't mind if this is implement in apple reader, but it might soon be "obsolete" once all pdfs have a proper title due to implementation in the scraper. I intend to add this to the python-scraperlib in the coming weeks. |
@benoit74 At the core of this issue is a lack of PDF support at scraper side. But, even if this has to be fixed there, it has to be handled properly at reader level if no title metadata available... For PDF or any other supported mime-type. |
b092199
to
5a71f44
Compare
I've updated this PR, and created a new issue for the more detailed solution: #827 |
I'm a bit worried by this PR. I feel there's no room for discussion and everything is rushed. My first opinion when reading the PR description was the one of @kelson42 but reading the implementation, I think it was the appropriate way to go: PDF is a natively supported format on apple system. Hence the PDF Parser is builtin. |
@rgaudin See my comment to the dedicated ticket. |
Fixes: #817
Currently this is how it looks with all the Water docs bookmarked on macOS: