🗽New Feature: support pdf format #10

Open
rakakroma opened this issue Mar 30, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@rakakroma
Owner

I would like to support PDF files but have no idea how to do it. Any suggestions?

@rakakroma rakakroma added the enhancement New feature or request label Mar 31, 2023
@rakakroma rakakroma changed the title from "new feature: support pdf format" to "🗽New Feature: support pdf format" Apr 1, 2023
@rakakroma
Owner Author

rakakroma commented May 9, 2023

This is how Steal the Word currently works in pdf.js:
[Screenshot 2023-05-09, 2:54:31 PM]

While in pdf2htmlEX:

[Screenshot 2023-05-09, 2:59:01 PM]

Although the latter requires the user to convert the file first, it clearly displays better.

The advantage of pdf.js is that a selection has a better chance of capturing the full sentence (though it sometimes picks the wrong one). pdf2htmlEX, on the other hand, splits each sentence into several small blocks, so the current method always fails there. I am not sure whether I can build a custom sentence selection method for it.
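A rough sketch of what such a custom method might look like, not what Steal the Word actually does. It assumes the text of the neighbouring blocks can already be collected, in reading order, into an array of strings; `sentenceAround` and `isBoundary` are hypothetical names, and the rule "`.`, `!` or `?` followed by whitespace" is a simplification that would need more work for abbreviations and for languages without spaces.

```typescript
// Hypothetical helper: a character ends a sentence if it is . ! or ?
// and is followed by whitespace or the end of the text.
// This keeps "pdf.js" from being split at its inner dot.
function isBoundary(text: string, i: number): boolean {
  if (!/[.!?]/.test(text.charAt(i))) return false;
  return i + 1 >= text.length || /\s/.test(text.charAt(i + 1));
}

// Rebuild the sentence that contains the start of fragments[index].
// `fragments` is assumed to hold the text of consecutive blocks in reading order.
function sentenceAround(fragments: string[], index: number): string {
  const parts = fragments.map((f) => f.trim());

  // Offset of the selected fragment inside the joined text
  // (each earlier fragment contributes its length plus one joining space).
  let start = 0;
  for (let i = 0; i < index && i < parts.length; i++) {
    start += parts[i].length + 1;
  }
  const joined = parts.join(" ");

  // Walk left until the character before `left` closes the previous sentence.
  let left = start;
  while (left > 0 && !isBoundary(joined, left - 1)) left--;

  // Walk right until the character at `right` closes the current sentence.
  let right = start;
  while (right < joined.length && !isBoundary(joined, right)) right++;

  return joined.slice(left, right + 1).trim();
}

// Example: the selected word sits in the third block.
const blocks = ["It was fine.", "The advantage of", "pdf.js is clear", "to everyone. Next one"];
console.log(sentenceAround(blocks, 2));
```

Joining the blocks first and searching for boundaries afterwards means the block layout produced by the converter no longer matters; the open question is still how to collect the right neighbouring blocks from the page.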

There is also another problem: the annotation tooltip is extremely small in both, which should be fixed.
