Comments (2)
pdftohtml does not give complete control on the PDF and uses its own images for borders, etc...decided to put the extra effort of parsing the PDF document using the poppler library...that will provide the API level control on the PDF...
from bookworm.
Just pushed a version of Bookworm with PDF support based on the use of Poppler and "pdftohtml". However the following needs to be completed:
- Avoid re-parsing of the book once it has been parsed and added to the library - this can be done by maintaining the parsed contents in the ~/.config/bookworm folder until a book is not removed from the library. This duplicates the contents of the book and hence adds to storage. Add a toggle button to choose between performance and storage
- PDF parsing and conversion to HTML takes considerable time, especially for PDFs with images - will need to see if pdftohtml can be optimized
- Poppler generated HTML files are not optimized in terms of the number of words per line. Some work is needed to optimize this
- Add contractor support for PDF mime type to offer support for Bookworm in Files
from bookworm.
Related Issues (20)
- Every time an e-book is opened a new data base row is created HOT 4
- Appstream complains about the image HOT 2
- suggestion: use a different/additional call sign for the CLI
- suggest: update webpage re Python HOT 1
- Can't change colors with "Customize reading profile" options HOT 1
- Two page reading layout problem
- Runtime org.gnome.Platform branch 42 is end-of-life HOT 2
- System darkmode causes very low contrast in shortcuts tab HOT 1
- Flatpak com.github.babluboy.bookworm not runing on Debian
- Icon that appears in the LXDE panel (the running program instance) is generic
- Ctrl + F search returns, "No matches found". HOT 2
- You are still running org.gnome.Platform on branch 42, which is end of life. HOT 6
- Crashes and freezing when loading large cbz files HOT 1
- webkit2gtk4.0 will be obsoleted HOT 1
- Preferences Shortcut tab can't see content when using Dark mode theme HOT 1
- meson.build:2:0: ERROR: Could not execute Vala compiler "valac" HOT 3
- flatpak Gnome platform runtime version 42 and Platform 21.08 has reached end of life HOT 1
- Restore page position HOT 2
- Bookworm Flatpak not opening in Linux Mint 21.3 "Virginia" Cinnamon Edition
- Unable to load epub file
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bookworm.