Coder Social home page Coder Social logo

docu2html's Introduction

Document2HTML converter

Build Status Build status Github Releases Documentation MIT Licence

Documents to HTML converter

Features

Extension Text Styles extraction Images extraction
HTML/XHTML Yes Yes Yes
XML Yes Not applicable Not applicable
DOCX Yes Yes Yes
DOC Yes No No
RTF Yes Yes Yes
ODT Yes Yes Yes
XLSX Yes Yes Yes
XLS Yes Yes No
CSV Yes Not applicable Not applicable
TXT/MD Yes Yes Yes
JSON Yes Not applicable Not applicable
EPUB Yes Yes Yes
PDF Yes No Yes
PPT Yes No No

Dependencies

cURL for downloading images:

apt-get install libcurl4-openssl-dev
or
brew install curl

iconv for encoding conversion

sudo apt-get install libc6
or
brew install libiconv

Tidy for cleaning and repairing HTML

sudo apt-get install libtidy-dev
or
brew install tidy-html5

file for determining file extension

Third-party

Building

Make sure the Qt (>= 5.6) development libraries are installed:

  • In Ubuntu/Debian: apt-get install qt5-default qttools5-dev-tools zlib1g-dev
  • In Fedora: sudo dnf builddep tiled
  • In Arch Linux: pacman -S qt
  • In Mac OS X with Homebrew:
    • brew install qt5
    • brew link qt5 --force
  • Or you can download Qt from: https://www.qt.io/download-open-source/

Now you can compile by running:

qmake (or qmake-qt5 on some systems)
make

To do a shadow build, you can run qmake from a different directory and refer it to space-invaders.pro, for example:

mkdir build
cd build
qmake ../src/document2html.pro
make

If you have ideas how to build project with CMake instead of Qt please contact me.

Tool usage

Usage:

    document2html -f|-d <input file|dir> -o <output dir> [-si]
    document2html -h
    document2html -v

Options:

Short Flag Long Flag Description
-f --file Input file
-d --dir Input directory
-o --out Output directory
-s --style Extract styles
-i --image Extract images
-h --help Display help message
-v --version Display package version

Thanks

Contact

If you have questions regarding the libraries, I would like to invite you to open an issue at Github. Please describe your request, problem, or question as detailed as possible, and also mention the version of the libraries you are using as well as the version of your compiler and operating system. Opening an issue at Github allows other users and contributors to this libraries to collaborate.

You're welcome! :)

docu2html's People

Stargazers

CozyFox avatar Bocke avatar  avatar PolarDefend avatar

Watchers

CozyFox avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.