- RU.UNIX (2:5077/15.22) -------------------------------------------- RU.UNIX -
From : Vlad Sirenko 2:463/257.18 25 Apr 00 02:11:24
Subj : PDF->html
-------------------------------------------------------------------------------
ant@kmbank.kuban.ru writes:
> > Есть какая-нибудь утилита, котоpой можно печатать PDF (давать на вход ей)
> > и чтобы ее из скpиптов вызывать можно было?
> >
> > Вот такое скpомное желание...
>
>
> А PDF->html бывает?
Xref: localhost comp.os.linux.announce:491
-----BEGIN PGP SIGNED MESSAGE-----
Version 0.21 of pdftohtml is now available for download from
http://www.ra.informatik.uni-stuttgart.de/~gosho/pdftohtml/code.html
pdftohtml 0.21
=============Pdftohtml v. 0.21 converts Portable Document Format files to HTML. For further
information please visit the pdftohtml homepage
http://www.ra.informatik.uni-stuttgart.de/~gosho/pdftohtml/
Changes from version 0.2:
- -------------------------
- - Many bugfixes
- - Support for colored fonts in pdf documents
- - Better handling of different font sizes.
Changes from version 0.1:
- --------------------------
- - All images from the PDF file are extracted as JPEG or PNG images.
- - A shell script pdftohtml is now the main user interface
(note: pnmtopng http://www.cdrom.com/pub/png/pngcode.html is now necessary)
- - A complex HTML layout mechanism was added (using CSS)
- - Frames are (optionally) used in the HTML output
Todo:
- -----
- - pdf vector drawings are not yet extracted.
- - Images are sometimes cut into multiple images.
- - Add option to switch off page output
Known Bugs:
- -----------
- - -e option is currently not working
- - pdf files with dark backgrounds are not converted well.
- --
Rainer Dorsch
Abt. Rechnerarchitektur e-mail:rainer.dorsch@informatik.uni-stuttgart.de
Uni Stuttgart Tel.: 0711-7816-215
--
Best regards, -- Vlad.
--- Gnus v5.7/Emacs 20.4 * Origin: Terem (2:463/257.18@fidonet)