Hanns Lohmann wrote:
Hallo, lässt sich eine pdf-Datei in OO-Writer importieren (nicht ex-, sondern
importieren!!)
Wie schon alle anderen gesagt haben: nein. Es muss also der Umweg über
eine Texterkennung (OCR) gewählt werden.
Alt.comp.freeware hat kürzlich dazu den folgenden Beitrag. Achtung: die
Qualität der Texterkennung dieser freewares ist variabel.
Ich selbst habe deshalb Finereader 7 gekauft, der sehr gut auch Tabellen
und ihre Formatierung erkennt.
website der freeware newsgroup mit einer Auswahl unter freien OCRs:
http://www.pricelesswarehome.org/
Finereader 4 Pro gibt es kostenlos und legal unter
http://www.ilsoftware.it/articoli.asp?ID=2904
Die Seite ist in talienisch. Das Einzige was beim dem auszufüllenden
Fragebogen anzugeben ist ist eine italienische Provinz. Danach erhält
man eine Seite mit der Lizenznummer.
Guten San
Frank
Free OCR softwares:
GOCR/JOCR v0.40 - 194 KB
GOCR/JOCR is an OCR (Optical Character Recognition) program, developed
under the GNU Public License. Joerg Schulenburg started the program,
and now leads a team of developers.
GOCR can be used with different front-ends, which makes it very easy to
port to different OSes and architectures. It can open many different
image formats, and its quality have been improving in a daily basis.
The original name is GOCR. It's what is used internally in the sources.
But, when registering the site at Sourceforge, gocr was already taken.
So, it's kind of both. Yeah, we know.
http://jocr.sourceforge.net/scr_option.gif
Jörg Schulenburg
[EMAIL PROTECTED]
http://jocr.sourceforge.net/index.html
http://www-e.uni-magdeburg.de/jschulen/ocr/index.html
http://www-e.uni-magdeburg.de/jschulen/ocr/gocr040exe.zip
Graph OCR - 145 KB
A program for getting numeric data from scaned graphics.
http://www.moskvin.biz/ss/lineocr.gif
D. B. Moskvin
[EMAIL PROTECTED]
http://www.moskvin.biz/attic.php
http://www.moskvin.biz/free/grocr.zip
OmniFormat v7.5 - 5319 KB
OmniFormat is a free document conversion utility which allows dynamic
conversion and image manipulation of over 75 file formats including
HTML, DOC, XLS, WPD, PDF, XML, JPG, GIF, TIF, PNG, PCX, PPT, PS, TXT,
Photo CD, FAX and MPEG. OmniFormat supports Optical Character
Recognition (OCR) and may also be used to convert images and documents
to rights managed PDF files.
Omniformat requires that Pdf995 - also FREE - be installed. Pdf995 is
the fast, affordable way to create professional-quality documents in
the popular PDF file format. Its easy-to-use interface allows you to
create PDF files by simply selecting the "print" command from any
application, creating documents which can be viewed on any computer
with a PDF viewer.
The OmniFormat OCR Module enables OmniFormat to automatically convert
scanned images to text when the TXT output format is selected in
OmniFormat. The OCR Module will process all import formats handled by
OmniFormat. It can also extract text from PDF files and be run from the
command line.
We support Windows 95, 98, 2000 and Me, NT 4.0 and XP.
http://www.pctipp.ch/library/graphics/categories/downloads/dl/24948_2.JPG
Software995
[EMAIL PROTECTED]
http://www.omniformat.com/
http://www.freeware995.com/omniformat/omniformat.exe
OCR Module v2.5 - 520 KB:
http://www.freeware995.com/omniformat/ocrmodule.exe
SimpleOCR v3.1 - 9511 KB
Do you dread having to retype that document you are holding in your
hand? If only you had the electronic file, your life would be so much
easier. With SimpleOCR, you could easily and accurately convert that
paper document into editable electronic text for use in any application
including Word and WordPerfect.
Not only is SimpleOCR up to 99% accurate, it is 100% free.
Features:
- Huge Dictionary - With more than 120,000 words, it is unlikely that
SimpleOCR will run into a word it does not know. In the rare event
that it does not, our improved text editor allows you to easily add the
new word to the dictionary. By adding new words to the dictionary,
SimpleOCR becomes better with every use.
- Despeckle - For those documents which are not particularly clear
(i.e. faxes, copies of copies, ...), SimpleOCR provides a despeckle or
"noisy document" option which increases SimpleOCR's accuracy.
- Format Retention - SimpleOCR can keep certain elements of the
document's format in the recognized document. From varying font sizes
to font formatting elements such as underline, italic, and bold,
SimpleOCR recognizes it all. For certain documents, it retains the
original document's format with up to 99% accuracy.
- Image Retention - Along with the document's text, SimpleOCR has the
uncanny ability to capture and retain pictures from the document. This
is a great feature which reduces the need to import images from a
document by other means.
- Plain Text Extraction - Just need the plain text from the original
document? No problem. SimpleOCR can be set to recognize the
characters and words but ignore the formatting. The resulting file is
ready for y