Hi,

I don't know what your troubles have been and what you mean by "complicated to use". As far as I can see there is not much difference in using the extractors technically, neither in initialization nor in invoking them nor in defining other extractors within the frameworks.

Aperture is used as part of the Metaxa engine in Stanbol. If you have issues with PDF extraction in using that engine, I would suggest that you open an issue about that in the Stanbol JIRA that we can look into it. Since Tika and Aperture both basically rely on PDFBox I don't expect significant differences between them.

The main drawback of current Tika is that the Tika-Metadata allow only for atomic values, and representing and handling complex structured metadata such as microformats, RDFa etc. is not straightforward. Aperture uses uniformly RDF for all metadata representations that naturally supports structured data and does not need additional effort to turn idiosyncratic metadata keys and values into something interpretable.

Best regards,

Walter


Jürgen Jakobitsch wrote:
hi all,

i just wanted to let you know, that we had troubles with aperture in the past
and skipped to tika.

besides being complicated to use, aperture wasn't able to extract from pdfs 
which were no
problem for tika.
if it's just to get rdf out of some sorts of documents, tika (all the deps are 
already there)
will do it. a content handler that makes rdf out of metadata is a matter of an 
hour...
please also consider that aperture is a one man show according to sourceforge 
svn browse with last release
about a year ago...

wkr jürgen




--
Dr. Walter Kasper
DFKI GmbH
Stuhlsatzenhausweg 3
D-66123 Saarbrücken
Tel.:  +49-681-85775-5300 (*NEW NUMBER*)
Fax:   +49-681-85775-5338 (*NEW NUMBER*)
Email: [email protected]
-------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern

Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff

Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes

Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------

Reply via email to