Hi,
I don't know what your troubles have been and what you mean by
"complicated to use". As far as I can see there is not much difference
in using the extractors technically, neither in initialization nor in
invoking them nor in defining other extractors within the frameworks.
Aperture is used as part of the Metaxa engine in Stanbol. If you have
issues with PDF extraction in using that engine, I would suggest that
you open an issue about that in the Stanbol JIRA that we can look into
it. Since Tika and Aperture both basically rely on PDFBox I don't expect
significant differences between them.
The main drawback of current Tika is that the Tika-Metadata allow only
for atomic values, and representing and handling complex structured
metadata such as microformats, RDFa etc. is not straightforward.
Aperture uses uniformly RDF for all metadata representations that
naturally supports structured data and does not need additional effort
to turn idiosyncratic metadata keys and values into something interpretable.
Best regards,
Walter
Jürgen Jakobitsch wrote:
hi all,
i just wanted to let you know, that we had troubles with aperture in the past
and skipped to tika.
besides being complicated to use, aperture wasn't able to extract from pdfs
which were no
problem for tika.
if it's just to get rdf out of some sorts of documents, tika (all the deps are
already there)
will do it. a content handler that makes rdf out of metadata is a matter of an
hour...
please also consider that aperture is a one man show according to sourceforge
svn browse with last release
about a year ago...
wkr jürgen
--
Dr. Walter Kasper
DFKI GmbH
Stuhlsatzenhausweg 3
D-66123 Saarbrücken
Tel.: +49-681-85775-5300 (*NEW NUMBER*)
Fax: +49-681-85775-5338 (*NEW NUMBER*)
Email: [email protected]
-------------------------------------------------------------
Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern
Geschaeftsfuehrung:
Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff
Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes
Amtsgericht Kaiserslautern, HRB 2313
-------------------------------------------------------------