Hi Adán, we've already been using PDFbox in DSpace before this feature for PDF text extraction and thumbnail generation in filter-media and as a PDF packager, so this is a good choice. That said, we'll need to address the encoding issue mentioned and described in [1] - probably by upgrading when PDFbox 2 is released and uploaded to Maven Central, but if you can find another way, a patch for 5.x will be highly appreciated.
[1] https://jira.duraspace.org/browse/DS-2224 Regards, ~~helix84 Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette ------------------------------------------------------------------------------ _______________________________________________ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette