Tim,

I could likely work on this next week. I will contact you off-list with some specific questions.

Ed

On 6/12/2015 8:35 AM, Tim Donohue wrote:
> Hi Ed,
>
> Currently there is not a media filter for Excel files. But, it'd make
> a great addition (if you or someone else had time).
>
> It would likely be easy to code, as it'd be similar to the
> PowerPointFilter. The PowerPointFilter uses Apache POI, a
> third-party library which can read Microsoft formats. So, it should
> be possible to create an ExcelFilter using the Apache POI components
> related to Excel.
>
> Here's more on Apache POI:
> https://poi.apache.org/
>
> Here's our existing PowerPointFilter which uses Apache POI to filter
> PPT/PPTX:
> https://github.com/DSpace/DSpace/blob/master/dspace-api/src/main/java/org/dspace/app/mediafilter/PowerPointFilter.java
>
> If anyone is interested in contributing this to DSpace, we'd welcome
> Pull Requests via GitHub!
>
> - Tim
>
> On 6/10/2015 3:32 PM, Ed Goulet wrote:
>> All,
>>
>> Is there such a media filter to extract text from Excel files?
>>
>> Like org.dspace.app.mediafilter.PDFFilter ... but it might be
>> org.dspace.app.mediafilter.XLSFilter ?
>>
>> Ed
>>
>> ------------------------------------------------------------------------------
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> List Etiquette:
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
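
For anyone picking this up, here is a rough sketch of the text-extraction step that such an ExcelFilter might wrap, using the Apache POI "ss" usermodel classes. It is only a starting point, not DSpace code: the class name ExcelTextSketch and the methods extractText and toTextStream are hypothetical, and wiring this into the DSpace media filter interface (as PowerPointFilter does) is not shown. It assumes a reasonably recent POI release, with poi-ooxml on the classpath if .xlsx files need to be read.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

import org.apache.poi.ss.usermodel.Cell;
import org.apache.poi.ss.usermodel.DataFormatter;
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.ss.usermodel.Sheet;
import org.apache.poi.ss.usermodel.Workbook;
import org.apache.poi.ss.usermodel.WorkbookFactory;

// Hypothetical helper class; the name is illustrative and not part of DSpace.
public class ExcelTextSketch {

    // Reads a workbook from the stream and returns its cell contents as plain text.
    // Declared "throws Exception" because the checked exceptions thrown by
    // WorkbookFactory.create differ between POI releases.
    public static String extractText(InputStream source) throws Exception {
        // WorkbookFactory detects the format (.xls, or .xlsx when poi-ooxml is present).
        Workbook workbook = WorkbookFactory.create(source);

        // DataFormatter renders each cell roughly as Excel would display it.
        // Without a FormulaEvaluator, formula cells yield the formula text.
        DataFormatter formatter = new DataFormatter();
        StringBuilder text = new StringBuilder();

        for (int i = 0; i < workbook.getNumberOfSheets(); i++) {
            Sheet sheet = workbook.getSheetAt(i);
            text.append(sheet.getSheetName()).append('\n');

            // Sheet and Row are iterable; only rows and cells that exist are visited.
            for (Row row : sheet) {
                for (Cell cell : row) {
                    String value = formatter.formatCellValue(cell);
                    if (!value.isEmpty()) {
                        text.append(value).append(' ');
                    }
                }
                text.append('\n');
            }
        }
        return text.toString();
    }

    // Wraps the extracted text in a stream, which is the shape a filter
    // would typically hand back for indexing. Assumes UTF-8 output is wanted.
    public static InputStream toTextStream(InputStream source) throws Exception {
        byte[] bytes = extractText(source).getBytes(StandardCharsets.UTF_8);
        return new ByteArrayInputStream(bytes);
    }
}
```

The sketch loads the whole workbook into memory, so very large spreadsheets would probably need one of POI's streaming or event-based readers instead.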

