Hi Ed,

Currently there is no media filter for Excel files. But it would make a great addition (if you or someone else has time).
It would likely be easy to code, as it would be similar to the PowerPointFilter. The PowerPointFilter uses Apache POI, a third-party library that can read Microsoft formats. So it should be possible to create an ExcelFilter using the Apache POI components related to Excel.

Here's more on Apache POI:
https://poi.apache.org/

Here's our existing PowerPointFilter, which uses Apache POI to filter PPT/PPTX:
https://github.com/DSpace/DSpace/blob/master/dspace-api/src/main/java/org/dspace/app/mediafilter/PowerPointFilter.java

If anyone is interested in contributing this to DSpace, we'd welcome Pull Requests via GitHub!

- Tim

On 6/10/2015 3:32 PM, Ed Goulet wrote:
> All,
>
> Is there such a media filter to extract text from Excel files?
>
> Like org.dspace.app.mediafilter.PDFFilter ... but it might be
> org.dspace.app.mediafilter.XLSFilter ?
>
> Ed

_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette:
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
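To give a feel for the text-extraction step an ExcelFilter would have to perform, here is a minimal sketch that uses only the JDK rather than Apache POI. Everything in it is an assumption for illustration: the class XlsxSharedStringsSketch and its extractText method are hypothetical names, not DSpace or POI code. It only reads the shared-string table (xl/sharedStrings.xml) of an .xlsx file, so it misses numbers, formulas, inline strings, and the older binary .xls format, which is exactly the kind of detail POI is meant to take care of.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

// Hypothetical illustration only; not part of DSpace or Apache POI.
final class XlsxSharedStringsSketch {

    private static final String NS =
            "http://schemas.openxmlformats.org/spreadsheetml/2006/main";
    private static final String SHARED_STRINGS = "xl/sharedStrings.xml";

    /** Returns one line per shared string, or "" if the file has no shared-string table. */
    static String extractText(InputStream xlsx) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(xlsx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (SHARED_STRINGS.equals(entry.getName())) {
                    return parseSharedStrings(readCurrentEntry(zip));
                }
            }
        }
        return "";
    }

    // Copy the current zip entry into memory so the XML parser cannot close the zip stream.
    private static byte[] readCurrentEntry(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }

    private static String parseSharedStrings(byte[] xml) throws IOException {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            // Uploaded files are untrusted, so refuse DOCTYPE declarations (guards against XXE).
            factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
            Document doc = factory.newDocumentBuilder().parse(new ByteArrayInputStream(xml));

            StringBuilder text = new StringBuilder();
            NodeList items = doc.getElementsByTagNameNS(NS, "si");
            for (int i = 0; i < items.getLength(); i++) {
                // Concatenate every <t> under this <si>; covers plain and rich-text strings.
                NodeList runs = ((Element) items.item(i)).getElementsByTagNameNS(NS, "t");
                for (int j = 0; j < runs.getLength(); j++) {
                    text.append(runs.item(j).getTextContent());
                }
                text.append('\n');
            }
            return text.toString();
        } catch (ParserConfigurationException | SAXException e) {
            throw new IOException("Could not parse " + SHARED_STRINGS, e);
        }
    }
}
```

A real filter would more likely follow the PowerPointFilter's structure and let POI do the parsing. The sketch only shows that the core job is turning workbook contents into plain text.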