Hi Ed,

Currently there is not a media filter for Excel files. But, it'd make a 
great addition (if you or someone else had time).

It'd likely actually may be easy to code, as it'd likely be similar to 
the PowerPointFilter. The PowerPointFilter uses Apache POI, a 
third-party library which can read Microsoft formats.  So, it should be 
possible to create an ExcelFilter using the Apache POI components 
related to Excel.

Here's more on Apache POI:
https://poi.apache.org/

Here's our existing PowerPointFilter which uses Apache POI to filter 
PPT/PPTX:
https://github.com/DSpace/DSpace/blob/master/dspace-api/src/main/java/org/dspace/app/mediafilter/PowerPointFilter.java

If anyone is interested in contributing this to DSpace, we'd welcome 
Pull Requests via GitHub!

- Tim

On 6/10/2015 3:32 PM, Ed Goulet wrote:
> All,
>
> Is there such a media filter to extract text from Excel files?
>
> Like org.dspace.app.mediafilter.PDFFilter ... but it might be
> org.dspace.app.mediafilter.XLSFilter ?
>
> Ed
>
> ------------------------------------------------------------------------------
> _______________________________________________
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette: 
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>

------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to