Tim,

I could likely work on this next week. Will contact you off-list 
regarding some specific questions I have.

Ed

On 6/12/2015 8:35 AM, Tim Donohue wrote:
> Hi Ed,
>
> Currently there is not a media filter for Excel files. But, it'd make 
> a great addition (if you or someone else had time).
>
> It'd likely actually may be easy to code, as it'd likely be similar to 
> the PowerPointFilter. The PowerPointFilter uses Apache POI, a 
> third-party library which can read Microsoft formats.  So, it should 
> be possible to create an ExcelFilter using the Apache POI components 
> related to Excel.
>
> Here's more on Apache POI:
> https://poi.apache.org/
>
> Here's our existing PowerPointFilter which uses Apache POI to filter 
> PPT/PPTX:
> https://github.com/DSpace/DSpace/blob/master/dspace-api/src/main/java/org/dspace/app/mediafilter/PowerPointFilter.java
>  
>
>
> If anyone is interested in contributing this to DSpace, we'd welcome 
> Pull Requests via GitHub!
>
> - Tim
>
> On 6/10/2015 3:32 PM, Ed Goulet wrote:
>> All,
>>
>> Is there such a media filter to extract text from Excel files?
>>
>> Like org.dspace.app.mediafilter.PDFFilter ... but it might be
>> org.dspace.app.mediafilter.XLSFilter ?
>>
>> Ed
>>
>> ------------------------------------------------------------------------------
>>  
>>
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> List Etiquette: 
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>
>


------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to