[ 
http://jira.nuxeo.org/browse/NXP-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stéfane Fermigier updated NXP-1658:
-----------------------------------

    Fix Version/s:     (was: 5.2 M2)
                   5.2 M3

> Meta-Data extraction
> --------------------
>
>                 Key: NXP-1658
>                 URL: http://jira.nuxeo.org/browse/NXP-1658
>             Project: Nuxeo Enterprise Platform
>          Issue Type: New Feature
>            Reporter: Dragos Mihalache
>            Assignee: Dragos Mihalache
>             Fix For: 5.2 M3
>
>          Time Spent: 2 days
>  Remaining Estimate: 0 minutes
>
> Meta-Data extractor :
> =====================
> The idea is to be able to extract meta-data from files and attache them
> to a schema in the document.
> Typical use cases include :
>  - extracting MSOffice / OpenOffice meta-data
>  - extracting images meta-data
>  - extracting video or audio meta-data
> The first uses case is really on File => meta-data, but in the future we
> may want a bi-directionnal sync.
> Similar implementations in Nuxeo :
> ==================================
> Nuxeo has already similar features :
> DocModifier :
> -------------
> The DocModifier is now used for sync wordML meta-data with Nuxeo
> MetaData.
> The problem with DocModifier are :
>  - it needs refactoring :
>    - some processing and configuration should be done in a Tranformation
> Plugin rather than in doc modifier
>    but we can refactor it right now because seveal projects use it
> Tranformation Plugins :
> -----------------------
> Extracting meta-data could be implemented as a tranformation plugin that
> takes a file and returns meta-datas.
> For now Tranformation Plugins take a Blob as input and output a Blob.
> This could be changed to Blob => Map or we could just send the Map as a
> Blob.
> Possible existing libs :
> ========================
> http://aperture.sourceforge.net/
> or
> http://meta-extractor.sourceforge.net/
> or
> Apache POI.
> Both libs provides support for differents meta-data set.
> An interesting point is that mos Meta-data extract libs also include :
>  - text extraction 
>  - html view generation
> Because of that I would think that it would be better to integrate these
> libs in the Tranformation Service.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://jira.nuxeo.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
_______________________________________________
ECM-tickets mailing list
ECM-tickets@lists.nuxeo.com
http://lists.nuxeo.com/mailman/listinfo/ecm-tickets

Reply via email to