[ http://jira.nuxeo.org/browse/NXP-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Stéfane Fermigier updated NXP-1658: ----------------------------------- Fix Version/s: (was: 5.2 M2) 5.2 M3 > Meta-Data extraction > -------------------- > > Key: NXP-1658 > URL: http://jira.nuxeo.org/browse/NXP-1658 > Project: Nuxeo Enterprise Platform > Issue Type: New Feature > Reporter: Dragos Mihalache > Assignee: Dragos Mihalache > Fix For: 5.2 M3 > > Time Spent: 2 days > Remaining Estimate: 0 minutes > > Meta-Data extractor : > ===================== > The idea is to be able to extract meta-data from files and attache them > to a schema in the document. > Typical use cases include : > - extracting MSOffice / OpenOffice meta-data > - extracting images meta-data > - extracting video or audio meta-data > The first uses case is really on File => meta-data, but in the future we > may want a bi-directionnal sync. > Similar implementations in Nuxeo : > ================================== > Nuxeo has already similar features : > DocModifier : > ------------- > The DocModifier is now used for sync wordML meta-data with Nuxeo > MetaData. > The problem with DocModifier are : > - it needs refactoring : > - some processing and configuration should be done in a Tranformation > Plugin rather than in doc modifier > but we can refactor it right now because seveal projects use it > Tranformation Plugins : > ----------------------- > Extracting meta-data could be implemented as a tranformation plugin that > takes a file and returns meta-datas. > For now Tranformation Plugins take a Blob as input and output a Blob. > This could be changed to Blob => Map or we could just send the Map as a > Blob. > Possible existing libs : > ======================== > http://aperture.sourceforge.net/ > or > http://meta-extractor.sourceforge.net/ > or > Apache POI. > Both libs provides support for differents meta-data set. > An interesting point is that mos Meta-data extract libs also include : > - text extraction > - html view generation > Because of that I would think that it would be better to integrate these > libs in the Tranformation Service. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://jira.nuxeo.org/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira _______________________________________________ ECM-tickets mailing list ECM-tickets@lists.nuxeo.com http://lists.nuxeo.com/mailman/listinfo/ecm-tickets