[ https://issues.apache.org/jira/browse/TIKA-213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018129#comment-13018129 ]
Nick Burch commented on TIKA-213:
---------------------------------

The patch looks like a good start (thanks Selva!). A couple of things we might want to tweak are:
* I'm not sure the removal of quotes in the values is correct; shouldn't we escape them instead?
* Numbers could be output without quoting
* If we have several values for one metadata field, we should probably output it as key:array rather than multiple key:value entries

Some of these changes might be easier with a JSON library. Does anyone know if Jackson, for example, would help with them?

> JSON output from Tika CLI
> -------------------------
>
>                 Key: TIKA-213
>                 URL: https://issues.apache.org/jira/browse/TIKA-213
>             Project: Tika
>          Issue Type: New Feature
>          Components: cli
>            Reporter: Jukka Zitting
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>         Attachments: json_output_option.patch
>
>
> From the ApacheCon: CouchDB seems interested in Tika, and they'd like to see
> an option for producing JSON output from the Tika CLI.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
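
For illustration, the three tweaks suggested in the comment (escape quotes rather than strip them, emit numbers unquoted, emit multi-valued fields as key:array) could look roughly like the sketch below. It uses only the JDK and no JSON library. The class MetadataJsonSketch, its method names, and the Map<String, List<String>> input shape are assumptions made for this example; none of it comes from Tika or from the attached patch.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Tika or the attached patch.
final class MetadataJsonSketch {

    // JSON number grammar: optional minus, integer part without leading zeros,
    // optional fraction, optional exponent.
    private static final Pattern JSON_NUMBER =
            Pattern.compile("-?(0|[1-9]\\d*)(\\.\\d+)?([eE][+-]?\\d+)?");

    // Quote a string, escaping special characters instead of removing them.
    static String quote(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters get a four-digit hex escape.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.append('"').toString();
    }

    // Values that look like JSON numbers are emitted bare; everything else is quoted.
    static String scalar(String v) {
        return JSON_NUMBER.matcher(v).matches() ? v : quote(v);
    }

    // A field with exactly one value becomes key:value; any other count becomes key:array.
    static String toJson(Map<String, List<String>> metadata) {
        StringBuilder sb = new StringBuilder("{");
        boolean first = true;
        for (Map.Entry<String, List<String>> e : metadata.entrySet()) {
            if (!first) sb.append(',');
            first = false;
            sb.append(quote(e.getKey())).append(':');
            List<String> values = e.getValue();
            if (values.size() == 1) {
                sb.append(scalar(values.get(0)));
            } else {
                sb.append('[');
                for (int i = 0; i < values.size(); i++) {
                    if (i > 0) sb.append(',');
                    sb.append(scalar(values.get(i)));
                }
                sb.append(']');
            }
        }
        return sb.append('}').toString();
    }

    public static void main(String[] args) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        metadata.put("title", Arrays.asList("A \"quoted\" title"));
        metadata.put("Page-Count", Arrays.asList("12"));
        metadata.put("Author", Arrays.asList("Alice", "Bob"));
        // Prints: {"title":"A \"quoted\" title","Page-Count":12,"Author":["Alice","Bob"]}
        System.out.println(toJson(metadata));
    }
}
```

One caveat with the number rule: a text field whose value happens to look numeric (a title of "2001", say) would also come out unquoted, so a real implementation might want to restrict unquoting to fields known to be numeric. A JSON library would take care of the escaping and array handling, which is presumably the appeal of the Jackson suggestion.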