[ https://issues.apache.org/jira/browse/NUTCH-1863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17002143#comment-17002143 ]
ASF GitHub Bot commented on NUTCH-1863:
---------------------------------------

balashashanka commented on issue #490: Fix for NUTCH-1863: Add JSON format dump output to readdb command
URL: https://github.com/apache/nutch/pull/490#issuecomment-568392227

   Hi @sebastian-nagel, I have made the changes. Thanks for the insight on the reason for using DataOutputStream.
   In the write function I have used Map<String, Object>, since the metadata is itself JSON.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

> Add JSON format dump output to readdb command
> ---------------------------------------------
>
>                 Key: NUTCH-1863
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1863
>             Project: Nutch
>          Issue Type: New Feature
>          Components: crawldb
>    Affects Versions: 2.3, 1.10
>            Reporter: Lewis John McGibbney
>            Assignee: Shashanka Balakuntala Srinivasa
>            Priority: Major
>             Fix For: 1.17
>
>
> Opening up the ability for third parties to consume Nutch crawldb data as
> JSON would be a positive thing IMHO.
> This issue should improve the readdb functionality of both 1.X to enable JSON
> dumps of crawldb data.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
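
For readers following the comment above, here is a minimal sketch of what a write function that takes a Map<String, Object> and emits JSON through a DataOutputStream could look like. It is not the code from pull request #490: the class name JsonDumpSketch, the helpers toJson and quote, the record fields "url" and "metadata", and the one-object-per-line layout are all assumptions made for illustration, and the serializer is hand-rolled only so that the example runs with the JDK alone.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch only; names and record layout are hypothetical.
public class JsonDumpSketch {

  // Serialize null, Boolean, (finite) Number, String or a nested Map as JSON text.
  public static String toJson(Object value) {
    if (value == null) {
      return "null";
    }
    if (value instanceof Number || value instanceof Boolean) {
      return value.toString();
    }
    if (value instanceof Map) {
      StringBuilder sb = new StringBuilder("{");
      boolean first = true;
      for (Map.Entry<?, ?> e : ((Map<?, ?>) value).entrySet()) {
        if (!first) {
          sb.append(',');
        }
        first = false;
        sb.append(quote(String.valueOf(e.getKey())))
          .append(':')
          .append(toJson(e.getValue()));
      }
      return sb.append('}').toString();
    }
    // Anything else is written as a JSON string.
    return quote(value.toString());
  }

  // Quote a string and escape the characters JSON requires to be escaped.
  public static String quote(String s) {
    StringBuilder sb = new StringBuilder("\"");
    for (int i = 0; i < s.length(); i++) {
      char c = s.charAt(i);
      switch (c) {
        case '"':  sb.append("\\\""); break;
        case '\\': sb.append("\\\\"); break;
        case '\n': sb.append("\\n"); break;
        case '\r': sb.append("\\r"); break;
        case '\t': sb.append("\\t"); break;
        default:
          if (c < 0x20) {
            sb.append(String.format("\\u%04x", (int) c));
          } else {
            sb.append(c);
          }
      }
    }
    return sb.append('"').toString();
  }

  // Write one record as a single line of UTF-8 encoded JSON.
  // The metadata map is nested as an object because it is already key/value data.
  public static void write(DataOutputStream out, String url,
      Map<String, Object> metadata) throws IOException {
    Map<String, Object> record = new LinkedHashMap<>();
    record.put("url", url);
    record.put("metadata", metadata);
    out.write(toJson(record).getBytes(StandardCharsets.UTF_8));
    out.write('\n');
  }

  public static void main(String[] args) throws IOException {
    ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    try (DataOutputStream out = new DataOutputStream(buffer)) {
      Map<String, Object> metadata = new LinkedHashMap<>();
      metadata.put("status", "db_fetched");
      metadata.put("retries", 0);
      write(out, "http://example.org/", metadata);
    }
    System.out.print(new String(buffer.toByteArray(), StandardCharsets.UTF_8));
  }
}
```

In a real code base a JSON library would normally replace the hand-written toJson and quote helpers; the point of the sketch is only that a Map<String, Object> lets nested metadata be written as a JSON object rather than as a pre-formatted string.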