[ https://issues.apache.org/jira/browse/NUTCH-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13762066#comment-13762066 ]
Daniel Ciborowski commented on NUTCH-1517: ------------------------------------------ Does this process work with the data stored in hdfs? or does it have to be stored on local file system? Still not able to get nutch to save segments though... But when I tried to use the index on my previously crawled data I am still getting the matched 0 files errors. > CloudSearch indexer > ------------------- > > Key: NUTCH-1517 > URL: https://issues.apache.org/jira/browse/NUTCH-1517 > Project: Nutch > Issue Type: New Feature > Components: indexer > Reporter: Julien Nioche > Fix For: 1.9 > > Attachments: 0023883254_1377197869_indexer-cloudsearch.patch > > > Once we have made the indexers pluggable, we should add a plugin for Amazon > CloudSearch. See http://aws.amazon.com/cloudsearch/. Apparently it uses a > JSON based representation Search Data Format (SDF), which we could reuse for > a file based indexer. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira