[jira] [Commented] (NUTCH-1517) CloudSearch indexer

Daniel Ciborowski (JIRA) Mon, 09 Sep 2013 13:22:19 -0700

    [ 
https://issues.apache.org/jira/browse/NUTCH-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13762066#comment-13762066
 ]


Daniel Ciborowski commented on NUTCH-1517:
------------------------------------------

Does this process work with data stored in HDFS, or does the data have to be 
stored on the local file system? I am still not able to get Nutch to save 
segments, though. When I tried to use the index on my previously crawled data, 
I am still getting the "matched 0 files" errors.

                
> CloudSearch indexer
> -------------------
>
>                 Key: NUTCH-1517
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1517
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer
>            Reporter: Julien Nioche
>             Fix For: 1.9
>
>         Attachments: 0023883254_1377197869_indexer-cloudsearch.patch
>
>
> Once we have made the indexers pluggable, we should add a plugin for Amazon 
> CloudSearch. See http://aws.amazon.com/cloudsearch/. Apparently it uses a 
> JSON-based representation, the Search Data Format (SDF), which we could reuse 
> for a file-based indexer.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
