[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12638461#action_12638461 ]

Felix Z. commented on NUTCH-442:
--------------------------------

Hi,

really great! I use the Nutch/Solr integration patches for a theme-specific search engine, and I could never have reached this goal without this excellent work.

But since the latest patch (v.8), I am not able to put the fetched documents into Solr. The build is OK, and apache-solr-solrj and apache-solr-common are in /lib, so everything should be fine so far. Do I have to configure solrj, or add something like "nutch index -solr http:...." as in patch v.7?

I got this error:

Indexer: org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory file:/opt/nutch/crawls/0744/3/18/crawlData/crawldb already exists

Thanks for help
Felix.

> Integrate Solr/Nutch
> --------------------
>
>                 Key: NUTCH-442
>                 URL: https://issues.apache.org/jira/browse/NUTCH-442
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer, searcher
>         Environment: Ubuntu linux
>            Reporter: rubdabadub
>            Assignee: Doğacan Güney
>             Fix For: 1.0.0
>
>         Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch,
> NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt,
> NUTCH-442_v7a.patch.txt, NUTCH-442_v8.patch, NUTCH_442_v3.patch,
> RFC_multiple_search_backends.patch, schema.xml
>
>
> Hi:
> After trying out Sami's patch regarding Solr/Nutch, which can be found here
> (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html),
> I can confirm it worked :-) That led me to request the following:
> I would be very, very grateful if this could be included in Nutch 0.9, as I
> am trying to eliminate my Python-based crawler, which posts documents to Solr.
> As I am in a corporate environment, I can't install the trunk version in the
> production environment, thus I am asking for this to be included in the 0.9
> release. I hope my wish will be granted.
> I look forward to getting some feedback.
> Thank you.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.