[jira] Commented: (NUTCH-442) Integrate Solr/Nutch
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637545#action_12637545 ] Doğacan Güney commented on NUTCH-442: - Great! (I am obviously +1 on adding this before 1.0 :D ) So, can I get some reviews on what people think of this patch then? On solrj: I will send an updated patch that uses solrj instead. > Integrate Solr/Nutch > > > Key: NUTCH-442 > URL: https://issues.apache.org/jira/browse/NUTCH-442 > Project: Nutch > Issue Type: New Feature > Environment: Ubuntu linux >Reporter: rubdabadub > Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, > NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt, > NUTCH-442_v7a.patch.txt, NUTCH_442_v3.patch, > RFC_multiple_search_backends.patch, schema.xml > > > Hi: > After trying out Sami's patch regarding Solr/Nutch. Can be found here > (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html) > and I can confirm it worked :-) And that lead me to request the following : > I would be very very great full if this could be included in nutch 0.9 as I > am trying to eliminate my python based crawler which post documents to solr. > As I am in the corporate enviornment I can't install trunk version in the > production enviornment thus I am asking this to be included in 0.9 release. I > hope my wish would be granted. > I look forward to get some feedback. > Thank you. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637507#action_12637507 ] Andrzej Bialecki commented on NUTCH-442: - +1 on adding this before 1.0 - it would be a shame to miss this functionality when it's been asked for over and over. One change that should be made (either in this patch or as a follow-up) is to use SolrJ instead of plain HTTP. I don't think we need to branch for this - as long as the patch passes tests and runs basic commands IMHO it's good enough to expose a wider audience to it. Applying this to trunk/ actually gives us better chances that it will be tested by more people. > Integrate Solr/Nutch > > > Key: NUTCH-442 > URL: https://issues.apache.org/jira/browse/NUTCH-442 > Project: Nutch > Issue Type: New Feature > Environment: Ubuntu linux >Reporter: rubdabadub > Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, > NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt, > NUTCH-442_v7a.patch.txt, NUTCH_442_v3.patch, > RFC_multiple_search_backends.patch, schema.xml > > > Hi: > After trying out Sami's patch regarding Solr/Nutch. Can be found here > (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html) > and I can confirm it worked :-) And that lead me to request the following : > I would be very very great full if this could be included in nutch 0.9 as I > am trying to eliminate my python based crawler which post documents to solr. > As I am in the corporate enviornment I can't install trunk version in the > production enviornment thus I am asking this to be included in 0.9 release. I > hope my wish would be granted. > I look forward to get some feedback. > Thank you. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637489#action_12637489 ] Enis Soztutar commented on NUTCH-442: - I personally believe this patch should be in before 1.0, since it does not make sense to make such a change in 1.1. However since there is some need to test this patch more thoroughly, I guess we can make a branch and commit it there, so that people can test this easily. However branching has it's own problems, especially keeping in sync with trunk would get harder and harder. Since this issue has a large number of votes and watchers, I suggest we branch and commit it, test this out a little bit more, and merge to trunk before 1.0. > Integrate Solr/Nutch > > > Key: NUTCH-442 > URL: https://issues.apache.org/jira/browse/NUTCH-442 > Project: Nutch > Issue Type: New Feature > Environment: Ubuntu linux >Reporter: rubdabadub > Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, > NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt, > NUTCH-442_v7a.patch.txt, NUTCH_442_v3.patch, > RFC_multiple_search_backends.patch, schema.xml > > > Hi: > After trying out Sami's patch regarding Solr/Nutch. Can be found here > (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html) > and I can confirm it worked :-) And that lead me to request the following : > I would be very very great full if this could be included in nutch 0.9 as I > am trying to eliminate my python based crawler which post documents to solr. > As I am in the corporate enviornment I can't install trunk version in the > production enviornment thus I am asking this to be included in 0.9 release. I > hope my wish would be granted. > I look forward to get some feedback. > Thank you. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch
[ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637419#action_12637419 ] Doğacan Güney commented on NUTCH-442: - Thanks to everyone for comments. Unfortunately this patch will probably have to wait until after 1.0 to get in. But since many people are interested in having some sort of Solr integration in trunk maybe we can update Sami Siren's solr patch and commit it for 1.0. What do others think? > Integrate Solr/Nutch > > > Key: NUTCH-442 > URL: https://issues.apache.org/jira/browse/NUTCH-442 > Project: Nutch > Issue Type: New Feature > Environment: Ubuntu linux >Reporter: rubdabadub > Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, > NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt, > NUTCH-442_v7a.patch.txt, NUTCH_442_v3.patch, > RFC_multiple_search_backends.patch, schema.xml > > > Hi: > After trying out Sami's patch regarding Solr/Nutch. Can be found here > (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html) > and I can confirm it worked :-) And that lead me to request the following : > I would be very very great full if this could be included in nutch 0.9 as I > am trying to eliminate my python based crawler which post documents to solr. > As I am in the corporate enviornment I can't install trunk version in the > production enviornment thus I am asking this to be included in 0.9 release. I > hope my wish would be granted. > I look forward to get some feedback. > Thank you. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.