[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2013-05-22 Thread Sebastian Nagel (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-945:
--

Fix Version/s: 1.8

> Indexing to multiple SOLR Servers
> -
>
> Key: NUTCH-945
> URL: https://issues.apache.org/jira/browse/NUTCH-945
> Project: Nutch
>  Issue Type: Improvement
>  Components: indexer
> Affects Versions: 1.2
> Reporter: Charan Malemarpuram
> Fix For: 2.3, 1.8
>
> Attachments: MurmurHashPartitioner.java, 
> NonPartitioningPartitioner.java, patch-NUTCH-945.txt
>
>
> It would be nice to have a default Indexer in Nutch that can submit docs to
> multiple SOLR servers.
> Partitioning is always the question when writing to multiple SOLR servers.
> The default partitioning could be a simple hashcode-based distribution, with
> additional hooks for customization.
>  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2013-01-12 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-945:
---

Fix Version/s: 2.2



[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2013-01-12 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-945:
---

Patch Info: Patch Available



[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2012-02-28 Thread Sujit Pal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sujit Pal updated NUTCH-945:


Attachment: MurmurHashPartitioner.java

I built this as an example to verify that setting a custom partitioner works.
It uses o.a.h...MurmurHash as its base and may be useful as an example of a
custom Partitioner.
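
For readers without the attachment, here is a minimal sketch of what a hash-based partitioner of this kind might look like. It is not the attached MurmurHashPartitioner.java: the DocPartitioner interface and the class names are hypothetical, and String.hashCode() stands in for the MurmurHash base so that the example runs on a plain JDK.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical contract; the attached patch may define its own. */
interface DocPartitioner {
    /** Maps a document id to a server index in [0, numServers). */
    int partition(String docId, int numServers);
}

/** Hash-code based partitioner; String.hashCode() stands in for MurmurHash. */
class HashCodePartitioner implements DocPartitioner {
    @Override
    public int partition(String docId, int numServers) {
        if (numServers <= 0) {
            throw new IllegalArgumentException("numServers must be positive");
        }
        // Clear the sign bit so the modulo result is never negative.
        return (docId.hashCode() & Integer.MAX_VALUE) % numServers;
    }
}

class HashCodePartitionerDemo {
    public static void main(String[] args) {
        DocPartitioner partitioner = new HashCodePartitioner();
        List<String> ids = Arrays.asList(
                "http://example.org/a",
                "http://example.org/b",
                "http://example.org/c");
        for (String id : ids) {
            System.out.println(id + " -> server " + partitioner.partition(id, 3));
        }
    }
}
```

The same document id always maps to the same server index, so re-indexing a page updates it in place rather than duplicating it on another server. That holds only as long as the number of servers stays the same.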





[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2012-02-28 Thread Sujit Pal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sujit Pal updated NUTCH-945:


Attachment: NonPartitioningPartitioner.java

A partitioner that always returns 0, for handling the single SOLR server case.
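
As a rough illustration of the idea (not the attached file), a partitioner that ignores its input sends every document to index 0, so a single-server setup behaves as it did before partitioning was introduced. The class and method names below are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for the attached class: always picks server 0. */
class ConstantZeroPartitioner {
    int partition(String docId, int numServers) {
        return 0;
    }
}

class SingleServerDemo {
    /** Groups document ids into one bucket per server using the partitioner. */
    static List<List<String>> route(List<String> docIds, int numServers) {
        ConstantZeroPartitioner partitioner = new ConstantZeroPartitioner();
        List<List<String>> buckets = new ArrayList<>();
        for (int i = 0; i < numServers; i++) {
            buckets.add(new ArrayList<>());
        }
        for (String id : docIds) {
            buckets.get(partitioner.partition(id, numServers)).add(id);
        }
        return buckets;
    }

    public static void main(String[] args) {
        List<List<String>> buckets = route(Arrays.asList("doc-1", "doc-2", "doc-3"), 1);
        System.out.println("server 0 receives " + buckets.get(0).size() + " docs");
    }
}
```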





[jira] [Updated] (NUTCH-945) Indexing to multiple SOLR Servers

2012-02-28 Thread Sujit Pal (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sujit Pal updated NUTCH-945:


Attachment: patch-NUTCH-945.txt

Patch file to make updates to SolrConstants (add new property), SolrIndexerJob 
(handle commit to multiple solr servers) and SolrWriter (instantiate and invoke 
partitioner).
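
The flow described above (the writer picks a server per document through the partitioner, and the job commits on every server at the end) can be sketched roughly as follows. This is an in-memory illustration under assumed names, not code from the patch: TargetSelector, FakeServer and MultiServerWriter are hypothetical, and no SOLR client is involved.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical partitioner contract, not the interface from the patch. */
interface TargetSelector {
    int select(String docId, int numServers);
}

/** In-memory stand-in for one SOLR server; a real writer would hold a client. */
class FakeServer {
    final List<String> buffered = new ArrayList<>();
    final List<String> committed = new ArrayList<>();

    void add(String docId) {
        buffered.add(docId);
    }

    void commit() {
        committed.addAll(buffered);
        buffered.clear();
    }
}

/** Routes each document to one server and commits on every server at the end. */
class MultiServerWriter {
    private final List<FakeServer> servers;
    private final TargetSelector selector;

    MultiServerWriter(List<FakeServer> servers, TargetSelector selector) {
        if (servers.isEmpty()) {
            throw new IllegalArgumentException("at least one server is required");
        }
        this.servers = servers;
        this.selector = selector;
    }

    void write(String docId) {
        int index = selector.select(docId, servers.size());
        servers.get(index).add(docId);
    }

    void commitAll() {
        // A document only becomes visible once its own server has committed.
        for (FakeServer server : servers) {
            server.commit();
        }
    }
}

class MultiServerWriterDemo {
    public static void main(String[] args) {
        List<FakeServer> servers = new ArrayList<>();
        servers.add(new FakeServer());
        servers.add(new FakeServer());
        TargetSelector byHash = (id, n) -> (id.hashCode() & Integer.MAX_VALUE) % n;

        MultiServerWriter writer = new MultiServerWriter(servers, byHash);
        writer.write("http://example.org/a");
        writer.write("http://example.org/b");
        writer.commitAll();

        for (int i = 0; i < servers.size(); i++) {
            System.out.println("server " + i + ": " + servers.get(i).committed);
        }
    }
}
```

Passing the selector into the writer keeps the routing policy replaceable, in line with the customization hooks the issue asks for.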

