[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2011-01-15 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-2282:


Attachment: SOLR-2282-concurrency-branch_3x.patch
SOLR-2282-concurrency-trunk.patch

Thanks for debugging this, Dawid! I think solution 2) you suggested would be 
the best because it applies both to version 3.4.2 of Carrot2 (currently used by 
Solr) and the 3.5.0 version (not yet released).

I'm attaching patches for Solr trunk and branch_3x that fix the concurrency 
issue and correct a typo in a log message output by 
{{LuceneLanguageModelFactory}}.

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Assignee: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282-concurrency-branch_3x.patch, 
> SOLR-2282-concurrency-trunk.patch, SOLR-2282-diagnostics.patch, 
> SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, 
> SOLR-2282.patch, SOLR-2282_test.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2011-01-13 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-2282:


Attachment: SOLR-2282-diagnostics.patch

Robert: I was using the random seed from the build result in the hope that it 
will fail the test for me. I'm still unable to get the exception though, with 
or without the seed. I suppose it shouldn't matter whether I run the complete 
test suite or just this one test method? (I was doing the latter to save time)

If you have a spare moment, would you be able check the following two things on 
your machine:

1. Apply the attached diagnostics patch and run the tests. If the test doesn't 
fail after the change, this means there's some concurrency issue in Carrot2's 
internal resource pooling mechanisms that we'll need to find. This patch is not 
a solution to the problem though, just a diagnostic measure.

2. It's paranoid, but can you run the test with the 
{{-Dargs=-XX:+TraceClassLoading}} option and check that there's no old (v3.4.0) 
Carrot2 JAR hiding in the bushes? Version 3.4.0 had a subtle bug that could be 
causing the exception. If there's no traces of Carrot2 3.4.0 JAR in the 
classpath, we'll need to do further inspection of our code.

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Assignee: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282-diagnostics.patch, SOLR-2282.patch, 
> SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, 
> SOLR-2282_test.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2011-01-12 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated SOLR-2282:
--

Attachment: SOLR-2282_test.patch

here's a patch to fix the BaseDistributedTestCase, so clustering and other 
contribs can set their own home and use it.

this fixes the unknown field problem, but i'm still seeing the zzBuffer array 
index out of bounds exception... perhaps 
my checkout is somehow out of date... maybe you can test the patch?


> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Assignee: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, 
> SOLR-2282.patch, SOLR-2282.patch, SOLR-2282_test.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2010-12-19 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-2282:
-

Attachment: SOLR-2282.patch

Forgot to svn add a file...

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, 
> SOLR-2282.patch, SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2010-12-19 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-2282:
-

Attachment: SOLR-2282.patch

Updated patch. I think this is ready to go.

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, 
> SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2010-12-19 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-2282:
-

Attachment: SOLR-2282.patch

Add a simple test for distributed mode contributed by Brad Giaccio in SOLR-769.

Currently, the test fails due to class path problem when launching Jetty:

{quote}
org/mortbay/jetty/SessionIdManager
java.lang.NoClassDefFoundError: org/mortbay/jetty/SessionIdManager
at 
org.apache.solr.BaseDistributedSearchTestCase.createJetty(BaseDistributedSearchTestCase.java:211)
at 
org.apache.solr.BaseDistributedSearchTestCase.createJetty(BaseDistributedSearchTestCase.java:202)
at 
org.apache.solr.BaseDistributedSearchTestCase.createServers(BaseDistributedSearchTestCase.java:148)
at 
org.apache.solr.BaseDistributedSearchTestCase.testDistribSearch(BaseDistributedSearchTestCase.java:566)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1104)
at 
org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1042)
Caused by: java.lang.ClassNotFoundException: org.mortbay.jetty.SessionIdManager
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
{quote}

(I have to move now, if someone solves the problem, it is welcome :)

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2010-12-19 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-2282:
-

Attachment: SOLR-2282.patch

Updated test to avoid deprecated version of cluster method.

> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch, SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-2282) Distributed Support for Search Result Clustering

2010-12-17 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-2282:
-

Attachment: SOLR-2282.patch

A patch attached. Currently, carrot.produceSummary doesn't work in distributed 
mode:

{code:title=ClusteringComponent.finishStage()}
// TODO: Currently, docIds is set to null in distributed environment.
// This causes CarrotParams.PRODUCE_SUMMARY doesn't work.
// To work CarrotParams.PRODUCE_SUMMARY under distributed mode, we can choose 
either one of:
// (a) In each shard, ClusteringComponent produces summary and finishStage()
// merges these summaries.
// (b) Adding doHighlighting(SolrDocumentList, ...) method to SolrHighlighter 
and
// making SolrHighlighter uses "external text" rather than stored values to 
produce snippets.
{code}


> Distributed Support for Search Result Clustering
> 
>
> Key: SOLR-2282
> URL: https://issues.apache.org/jira/browse/SOLR-2282
> Project: Solr
>  Issue Type: New Feature
>  Components: contrib - Clustering
>Affects Versions: 1.4, 1.4.1
>Reporter: Koji Sekiguchi
>Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to 
> incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org