[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-08-24 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: carrot2-core-3.4.0-jdk1.5.jar

Hi Grant,

Thanks for committing the patches! I noticed that the 3.x branch build failed 
because Carrot2 JAR had classes in Java 1.6 format. I'm attaching a Java 
1.5-compliant JAR. After replacing the original JAR with the attached one, all 
Solr tests passed on Java 1.5 on my machine. Apologies for not checking this 
earlier.

Also, I believe the last paragraph of contrib/clustering/README.txt does not 
hold any more as all JARs are now distributed with Solr.

Staszek

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Fix For: 3.1, 4.0
>
> Attachments: carrot2-core-3.4.0-jdk1.5.jar, 
> SOLR-1804-carrot2-3.4.0-dev-trunk.patch, SOLR-1804-carrot2-3.4.0-dev.patch, 
> SOLR-1804-carrot2-3.4.0-libs.zip, SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-08-23 Thread Grant Ingersoll (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated SOLR-1804:
--

Fix Version/s: 3.1

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-1804-carrot2-3.4.0-dev-trunk.patch, 
> SOLR-1804-carrot2-3.4.0-dev.patch, SOLR-1804-carrot2-3.4.0-libs.zip, 
> SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-08-20 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-libs.zip

Here are the libs with Carrot2 3.4.0 JAR.

1. Apply the patch (the patch hasn't changed)
2. Copy the libs from the ZIP overwriting the old ones
3. Remove Google collections from solr/lib (it's replaced by Guava from the 
ZIP). If you don't do that, tests will fail due to class path conflicts.

I've just tested this on my machine with the latest branch_3x (r966551) and all 
tests pass. If some tests fail for you, let me know and I'll investigate.

S.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-trunk.patch, 
> SOLR-1804-carrot2-3.4.0-dev.patch, SOLR-1804-carrot2-3.4.0-libs.zip, 
> SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-08-20 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: (was: SOLR-1804-carrot2-3.4.0-dev-libs.zip)

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-trunk.patch, 
> SOLR-1804-carrot2-3.4.0-dev.patch, SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-08-20 Thread Grant Ingersoll (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated SOLR-1804:
--

Attachment: SOLR-1804.patch

Brings this up to date, fixing some build issues.  The Carrot test does not 
pass.  Stazsek, perhaps you can take this, upgrade to latest C2 and get the 
test to work?

Then, I can commit.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-libs.zip, 
> SOLR-1804-carrot2-3.4.0-dev-trunk.patch, SOLR-1804-carrot2-3.4.0-dev.patch, 
> SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-28 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-dev-trunk.patch

A patch against solr trunk, the libs are the same as for the branch_3x patch.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-libs.zip, 
> SOLR-1804-carrot2-3.4.0-dev-trunk.patch, SOLR-1804-carrot2-3.4.0-dev.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-22 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-dev.patch

Ok, here's another shot. This time, the language model factory includes support 
for Chinese. To avoid compilation issues, the classes are loaded through 
reflection. Not pretty, but works. If there's a way to have access to smart 
chinese at compilation time, let me know, I can remove the reflection stuff, so 
that the refactoring is more reliable.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-22 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: (was: SOLR-1804-carrot2-3.4.0-dev.patch)

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-22 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: (was: SOLR-1804-carrot2-3.4.0-dev-libs.zip)

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-22 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-dev-libs.zip

Updated dependencies.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-libs.zip, 
> SOLR-1804-carrot2-3.4.0-dev.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-dev-libs.zip

Libs required for the Carrot2 3.4.0 update.

> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-libs.zip, 
> SOLR-1804-carrot2-3.4.0-dev.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stanislaw Osinski updated SOLR-1804:


Attachment: SOLR-1804-carrot2-3.4.0-dev.patch

Hi,

As we're near the 3.4.0 release of Carrot2, I'm including a patch that upgrades 
the clustering plugin. The most notable changes are:

* [3.4.0] Carrot2 core no longer depends on Lucene APIs, so the {{build.xml}} 
can be enabled again. The only class that makes use of Lucene API, 
{{LuceneLanguageModelFactory}}, is now included in the plugin's code, so there 
shouldn't be any problems with refactoring. In fact, I've already updated 
{{LuceneLanguageModelFactory}} to remove the use of deprecated APIs.
* [3.3.0] The STC algorithm has seen some [significant scalability 
improvements|http://project.carrot2.org/release-3.3.0-notes.html]
* [3.2.0] Carrot2 core no longer depends on LGPL libraries, so all the JARs can 
now be included in Solr SVN and SOLR-2007 won't need fixing.

Included is a patch against r966211. A ZIP with JARs will follow in a sec.

A couple of notes:

* The upgrade requires upgrading Google collections to Guava. This is a drop-in 
replacement, all tests pass for me after the upgrade, plus the upgrade is 
[recommended|http://code.google.com/p/google-collections/] on the original 
Google Collections site.
* The patch includes Carrot2 3.4.0-dev JAR, but I guess it's worth committing 
already to avoid the library downloads hassle (SOLR-2007).
* Originally, Carrot2 supports clustering of Chinese content based on the Smart 
Chinese Tokenizer. This tokenizer would have to be referenced from the 
{{LuceneLanguageModelFactory}} class in Solr. However, when compiling the code 
in Ant, this smartcn doesn't seem available in the classpath. Is it a matter of 
modifying the build files, or it's a policy on dependencies between plugins?

Let me know if you have any problems applying the patch.

Thanks!

S.


> Upgrade Carrot2 to 3.2.0
> 
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Clustering
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org