[jira] [Issue Comment Edited] (LUCENE-3921) Add decompose compound Japanese Katakana token capability to Kuromoji

2012-03-26 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13239195#comment-13239195 ] Christian Moen edited comment on LUCENE-3921 at 3/27/12 5:32 AM

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237826#comment-13237826 ] Christian Moen commented on LUCENE-3915: Thanks, Robert. I'm thinking it could

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237863#comment-13237863 ] Christian Moen commented on LUCENE-3915: Thanks, Robert. Understood. I'll run

[jira] [Created] (LUCENE-3916) Consider different query and index segmentation for Japanese

2012-03-25 Thread Christian Moen (Created) (JIRA)
Components: modules/analysis Affects Versions: 3.6, 4.0 Reporter: Christian Moen Priority: Minor Kuromoji today uses search mode segmentation both at query and index time. The benefit with search mode segmentation is that it segments compounds such as 関西国際空

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237874#comment-13237874 ] Christian Moen commented on LUCENE-3915: Committed revision 1305046 on {{trunk

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237886#comment-13237886 ] Christian Moen commented on LUCENE-3915: Committed revision 1305051 and 1305052

[jira] [Resolved] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen resolved LUCENE-3915. Resolution: Fixed Add Japanese filter to replace term attribute with readings

[jira] [Updated] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3915: --- Component/s: modules/analysis Affects Version/s: 4.0 3.6

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237887#comment-13237887 ] Christian Moen commented on LUCENE-3915: Thanks, Robert and Koji

[jira] [Commented] (LUCENE-3909) Move Kuromoji to analysis.ja and introduce Japanese* naming

2012-03-25 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237897#comment-13237897 ] Christian Moen commented on LUCENE-3909: Thanks, Koji. I hope to do the move

[jira] [Commented] (LUCENE-3888) split off the spell check word and surface form in spell check dictionary

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237457#comment-13237457 ] Christian Moen commented on LUCENE-3888: This is excellent, Koji and Robert. We

[jira] [Updated] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3901: --- Attachment: LUCENE-3901.patch Add katakana stem filter to better deal with certain

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237461#comment-13237461 ] Christian Moen commented on LUCENE-3901: Updated patch with minor whitespace

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237462#comment-13237462 ] Christian Moen commented on LUCENE-3901: Committed revision 1304719 on {{trunk

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237475#comment-13237475 ] Christian Moen commented on LUCENE-3901: Committed revision 1304727

[jira] [Resolved] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen resolved LUCENE-3901. Resolution: Fixed Add katakana stem filter to better deal with certain katakana

[jira] [Issue Comment Edited] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237475#comment-13237475 ] Christian Moen edited comment on LUCENE-3901 at 3/24/12 8:10 AM

[jira] [Created] (LUCENE-3909) Move Kuromoji to analysis.ja and introduce Japanese* naming

2012-03-24 Thread Christian Moen (Created) (JIRA)
Components: modules/analysis Affects Versions: 3.6, 4.0 Reporter: Christian Moen Lucene/Solr 3.6 and 4.0 will get out-of-the-box Japanese language support through {{KuromojiAnalyzer}}, {{KuromojiTokenizer}} and various other filters. These filters currently live

[jira] [Issue Comment Edited] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-24 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237475#comment-13237475 ] Christian Moen edited comment on LUCENE-3901 at 3/24/12 10:48 AM

[jira] [Commented] (LUCENE-3909) Move Kuromoji to analysis.ja and introduce Japanese* naming

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237566#comment-13237566 ] Christian Moen commented on LUCENE-3909: Thanks, Robert and Mike. It would

[jira] [Assigned] (LUCENE-3909) Move Kuromoji to analysis.ja and introduce Japanese* naming

2012-03-24 Thread Christian Moen (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen reassigned LUCENE-3909: -- Assignee: Christian Moen Move Kuromoji to analysis.ja and introduce Japanese

[jira] [Created] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-24 Thread Christian Moen (Created) (JIRA)
Reporter: Christian Moen Priority: Minor Koji and Robert are working on LUCENE-3888 that allows spell-checkers to do their similarity matching using a different word than its surface form. This approach is very useful for languages such as Japanese where the surface form

[jira] [Commented] (LUCENE-3915) Add Japanese filter to replace term attribute with readings

2012-03-24 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13237674#comment-13237674 ] Christian Moen commented on LUCENE-3915: Find attached a draft patch

[jira] [Created] (LUCENE-3901) Add katakana filter to better deal with katakana spelling variants

2012-03-22 Thread Christian Moen (Created) (JIRA)
: New Feature Components: modules/analysis Reporter: Christian Moen Fix For: 3.6, 4.0 Many Japanese katakana words end in a long sound that is sometimes optional. For example, パーティー and パーティ are both perfectly valid for party. Similarly we have センター and センタ
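
The katakana variants described in this issue can be illustrated with a minimal sketch. This is not the patch attached to LUCENE-3901: the function name `stem_katakana` and the `minimum_length` threshold of 4 are assumptions chosen for illustration. The idea shown is to drop a trailing prolonged sound mark (ー, U+30FC) from an all-katakana term so that both spellings normalize to the same form.

```python
PROLONGED_SOUND_MARK = "\u30fc"  # the katakana long-vowel mark "ー"


def is_katakana(ch):
    """Return True if ch lies in the Unicode Katakana block (U+30A0..U+30FF)."""
    return "\u30a0" <= ch <= "\u30ff"


def stem_katakana(term, minimum_length=4):
    """Strip one trailing prolonged sound mark from an all-katakana term.

    minimum_length is a hypothetical threshold: shorter terms are left
    untouched so that very short words are not over-stemmed.
    """
    if (len(term) >= minimum_length
            and term.endswith(PROLONGED_SOUND_MARK)
            and all(is_katakana(ch) for ch in term)):
        return term[:-1]
    return term


if __name__ == "__main__":
    for word in ["パーティー", "パーティ", "センター", "コピー"]:
        print(word, "->", stem_katakana(word))
```

Under these assumptions パーティー and パーティ both map to パーティ, while a three-character term such as コピー is left unchanged because it falls below the length threshold.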

[jira] [Updated] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3901: --- Summary: Add katakana stem filter to better deal with certain katakana spelling variants

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235497#comment-13235497 ] Christian Moen commented on LUCENE-3901: Patch for this coming up shortly

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235517#comment-13235517 ] Christian Moen commented on LUCENE-3897: Committed revision 1303739 on {{trunk

[jira] [Resolved] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-22 Thread Christian Moen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen resolved LUCENE-3897. Resolution: Fixed Thanks a lot, Mike and Robert! KuromojiTokenizer

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235531#comment-13235531 ] Christian Moen commented on LUCENE-3897: Committed revision 1303744

Re: 3.6 branching

2012-03-22 Thread Christian Moen
Robert, I think this is a very good idea. +1. Christian http://atilika.com On Mar 22, 2012, at 8:48 PM, Robert Muir wrote: Hello, I propose for 3.6 that we don't create a release branch but just use our branch_3x as the release branch. We can 'svn mv' it to 'lucene_solr_3_6' when the

[jira] [Updated] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3901: --- Attachment: LUCENE-3901.patch Add katakana stem filter to better deal with certain

[jira] [Updated] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3901: --- Attachment: LUCENE-3901.patch Add katakana stem filter to better deal with certain

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235746#comment-13235746 ] Christian Moen commented on LUCENE-3901: Find attached a patch

[jira] [Assigned] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen reassigned LUCENE-3901: -- Assignee: Christian Moen Add katakana stem filter to better deal with certain

[jira] [Commented] (LUCENE-3901) Add katakana stem filter to better deal with certain katakana spelling variants

2012-03-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235877#comment-13235877 ] Christian Moen commented on LUCENE-3901: Thanks a lot, Robert. I'll do some more

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-21 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234268#comment-13234268 ] Christian Moen commented on LUCENE-3897: The assertion suggests that backtracking

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-21 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234456#comment-13234456 ] Christian Moen commented on LUCENE-3897: I've been trying to make an even more

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-21 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235352#comment-13235352 ] Christian Moen commented on LUCENE-3897: Thanks a lot, Mike. +1! I've been

[jira] [Assigned] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-21 Thread Christian Moen (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen reassigned LUCENE-3897: -- Assignee: Christian Moen KuromojiTokenizer fails with large docs

[jira] [Commented] (LUCENE-3887) 'ant javadocs' should fail if a package is missing a package.html

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13233604#comment-13233604 ] Christian Moen commented on LUCENE-3887: Robert, I should be careful

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234043#comment-13234043 ] Christian Moen commented on LUCENE-3897: Thanks, Robert. I'll have a look

[jira] [Commented] (LUCENE-3887) 'ant javadocs' should fail if a package is missing a package.html

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234046#comment-13234046 ] Christian Moen commented on LUCENE-3887: Robert, very good point regarding

[jira] [Commented] (LUCENE-3895) Not getting random-seed/reproduce-with if a test fails from another thread

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234089#comment-13234089 ] Christian Moen commented on LUCENE-3895: Thanks, Robert. I'm trying

[jira] [Commented] (LUCENE-3897) KuromojiTokenizer fails with large docs

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234096#comment-13234096 ] Christian Moen commented on LUCENE-3897: Robert, your change to LUCENE-3895

[jira] [Commented] (LUCENE-3895) Not getting random-seed/reproduce-with if a test fails from another thread

2012-03-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234112#comment-13234112 ] Christian Moen commented on LUCENE-3895: This does the job very well and I can

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-10 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13226868#comment-13226868 ] Christian Moen commented on LUCENE-3767: Thanks for the feedback. Mike, you

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-10 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13226872#comment-13226872 ] Christian Moen commented on LUCENE-3767: Committed revision 1299213

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-10 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13226881#comment-13226881 ] Christian Moen commented on LUCENE-3767: Confirmed this working in a {{branch_3x

[jira] [Resolved] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-10 Thread Christian Moen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen resolved LUCENE-3767. Resolution: Fixed Explore streaming Viterbi search in Kuromoji

[jira] [Updated] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-06 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3767: --- Attachment: LUCENE-3767_branch_3x.patch Explore streaming Viterbi search in Kuromoji

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-06 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13224025#comment-13224025 ] Christian Moen commented on LUCENE-3767: I've attached a patch for {{branch_3x

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-04 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13221903#comment-13221903 ] Christian Moen commented on LUCENE-3767: Committed to {{trunk}} with revision

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-03 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13221544#comment-13221544 ] Christian Moen commented on LUCENE-3767: Thanks, Mike. I'll commit

[jira] [Commented] (LUCENE-3801) Generify FST shortestPaths() to take a comparator

2012-03-02 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13220909#comment-13220909 ] Christian Moen commented on LUCENE-3801: If it's possible to speed things up

[jira] [Commented] (LUCENE-3801) Generify FST shortestPaths() to take a comparator

2012-03-02 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13220919#comment-13220919 ] Christian Moen commented on LUCENE-3801: Exactly. If we match on normalized

[jira] [Assigned] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-03-01 Thread Christian Moen (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen reassigned LUCENE-3767: -- Assignee: Christian Moen (was: Michael McCandless) Explore streaming Viterbi

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-29 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13219015#comment-13219015 ] Christian Moen commented on LUCENE-3767: Thanks, Mike. I've tried the latest

Re: Welcome Stefan Matheis

2012-02-29 Thread Christian Moen
Welcome Stefan! Excellent UI work! On Mar 1, 2012, at 6:04 AM, Ryan McKinley wrote: I'm pleased to announce that Stefan Matheis has joined our ranks as a committer. He has given the solr admin UI some much needed love. It now looks like it belongs in 2012! Stefan, it is tradition

[jira] [Updated] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-28 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3767: --- Attachment: SolrXml-5498.xml Explore streaming Viterbi search in Kuromoji

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-28 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13218026#comment-13218026 ] Christian Moen commented on LUCENE-3767: Mike, Thanks a lot for this. I've been

[jira] [Commented] (LUCENE-3819) Clean up what we show in right side bar of website.

2012-02-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13213740#comment-13213740 ] Christian Moen commented on LUCENE-3819: +1, Mark. +1, Yonik. I also think

[jira] [Issue Comment Edited] (LUCENE-3819) Clean up what we show in right side bar of website.

2012-02-22 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13213740#comment-13213740 ] Christian Moen edited comment on LUCENE-3819 at 2/22/12 4:21 PM

[jira] [Commented] (LUCENE-3819) Clean up what we show in right side bar of website.

2012-02-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13213776#comment-13213776 ] Christian Moen commented on LUCENE-3819: Thanks, Mark. I like your idea

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13214196#comment-13214196 ] Christian Moen commented on LUCENE-3767: I agree completely; we should definitely

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13214217#comment-13214217 ] Christian Moen commented on LUCENE-3767: Robert, some comments are below. {quote

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-22 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13214227#comment-13214227 ] Christian Moen commented on LUCENE-3767: {quote} I left the default mode

[jira] [Commented] (LUCENE-3811) remove unused benchmark dependencies

2012-02-21 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212529#comment-13212529 ] Christian Moen commented on LUCENE-3811: Good cleaning job. +1

[jira] [Commented] (LUCENE-3767) Explore streaming Viterbi search in Kuromoji

2012-02-21 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212623#comment-13212623 ] Christian Moen commented on LUCENE-3767: Mike, Thanks a lot for this. I'd meant

[jira] [Commented] (SOLR-2909) Add support for ResourceLoaderAware tokenizerFactories in synonym filter factories

2012-02-20 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211736#comment-13211736 ] Christian Moen commented on SOLR-2909: -- Thanks. Sekiguchi-san, I'm happy to test

Re: Welcome Sami Siren as committer

2012-02-19 Thread Christian Moen
Congratulations! On Feb 20, 2012, at 12:32 AM, Robert Muir wrote: I'm pleased to announce that Sami Siren has joined our ranks as a committer. He has been contributing various patches to Lucene/Solr, especially to Solr's distributed indexing capabilities. Sami, it's tradition that you

[jira] [Commented] (SOLR-2909) Add support for ResourceLoaderAware tokenizerFactories in synonym filter factories

2012-02-19 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211711#comment-13211711 ] Christian Moen commented on SOLR-2909: -- Ohtani-san, good catch. Sekiguchi-san

Re: Welcome Christian Moen as committer

2012-02-15 Thread Christian Moen
with computers. I'm also doing some non-profit work for a business organization that promotes trade relations between Norway and Japan. Best, Christian On Feb 15, 2012, at 6:48 AM, Robert Muir wrote: I'm pleased to announce that Christian Moen has joined our ranks as a committer. He has been

[jira] [Updated] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-09 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated SOLR-3056: - Attachment: SOLR-3056.patch Introduce Japanese field type in schema.xml

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-09 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204438#comment-13204438 ] Christian Moen commented on SOLR-3056: -- I agree, Robert. I'll add suitable

[jira] [Updated] (LUCENE-3751) Align default Japanese configurations for Lucene and Solr

2012-02-09 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3751: --- Attachment: LUCENE-3751.patch Align default Japanese configurations for Lucene

[jira] [Updated] (LUCENE-3751) Align default Japanese configurations for Lucene and Solr

2012-02-09 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated LUCENE-3751: --- Attachment: LUCENE-3751.patch Align default Japanese configurations for Lucene

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-09 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204448#comment-13204448 ] Christian Moen commented on SOLR-3056: -- {{KuromojiAnalyzer}} (LUCENE-3751) has also

[jira] [Created] (SOLR-3115) Improve default Japanese stopwords.txt description

2012-02-09 Thread Christian Moen (Created) (JIRA)
Reporter: Christian Moen Priority: Minor As discussed in SOLR-3056, the description in the default Japanese stopwords.txt should be improved to describe case- and width-handling. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please

[jira] [Updated] (SOLR-3115) Improve default Japanese stopwords.txt description

2012-02-09 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated SOLR-3115: - Attachment: SOLR-3115.patch Improve default Japanese stopwords.txt description

[jira] [Commented] (SOLR-3115) Improve default Japanese stopwords.txt description

2012-02-09 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204459#comment-13204459 ] Christian Moen commented on SOLR-3115: -- A patch is attached with an improved

[jira] [Issue Comment Edited] (SOLR-3115) Improve default Japanese stopwords.txt description

2012-02-09 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204459#comment-13204459 ] Christian Moen edited comment on SOLR-3115 at 2/9/12 11:34 AM

[jira] [Updated] (SOLR-3115) Improve default Japanese stopwords.txt description

2012-02-09 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated SOLR-3115: - Affects Version/s: 4.0 3.6 Improve default Japanese stopwords.txt

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-09 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204461#comment-13204461 ] Christian Moen commented on SOLR-3056: -- An improved description of {{stopwords.txt

[jira] [Commented] (LUCENE-3751) Align default Japanese configurations for Lucene and Solr

2012-02-09 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204464#comment-13204464 ] Christian Moen commented on LUCENE-3751: Updated patch that now uses

[jira] [Issue Comment Edited] (LUCENE-3751) Align default Japanese configurations for Lucene and Solr

2012-02-09 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204464#comment-13204464 ] Christian Moen edited comment on LUCENE-3751 at 2/9/12 11:53 AM

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-08 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203616#comment-13203616 ] Christian Moen commented on SOLR-3056: -- Thanks a lot, Robert. bq. I'll open up

[jira] [Issue Comment Edited] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-08 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203616#comment-13203616 ] Christian Moen edited comment on SOLR-3056 at 2/8/12 2:05 PM

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-08 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203662#comment-13203662 ] Christian Moen commented on SOLR-3056: -- Thanks, Robert. I was thinking to leave

[jira] [Commented] (LUCENE-3751) Align default Japanese configurations for Lucene and Solr

2012-02-08 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203666#comment-13203666 ] Christian Moen commented on LUCENE-3751: Thanks a lot, Robert. Let's put

[jira] [Issue Comment Edited] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-08 Thread Christian Moen (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203662#comment-13203662 ] Christian Moen edited comment on SOLR-3056 at 2/8/12 3:45 PM

[jira] [Commented] (SOLR-3097) Introduce default Japanese stoptags and stopwords to Solr's example configuration

2012-02-07 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202438#comment-13202438 ] Christian Moen commented on SOLR-3097: -- Robert, I agree. Would a patch that contains

[jira] [Commented] (SOLR-3105) Add analysis configurations for different languages to the example

2012-02-07 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202524#comment-13202524 ] Christian Moen commented on SOLR-3105: -- This looks very good and makes it a whole lot

[jira] [Created] (SOLR-3107) Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

2012-02-07 Thread Christian Moen (Created) (JIRA)
: Improvement Components: contrib - LangId Affects Versions: 3.6, 4.0 Reporter: Christian Moen Priority: Minor The {{language-detection}} library used by {{LangDetectLanguageIdentifierUpdateProcessor}} uses a random sampling feature enabled by default as a means

[jira] [Commented] (SOLR-3107) Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

2012-02-07 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202613#comment-13202613 ] Christian Moen commented on SOLR-3107: -- Attached a trivial patch tested on {{trunk

[jira] [Updated] (SOLR-3107) Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

2012-02-07 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated SOLR-3107: - Attachment: SOLR-3107.patch Disable random sampling

[jira] [Commented] (SOLR-3105) Add analysis configurations for different languages to the example

2012-02-07 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203134#comment-13203134 ] Christian Moen commented on SOLR-3105: -- Hoss, +1. Add

[jira] [Updated] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-05 Thread Christian Moen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Moen updated SOLR-3056: - Attachment: SOLR-3056_schema40.patch Introduce Japanese field type in schema.xml

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-05 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13200688#comment-13200688 ] Christian Moen commented on SOLR-3056: -- Stopwords and stoptags for Solr are now

[jira] [Commented] (SOLR-3056) Introduce Japanese field type in schema.xml

2012-02-05 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13200689#comment-13200689 ] Christian Moen commented on SOLR-3056: -- Updated patch for {{schema.xml}} on {{trunk
