[jira] Commented: (SOLR-1078) WordDelimiterFilter do wrong word breaking for Thai vowel

2009-05-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12707203#action_12707203 ] Robert Muir commented on SOLR-1078: --- thai vowels are neither, they are Character.getType(c

[jira] Commented: (SOLR-1078) WordDelimiterFilter do wrong word breaking for Thai vowel

2009-05-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12707511#action_12707511 ] Robert Muir commented on SOLR-1078: --- looks pretty good... i was concerned about the split

[jira] Commented: (SOLR-1078) WordDelimiterFilter do wrong word breaking for Thai vowel

2009-05-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12707522#action_12707522 ] Robert Muir commented on SOLR-1078: --- i think so, U+005E CIRCUMFLEX ACCENT, U+0060 GRAVE AC

[jira] Commented: (SOLR-1204) Enhance SpellingQueryConverter to handle UTF-8 instead of ASCII only

2009-06-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716597#action_12716597 ] Robert Muir commented on SOLR-1204: --- hi, michael. It's not for some languages. I recommend

[jira] Commented: (SOLR-1204) Enhance SpellingQueryConverter to handle UTF-8 instead of ASCII only

2009-06-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12716763#action_12716763 ] Robert Muir commented on SOLR-1204: --- those others you mentioned also look good... in fact

[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

2009-06-18 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721398#action_12721398 ] Robert Muir commented on SOLR-1231: --- expanding on what yonik says, looks like the servlet

[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

2009-06-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721814#action_12721814 ] Robert Muir commented on SOLR-1231: --- oops you are right, ignore what i said :) > query pa

[jira] Created: (SOLR-1266) WordDelimiterFilter: option to disable english possessive stemming

2009-07-08 Thread Robert Muir (JIRA)
WordDelimiterFilter: option to disable english possessive stemming -- Key: SOLR-1266 URL: https://issues.apache.org/jira/browse/SOLR-1266 Project: Solr Issue Type: Improvement

[jira] Updated: (SOLR-1266) WordDelimiterFilter: option to disable english possessive stemming

2009-07-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1266: -- Attachment: SOLR-1266.txt patch that adds the option; the default is the existing behavior (true) > WordDelimite

[jira] Commented: (SOLR-1266) WordDelimiterFilter: option to disable english possessive stemming

2009-07-09 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12729450#action_12729450 ] Robert Muir commented on SOLR-1266: --- Yonik, thanks. I wasn't sure about back-compat requi

[jira] Commented: (SOLR-1266) WordDelimiterFilter: option to disable english possessive stemming

2009-07-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12729865#action_12729865 ] Robert Muir commented on SOLR-1266: --- Yonik, sure. I'll update AnalyzersTokenizersTokenFilt

[jira] Commented: (SOLR-1279) ApostropheTokenizer

2009-07-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731125#action_12731125 ] Robert Muir commented on SOLR-1279: --- Sergey, have you looked at SOLR-1266? By using the n

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-07-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12737592#action_12737592 ] Robert Muir commented on SOLR-1321: --- andrzej, i really like this feature. one question tho

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-07-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12737601#action_12737601 ] Robert Muir commented on SOLR-1321: --- andrzej i see what you are saying. I think it's a grea

[jira] Issue Comment Edited: (SOLR-1321) Support for efficient leading wildcards search

2009-07-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12737601#action_12737601 ] Robert Muir edited comment on SOLR-1321 at 7/31/09 10:11 AM: - an

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738343#action_12738343 ] Robert Muir commented on SOLR-1321: --- Andrzej, with the costs logic, you wouldn't have to l

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738352#action_12738352 ] Robert Muir commented on SOLR-1321: --- sounds perfect, great idea. Thanks! > Support for e

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738626#action_12738626 ] Robert Muir commented on SOLR-1321: --- Andrzej, did you accidentally leave out ReversedWildc

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12738730#action_12738730 ] Robert Muir commented on SOLR-1321: --- andrzej, thanks, I like this design. > Support for

[jira] Created: (SOLR-1336) Add support for lucene's SmartChineseAnalyzer

2009-08-05 Thread Robert Muir (JIRA)
Add support for lucene's SmartChineseAnalyzer - Key: SOLR-1336 URL: https://issues.apache.org/jira/browse/SOLR-1336 Project: Solr Issue Type: New Feature Components: Analysis

[jira] Created: (SOLR-1342) CapitalizationFilterFactory uses incorrect length calculations

2009-08-06 Thread Robert Muir (JIRA)
CapitalizationFilterFactory uses incorrect length calculations -- Key: SOLR-1342 URL: https://issues.apache.org/jira/browse/SOLR-1342 Project: Solr Issue Type: Bug Compone

[jira] Updated: (SOLR-1342) CapitalizationFilterFactory uses incorrect length calculations

2009-08-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1342: -- Attachment: SOLR-1342.patch patch attached; if it's not obvious that it's a bug i can try to create some t

[jira] Updated: (SOLR-1336) Add support for lucene's SmartChineseAnalyzer

2009-08-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1336: -- Attachment: SOLR-1336.patch patch, needs lucene-smartcn-2.9-dev.jar added to lib to work (this analyzer

[jira] Commented: (SOLR-1336) Add support for lucene's SmartChineseAnalyzer

2009-08-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12740988#action_12740988 ] Robert Muir commented on SOLR-1336: --- {quote} Are the stopwords (words="org/apache/lucene/a

[jira] Updated: (SOLR-1336) Add support for lucene's SmartChineseAnalyzer

2009-08-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1336: -- Attachment: SOLR-1336.patch add warning about large dictionaries, note that stopwords are being loaded f

[jira] Commented: (SOLR-1353) implement reusable token streams for all Solr tokenizers / token filters

2009-08-09 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12741134#action_12741134 ] Robert Muir commented on SOLR-1353: --- Yonik, at least in the case of analyzer class=xxx, I

[jira] Created: (SOLR-1356) Add support for Lucene's persian analysis

2009-08-10 Thread Robert Muir (JIRA)
Add support for Lucene's persian analysis - Key: SOLR-1356 URL: https://issues.apache.org/jira/browse/SOLR-1356 Project: Solr Issue Type: New Feature Components: Analysis Reporter

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-12 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742451#action_12742451 ] Robert Muir commented on SOLR-1321: --- i do have one comment on the reverse() present here:

[jira] Commented: (SOLR-1321) Support for efficient leading wildcards search

2009-08-12 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742611#action_12742611 ] Robert Muir commented on SOLR-1321: --- btw, i found apache harmony has a nice impl of in-pla

[jira] Created: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-13 Thread Robert Muir (JIRA)
WordDelimiterFilter position increment bug -- Key: SOLR-1362 URL: https://issues.apache.org/jira/browse/SOLR-1362 Project: Solr Issue Type: Bug Components: Analysis Reporter: Robe

[jira] Updated: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1362: -- Attachment: SOLR-1362.patch > WordDelimiterFilter position increment bug > -

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742983#action_12742983 ] Robert Muir commented on SOLR-1362: --- yonik, maybe: i am unable to tell from docs/tests if

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742995#action_12742995 ] Robert Muir commented on SOLR-1362: --- fyi this line of code was changed from = to += in SOL

[jira] Commented: (SOLR-1353) implement reusable token streams for all Solr tokenizers / token filters

2009-08-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12743858#action_12743858 ] Robert Muir commented on SOLR-1353: --- seems to almost double throughput... how does this co

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745186#action_12745186 ] Robert Muir commented on SOLR-1362: --- Yonik, in this case I think existing gaps would be pr

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-20 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745520#action_12745520 ] Robert Muir commented on SOLR-1362: --- ah, i see your point... sounds right to me. i can r

[jira] Updated: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-20 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1362: -- Attachment: SOLR-1362_tests.txt I started working on a patch, but found the existing behavior to be more

[jira] Updated: (SOLR-1356) Add support for Lucene's persian analysis

2009-08-20 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1356: -- Attachment: SOLR-1356.patch factory for the filter, and schema.xml examples (maybe unnecessary, feel fre

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-25 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747655#action_12747655 ] Robert Muir commented on SOLR-1362: --- Yonik, thanks! I will work on the skipped token subtr

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-25 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747752#action_12747752 ] Robert Muir commented on SOLR-1362: --- bq. I had implemented the "remove normal posIncr" thi

[jira] Commented: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-25 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747754#action_12747754 ] Robert Muir commented on SOLR-1362: --- actually one last thing Yonik, at the beginning of th

[jira] Updated: (SOLR-1362) WordDelimiterFilter position increment bug

2009-08-25 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1362: -- Attachment: SOLR-1362.patch patch that moves the protWords check below the posInc calculation, and sets