[jira] Created: (LUCENE-973) Token of "" returns in CJK

2007-08-07 Thread Toru Matsuzawa (JIRA)
Token of "" returns in CJK --- Key: LUCENE-973 URL: https://issues.apache.org/jira/browse/LUCENE-973 Project: Lucene - Java Issue Type: Bug Components: Analysis Affects Versions: 2.3 Reporte

[jira] Updated: (LUCENE-973) Token of "" returns in CJK

2007-08-07 Thread Toru Matsuzawa (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Toru Matsuzawa updated LUCENE-973: -- Attachment: CJKTokenizer20070807.patch patch attached. > Token of "" returns in CJK > ---

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518162 ] Michael McCandless commented on LUCENE-847: --- This looks great Steve! More specific feeedback soon, but ...

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518165 ] Steven Parkes commented on LUCENE-847: -- I think we ideally would like concurrency to be fully in

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518184 ] Michael McCandless commented on LUCENE-847: --- Some more feedback: - Is the separate IndexMerger interfac

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518186 ] Michael McCandless commented on LUCENE-847: --- > > I think we ideally would like concurrency to be fully inde

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518210 ] Steven Parkes commented on LUCENE-847: -- Is the separate IndexMerger interface really necessary? I wrestled

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518222 ] Steven Parkes commented on LUCENE-847: -- On a related note, Mike, there a few FIXME's in IW related to useCompoun

Hudson build is back to normal: Lucene-Nightly #179

2007-08-07 Thread hudson
See http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/179/changes - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Deprecating IndexModifier

2007-08-07 Thread Ning Li
With the plan towards 3.0 release laid out, I think it's a good time to deprecate IndexModifier and eventually remove IndexModifier. The only method in IndexModifier which is not implemented in IndexWriter is "deleteDocument(int doc)". This is because of the concern that document ids are changing

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518237 ] Michael McCandless commented on LUCENE-847: --- > Looking at IW, with the new DocsWriter stuff, it looks like

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518251 ] Steven Parkes commented on LUCENE-847: -- Ah. I understand better now. I have to admit, I haven't kept up to date

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518263 ] Michael McCandless commented on LUCENE-847: --- > I'm thinking maybe a MergePolicy#useCompoundDocStore( Segmen

[jira] Created: (LUCENE-974) Remove Author tags from code

2007-08-07 Thread Grant Ingersoll (JIRA)
Remove Author tags from code Key: LUCENE-974 URL: https://issues.apache.org/jira/browse/LUCENE-974 Project: Lucene - Java Issue Type: Wish Reporter: Grant Ingersoll Assignee: Grant Ingerso

TREC Collection, NIST and Lucene

2007-08-07 Thread Grant Ingersoll
DISCLAIMER: Just to be clear, what follows is my personal opinion and in no way, shape or form reflects an official position from the Lucene project: So, now that we have all this great stuff for running TREC experiments in contrib/benchmark, I am wondering if people think it would be use

[jira] Assigned: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-550: -- Assignee: Grant Ingersoll (was: Karl Wettin) > InstantiatedIndex - faster but memory c

Re: Deprecating IndexModifier

2007-08-07 Thread Grant Ingersoll
+1 On Aug 7, 2007, at 3:37 PM, Ning Li wrote: With the plan towards 3.0 release laid out, I think it's a good time to deprecate IndexModifier and eventually remove IndexModifier. The only method in IndexModifier which is not implemented in IndexWriter is "deleteDocument(int doc)". This is bec

[jira] Updated: (LUCENE-974) Remove Author tags from code

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-974: --- Attachment: LUCENE-974.patch Remove all @author tags > Remove Author tags from code > --

[jira] Commented: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518291 ] Grant Ingersoll commented on LUCENE-550: Hey Karl, I started to look at this, but there are a few stoppers a

[jira] Created: (LUCENE-975) Position based TermVectorMapper

2007-08-07 Thread Grant Ingersoll (JIRA)
Position based TermVectorMapper --- Key: LUCENE-975 URL: https://issues.apache.org/jira/browse/LUCENE-975 Project: Lucene - Java Issue Type: New Feature Components: Store Reporter: Grant Inge

[jira] Updated: (LUCENE-975) Position based TermVectorMapper

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-975: --- Attachment: LUCENE-975.patch First attempt at a position based TermVectorMapper. I believe i

Re: Best Practices for getting Strings from a position range

2007-08-07 Thread Grant Ingersoll
Hi Peter, Give https://issues.apache.org/jira/browse/LUCENE-975 a try. It provides a TermVectorMapper that loads by position. Still not what ideally what you want, but I haven't had time to scope that one out yet., -Grant On Jul 24, 2007, at 6:02 PM, Peter Keegan wrote: Hi Grant, No