[jira] Closed: (LUCENE-1673) Move TrieRange to core

2009-06-19 Thread Uwe Schindler (JIRA)
contrib in revision 786474 Thanks for all your help! I will now open another issue for the remaining (optional) tasks Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-18 Thread Uwe Schindler (JIRA)
the parsers to FieldCache and make the plain-text-number parsers public there, too. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-18 Thread Michael McCandless (JIRA)
can separately tweak the javadocs... Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
for cutting over contrib/spacial to NumericUtils Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
in stored fields file and the index format is unchanged. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Uwe Schindler (JIRA)
back-compat issues, so it could be added any time - no need to link it to this issue or to rush it. I think the same, I should first resolve this and open some more issues :-) Move TrieRange to core -- Key: LUCENE-1673 URL: https

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Michael McCandless (JIRA)
of these (NumericField, NumericSortField) are important to do for 2.9. Maybe others (adding support for the missing numeric types (byte short)) can wait. Let's wrap this one up and move onto the next ones ;) Move TrieRange to core -- Key: LUCENE-1673 URL

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Uwe Schindler (JIRA)
a short note to package.html in analysis and search. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Michael McCandless (JIRA)
section detailing how it works, what precisionStep means (and tradeoffs of high/low values for it), the reference to the full paper, etc. But we can iterate on the javadocs in the separate issue, too. Move TrieRange to core -- Key: LUCENE-1673

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Michael McCandless (JIRA)
would return a NumericField. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Uwe Schindler (JIRA)
:00 on the following day exclusive. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Michael McCandless (JIRA)
on the day to 0:00 on the following day exclusive. Couldn't we have a NumericTermQuery for such cases? You have the full precision term in the index... Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Uwe Schindler (JIRA)
downto the millisecond). The important thing is: the lower precision terms are not at common date boundaries. Because of this different use cases, in my opinion, DateTools has its usage. Move TrieRange to core -- Key: LUCENE-1673 URL: https

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Yonik Seeley (JIRA)
of fields... they aren't coupled now except when the generic format of the index changes (like omitNorms, omitTf, indexed, etc). Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Michael McCandless (JIRA)
things are lost from your index time doc (a well known issue at this point!), but I don't think we should intentionally make that behavior even more buggy, if we can help it... Move TrieRange to core -- Key: LUCENE-1673 URL: https

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Yonik Seeley (JIRA)
in Lucene... changing that and encoding types into the index deserves it's own issue and discussion (and something big like that doesn't seem to belong in 2.9 which is winding down). Move TrieRange to core -- Key: LUCENE-1673 URL: https

[jira] Issue Comment Edited: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Yonik Seeley (JIRA)
everything else does in Lucene... changing that and encoding types into the index deserves it's own issue and discussion (and something big like that doesn't seem to belong in 2.9 which is winding down). Move TrieRange to core -- Key: LUCENE-1673

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-16 Thread Earwin Burrfoot (JIRA)
and put it in contribs. If it's hard to build - Lucene core is to blame, it's not extensible enough. From my experience, for that purporse it's okay as it is. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler (JIRA)
to be openend after this. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects Versions: 2.9

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Michael McCandless (JIRA)
these numeric types too...). Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Michael McCandless (JIRA)
it util seems OK, since it's used by analysis searching. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Yonik Seeley (JIRA)
the right parser? A factory method TrieUtils.getSortField() could also return the right SortField. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler (JIRA)
, but as the code was not yet released (in Lucene), the Lucene version should not use NumberUtils at all. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler (JIRA)
() in NumericTokenStream? This makes it really easy to index. The only good thing of NumericField would be the possibility to automatically disable TF and Norms per default when indexing. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler (JIRA)
in complete, its really useless. Maybe, I add a getShift() method to NumericUtils, that returns the shift value of a Token/String. See java-dev mailing with Yonik. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Michael McCandless (JIRA)
be the possibility to automatically disable TF and Norms per default when indexing. Consumability (good defaults)! (And also not having to know that you must go and get a tokenStream from NumericUtils). Move TrieRange to core -- Key: LUCENE-1673

Re: [jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Mark Miller
Michael McCandless (JIRA) wrote: We're forking off new 2.9 issues left and right here!! Evil :) You guys are like small team working against me. We still have 29+- issue to wrap up though, so probably plenty of time. I hope we can set a rough target date soon though - it really feels like

Re: [jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Michael McCandless
On Mon, Jun 15, 2009 at 4:42 PM, Mark Millermarkrmil...@gmail.com wrote: Remember the last time we started to push for 2.9 in Dec/Jan :) Yes this is very much on my mind too!! So maybe, it's a race between the trie* group of issues, and the other 28 ;) Mike

RE: [jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler
PM To: java-dev@lucene.apache.org Subject: Re: [jira] Commented: (LUCENE-1673) Move TrieRange to core On Mon, Jun 15, 2009 at 4:42 PM, Mark Millermarkrmil...@gmail.com wrote: Remember the last time we started to push for 2.9 in Dec/Jan :) Yes this is very much on my mind too!! So

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Uwe Schindler (JIRA)
about it - [unneeded SortField factory, parsers] - extra issue, maybe after 3.0 Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-15 Thread Yonik Seeley (JIRA)
-compat issues, so it could be added any time - no need to link it to this issue or to rush it. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
, as TRIE no longer used) and the parser instances can be added to FieldCache. For indexing or querying it is not required for end users, one can use NumericTokenStream and NumericRangeQuery for all his needs. So NumberUtils is more internal than before. Any thoughts? Move TrieRange to core

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
encoding name from TrieUtils) - the default parsers (private) in FieldCache renaming to also PlainText* (but accessible) - add TrieUtils.XxxParser to FieldCache (but accessible) Move TrieRange to core -- Key: LUCENE-1673 URL: https

[jira] Issue Comment Edited: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
(private) in FieldCache renaming to also PlainText* (but accessible) - add TrieUtils.XxxParser to FieldCache (but accessible) Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673

[jira] Issue Comment Edited: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
changes here. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects Versions: 2.9

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Yonik Seeley (JIRA)
an analysis.numeric package? As a general comment, moving TrieRange to core should be moving it to the core and perhaps renaming the classes if we can think of a better name. Some of the other stuff belongs in a different issue. Move TrieRange to core -- Key

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
, moving TrieRange to core should be moving it to the core and perhaps renaming the classes if we can think of a better name. Some of the other stuff belongs in a different issue. I think this is correct. I will post a patch soon, that leaves TrieUtils alive. Move TrieRange to core

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Yonik Seeley (JIRA)
and friends). Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects Versions: 2.9

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
people from doing things like new NumericRangeQuery(field,precStep,new Long(val1),new Float(val2)) which may lead to undefined behaviour. The second problem is missing type safety with auto-boxing in Java 5. Move TrieRange to core -- Key: LUCENE-1673

[jira] Issue Comment Edited: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
)) which may lead to undefined behaviour. The second problem is missing type safety with auto-boxing in Java 5. Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene

[jira] Issue Comment Edited: (LUCENE-1673) Move TrieRange to core

2009-06-14 Thread Uwe Schindler (JIRA)
of java.lang.Number? Because type safety to prevent people from doing things like new NumericRangeQuery(field,precStep,new Long(val1),new Float(val2)) which may lead to undefined behaviour. The second problem is missing type safety with auto-boxing in Java 5. Move TrieRange to core

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-10 Thread Michael McCandless (JIRA)
. NumericRangeQuery.newFloatRange(Float a, Float b, precisionStep) and so on. Could we also do this for a term range? Then, we could have a single RangeQuery that rewrites to the right impl based on what kind of range you are doing? (And in fact it could fold in FieldCacheRangeFilter too). Move TrieRange to core

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-09 Thread Michael McCandless (JIRA)
, with this change. EG TermRangeQuery... to emphasize that you use it for non-numbers. The javadocs of TermRangeQuery should point to Int/LongRangeQuery as strongly preferred for numeric ranges. Cool. For the others, too (FieldCacheRangeQuery). {quote} Yes. Move TrieRange to core

Re: [jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-09 Thread Jason Rutherglen
, with this change. EG TermRangeQuery... to emphasize that you use it for non-numbers. The javadocs of TermRangeQuery should point to Int/LongRangeQuery as strongly preferred for numeric ranges. Cool. For the others, too (FieldCacheRangeQuery). {quote} Yes. Move TrieRange to core

RE: [jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-09 Thread Uwe Schindler
http://www.thetaphi.de eMail: u...@thetaphi.de _ From: Jason Rutherglen [mailto:jason.rutherg...@gmail.com] Sent: Tuesday, June 09, 2009 8:48 PM To: java-dev@lucene.apache.org Subject: Re: [jira] Commented: (LUCENE-1673) Move TrieRange to core I wonder if we could handle this by adding

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-02 Thread Michael McCandless (JIRA)
compile, because the setValue method is not available for TokenStream; the stream should be defined as LongTrieTokenStream, I think?; same with IntTrieTokenStream) Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-02 Thread Uwe Schindler (JIRA)
} Cool. For the others, too (FieldCacheRangeQuery). There is a lot more to decide, I will keep this issue open a little bit before starting to work to collect ideas! Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira

[jira] Created: (LUCENE-1673) Move TrieRange to core

2009-06-01 Thread Uwe Schindler (JIRA)
Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New Feature Components: Search Affects Versions: 2.9 Reporter

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-01 Thread Earwin Burrfoot (JIRA)
, you won't be bound by any other back-compat policies besides common sense. :) Move TrieRange to core -- Key: LUCENE-1673 URL: https://issues.apache.org/jira/browse/LUCENE-1673 Project: Lucene - Java Issue Type: New

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-01 Thread Paul Elschot (JIRA)
putting everything in o.a.l.trie . I'd prefer to have explicit class names containing Long, Int etc, and also containing Trie. I don't know the details of the tokenizing, but AbstractTrieField sounds just right. Move TrieRange to core -- Key: LUCENE

Move TrieRange to Core/Module and integration issues

2009-04-13 Thread Uwe Schindler
Hi, it was discussed now many times on this list, but I did not get a solution, if we should include TrieRange into the core or not. When thinking about it and looking in the latest developments about TrieRange (TokenStreams for indexing), I plan to do the following: a) Put the classes into the

Re: Move TrieRange to Core/Module and integration issues

2009-04-13 Thread Mark Miller
Uwe Schindler wrote: MultiTermQuery has in its protected getEnum() returning FilteredTermEnum. For TrieRange, the return should be changed to TermEnum, it is not needed to have a FilteredTermEnum (FilteredTermEnum is only an implementation, the method should return an abstract TermEnum). If this

RE: Move TrieRange to Core/Module and integration issues

2009-04-13 Thread Uwe Schindler
MultiTermQuery has in its protected getEnum() returning FilteredTermEnum. For TrieRange, the return should be changed to TermEnum, it is not needed to have a FilteredTermEnum (FilteredTermEnum is only an implementation, the method should return an abstract TermEnum). If this is fixed, I

Re: Move TrieRange to Core/Module and integration issues

2009-04-13 Thread Michael McCandless
On Mon, Apr 13, 2009 at 12:05 PM, Uwe Schindler u...@thetaphi.de wrote: For me it would not be a problem, I would use a FilteredTermEnum and subclass it, but would only implement next() and the other abstract methods would be dummies (including difference() returning 1.0f). Only the enum and

RE: Move TrieRange to Core/Module and integration issues

2009-04-13 Thread Uwe Schindler
For me it would not be a problem, I would use a FilteredTermEnum and subclass it, but would only implement next() and the other abstract methods would be dummies (including difference() returning 1.0f). Only the enum and the term should have a protected access or a getter in this class.

Re: move TrieRange* to core?

2009-03-20 Thread Chris Hostetter
(resending msg from earlier today during @apache mail outage -- i didn't get a copy from the list, so i'm assuming no one did) : Date: Fri, 20 Mar 2009 16:51:05 -0700 (PDT) : : : I think we should move TrieRange* into core before 2.9? : : -0 : : I think we should try to move more things *out

RE: move TrieRange* to core?

2009-03-19 Thread Uwe Schindler
If so... maybe we could extend FieldCache's parser to allow it to stop-early? Ie it'd get the TermEnum, iterate through all the full precision terms first, asking your parser to convert to long/int, and then when your parser sees the very first not-full-precision term, it tells

Re: move TrieRange* to core?

2009-03-19 Thread Michael McCandless
Uwe Schindler u...@thetaphi.de wrote: My problem is: In my opinion, this is all to complicated and nasty and bad API design. One nice, but hackish approach would be to define FieldCache.StopFillCacheException extends RuntimeException and throw this inside the parse function, if a trie value

move TrieRange* to core?

2009-03-18 Thread Michael McCandless
I think we should move TrieRange* into core before 2.9? It's received alot of attention, from both developers (Uwe Yonik did lots of iterations, and Solr is folding it in) and user interest. It's a simpler more scalable way to index numeric fields that you intend to sort and/or do range

Re: move TrieRange* to core?

2009-03-18 Thread Andi Vajda
On Mar 18, 2009, at 13:01, Michael McCandless luc...@mikemccandless.com wrote: I think we should move TrieRange* into core before 2.9? It's received alot of attention, from both developers (Uwe Yonik did lots of iterations, and Solr is folding it in) and user interest. It's a simpler

Re: move TrieRange* to core?

2009-03-18 Thread Earwin Burrfoot
On Wed, Mar 18, 2009 at 23:08, Andi Vajda va...@osafoundation.org wrote: On Mar 18, 2009, at 13:01, Michael McCandless luc...@mikemccandless.com wrote: I think we should move TrieRange* into core before 2.9? It's received alot of attention, from both developers (Uwe Yonik did lots

RE: move TrieRange* to core?

2009-03-18 Thread Uwe Schindler
...@thetaphi.de -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Wednesday, March 18, 2009 9:02 PM To: java-dev@lucene.apache.org Subject: move TrieRange* to core? I think we should move TrieRange* into core before 2.9? It's received alot

RE: move TrieRange* to core?

2009-03-18 Thread Uwe Schindler
I think we should move TrieRange* into core before 2.9? It's received alot of attention, from both developers (Uwe Yonik did lots of iterations, and Solr is folding it in) and user interest. It's a simpler more scalable way to index numeric fields that you intend to sort and/or do

Re: move TrieRange* to core?

2009-03-18 Thread Michael McCandless
Uwe Schindler wrote: I would be happy with a renaming to NumberRangeFilter, but trie should appear somewhere in the docs. I like this approach (and referencing the original paper); I think it's important the javadocs give enough detail about how it works so that one can understand the big

Re: move TrieRange* to core?

2009-03-18 Thread Michael McCandless
Uwe Schindler wrote: I have no problem with it! Thanks! What I would like to be fixed before moving it to core is the fact that a additional helper field is needed for the trie values. If everything could be in one field and the field is still sortable, it would be fine. For that, the

Re: move TrieRange* to core?

2009-03-18 Thread Michael McCandless
Michael McCandless luc...@mikemccandless.com wrote: Though, won't this make loading the field cache more costly since you'll iterate through many more terms? Or... do the full precision fields always order above all lower precision fields across all docs? If so... maybe we could extend

RE: move TrieRange* to core?

2009-03-18 Thread Uwe Schindler
Though, won't this make loading the field cache more costly since you'll iterate through many more terms? Or... do the full precision fields always order above all lower precision fields across all docs? The highest precision terms have a shift value of 0. As the first char of the encoded

Re: move TrieRange* to core?

2009-03-18 Thread Michael McCandless
Uwe Schindler u...@thetaphi.de wrote: If so... maybe we could extend FieldCache's parser to allow it to stop-early? Ie it'd get the TermEnum, iterate through all the full precision terms first, asking your parser to convert to long/int, and then when your parser sees the very first