Re: Best Practices for getting Strings from a position range

2007-08-15 Thread Grant Ingersoll
On Aug 15, 2007, at 10:46 AM, Peter Keegan wrote: Grant, I built an index as described here: http://www.nabble.com/SpanQuery-and-database-join-tf4262902.html Many documents have only 1 or 2 rows, some have dozens. Here is a typical query without spans: +((+contents:quaker +contents:cereal) (

Re: Best Practices for getting Strings from a position range

2007-08-15 Thread Peter Keegan
Grant, I built an index as described here: http://www.nabble.com/SpanQuery-and-database-join-tf4262902.html Many documents have only 1 or 2 rows, some have dozens. Here is a typical query without spans: +((+contents:quaker +contents:cereal) (+boost50:quaker +boost50:cereal)) +literals:co$us), so

Re: Best Practices for getting Strings from a position range

2007-08-14 Thread Grant Ingersoll
Hi Peter, Could you give more details on this test? What are you comparing, etc.? Sample queries would be good. I would like to write up a contrib/Benchmark algorithm to begin investigating this and see if there is anything that can be done. Thanks, Grant On Aug 10, 2007, at 6:27 PM,

Re: Best Practices for getting Strings from a position range

2007-08-10 Thread Peter Keegan
ok, glad we're on the same page. I did some performance testing with span queries and, unfortunately, the results are discouraging for my intended use. When I added a simple SpanNearQuery to existing queries, the throughput decreased by a factor of 10+. I figured spans would be expensive, but not

Re: Best Practices for getting Strings from a position range

2007-08-10 Thread Grant Ingersoll
Sorry for the confusion. I thought you just wanted access to the term info per position.I think we will have to add something to the Spans like we talked about before. -Grant On Aug 10, 2007, at 11:03 AM, Peter Keegan wrote: Grant, I'm afraid I don't understand how to use this mapper

Re: Best Practices for getting Strings from a position range

2007-08-10 Thread Peter Keegan
Grant, I'm afraid I don't understand how to use this mapper in the context of a SpanQuery. It seems like I would have to modify SpanScorer to fetch payload data and provide a new method to access the payloads while iterating through the documents. If this can be accomplished without modifying Span

Re: Best Practices for getting Strings from a position range

2007-08-09 Thread Peter Keegan
Hi Grant, I'm hoping to check this out soon. Thanks, Peter On 8/7/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > > Hi Peter, > > Give https://issues.apache.org/jira/browse/LUCENE-975 a try. It > provides a TermVectorMapper that loads by position. > > Still not what ideally what you want, but

Re: Best Practices for getting Strings from a position range

2007-08-07 Thread Grant Ingersoll
Hi Peter, Give https://issues.apache.org/jira/browse/LUCENE-975 a try. It provides a TermVectorMapper that loads by position. Still not what ideally what you want, but I haven't had time to scope that one out yet., -Grant On Jul 24, 2007, at 6:02 PM, Peter Keegan wrote: Hi Grant, No

Re: Best Practices for getting Strings from a position range

2007-07-24 Thread Peter Keegan
Hi Grant, No problem - I know you are very busy. I just wanted to get a sense for the timing because I'd like to use this for a release this Fall. If I can get a prototype working in the coming weeks AND the performance is great :) , this would be terrific. If not, I'll have to fall back on a mo

Re: Best Practices for getting Strings from a position range

2007-07-24 Thread Grant Ingersoll
Sorry, Peter, I haven't had a chance to work on it. I don't see it happening this week, but maybe next. I do think the Mapper approach via TermVectors will work. It will require implementing a new mapper that orders by position, but I don't think that is too hard. I started on one on th

Re: Best Practices for getting Strings from a position range

2007-07-23 Thread Peter Keegan
Any idea on when this might be available (days, weeks...)? Peter On 7/16/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: On Jul 16, 2007, at 1:06 AM, Chris Hostetter wrote: > > : Do we have a best practice for going from, say a SpanQuery doc/ > : position information and retrieving the actual

Re: Best Practices for getting Strings from a position range

2007-07-16 Thread Grant Ingersoll
On Jul 16, 2007, at 1:06 AM, Chris Hostetter wrote: : Do we have a best practice for going from, say a SpanQuery doc/ : position information and retrieving the actual range of positions of : content from the Document? Is it just to reanalyze the Document : using the appropriate Analyzer and s

Re: Best Practices for getting Strings from a position range

2007-07-15 Thread Chris Hostetter
: Do we have a best practice for going from, say a SpanQuery doc/ : position information and retrieving the actual range of positions of : content from the Document? Is it just to reanalyze the Document : using the appropriate Analyzer and start recording once you hit the : positions you are inte