On Aug 15, 2007, at 10:46 AM, Peter Keegan wrote:
Grant,
I built an index as described here:
http://www.nabble.com/SpanQuery-and-database-join-tf4262902.html
Many documents have only 1 or 2 rows, some have dozens.
Here is a typical query without spans:
+((+contents:quaker +contents:cereal) (+boost50:quaker +boost50:cereal))
+literals:co$us), so
Hi Peter,
Could you give more details on this test? What are you comparing,
etc.? Sample queries would be good. I would like to write up a
contrib/Benchmark algorithm to begin investigating this and see if
there is anything that can be done.
Thanks,
Grant
On Aug 10, 2007, at 6:27 PM,
ok, glad we're on the same page.
I did some performance testing with span queries and, unfortunately, the
results are discouraging for my intended use. When I added a simple
SpanNearQuery to existing queries, the throughput decreased by a factor of
10+. I figured spans would be expensive, but not
Sorry for the confusion. I thought you just wanted access to the
term info per position. I think we will have to add something to
the Spans like we talked about before.
-Grant
On Aug 10, 2007, at 11:03 AM, Peter Keegan wrote:
Grant,
I'm afraid I don't understand how to use this mapper in the context of a
SpanQuery. It seems like I would have to modify SpanScorer to fetch payload
data and provide a new method to access the payloads while iterating through
the documents. If this can be accomplished without modifying Span
Hi Grant,
I'm hoping to check this out soon.
Thanks,
Peter
On 8/7/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
Hi Peter,
Give https://issues.apache.org/jira/browse/LUCENE-975 a try. It
provides a TermVectorMapper that loads by position.
Still not ideally what you want, but I haven't had time to scope
that one out yet.
-Grant
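
To make the "loads by position" idea concrete, here is a minimal, library-free sketch. It is not the LUCENE-975 API: the class name PositionOrderedTerms and both method names are made up for illustration, and a term vector is modelled as a plain map from term text to its positions. The sketch inverts that map into a position-ordered view and then reads back the terms that fall inside a span's position range, assuming the end position is exclusive.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical illustration only; none of these names come from Lucene.
class PositionOrderedTerms {

    // Invert term -> positions into position -> term, sorted by position.
    // Assumes at most one term per position (no stacked synonyms).
    static SortedMap<Integer, String> byPosition(Map<String, int[]> termPositions) {
        SortedMap<Integer, String> ordered = new TreeMap<Integer, String>();
        for (Map.Entry<String, int[]> entry : termPositions.entrySet()) {
            for (int pos : entry.getValue()) {
                ordered.put(pos, entry.getKey());
            }
        }
        return ordered;
    }

    // Join the terms whose positions lie in [start, end).
    static String termsInRange(SortedMap<Integer, String> ordered, int start, int end) {
        StringBuilder sb = new StringBuilder();
        for (String term : ordered.subMap(start, end).values()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(term);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        Map<String, int[]> termVector = new HashMap<String, int[]>();
        termVector.put("quaker", new int[] {0, 3});
        termVector.put("oats", new int[] {1});
        termVector.put("and", new int[] {2});
        termVector.put("cereal", new int[] {4});

        SortedMap<Integer, String> ordered = byPosition(termVector);
        // A span covering positions 3 and 4.
        System.out.println(termsInRange(ordered, 3, 5)); // prints "quaker cereal"
    }
}
```

A sorted map keeps the range lookup cheap once the view is built, but building it touches every position in the document, so doing this per hit could be costly for large documents.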
On Jul 24, 2007, at 6:02 PM, Peter Keegan wrote:
Hi Grant,
No problem - I know you are very busy. I just wanted to get a sense for the
timing because I'd like to use this for a release this Fall. If I can get a
prototype working in the coming weeks AND the performance is great :) , this
would be terrific. If not, I'll have to fall back on a mo
Sorry, Peter, I haven't had a chance to work on it. I don't see it
happening this week, but maybe next.
I do think the Mapper approach via TermVectors will work. It will
require implementing a new mapper that orders by position, but I
don't think that is too hard. I started on one on th
Any idea on when this might be available (days, weeks...)?
Peter
On 7/16/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
On Jul 16, 2007, at 1:06 AM, Chris Hostetter wrote:
: Do we have a best practice for going from, say a SpanQuery doc/
: position information and retrieving the actual range of positions of
: content from the Document? Is it just to reanalyze the Document
: using the appropriate Analyzer and start recording once you hit the
: positions you are inte
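
The re-analysis approach raised in the question can be sketched roughly as follows. This is a library-free illustration under stated assumptions, not Lucene code: a lowercasing whitespace split stands in for the real Analyzer, every token advances the position by one, the span's end position is treated as exclusive, and the names SpanTextRecovery and tokensInSpan are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; a whitespace split replaces the real Analyzer.
class SpanTextRecovery {

    // Re-tokenize the stored text and record tokens once the position counter
    // reaches start, stopping at end (exclusive).
    static List<String> tokensInSpan(String text, int start, int end) {
        List<String> hits = new ArrayList<String>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return hits;
        }
        int pos = 0;
        for (String token : trimmed.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (pos >= end) {
                break; // past the span, no need to analyze the rest
            }
            if (pos >= start) {
                hits.add(token);
            }
            pos++;
        }
        return hits;
    }

    public static void main(String[] args) {
        String stored = "Quaker Oats and Quaker cereal";
        // A span covering positions 3 and 4.
        System.out.println(tokensInSpan(stored, 3, 5)); // prints "[quaker, cereal]"
    }
}
```

This only lines up with the index if the stand-in tokenizer assigns the same positions as the Analyzer used at indexing time, which is why the question specifies using the appropriate Analyzer.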