Re: Advanced NearSpanQuery

2011-07-13 Thread Mike Sokolov
Sorry for the misdirection ... On 07/13/2011 11:37 AM, Simon Willnauer wrote: I don't think this is possible with spans today. Once https://issues.apache.org/jira/browse/LUCENE-2878 is due this should be possible with a boolean query, I think. To work around this you need to write a SpanOR query

Re: Advanced NearSpanQuery

2011-07-13 Thread Simon Willnauer
I don't think this is possible with spans today. Once https://issues.apache.org/jira/browse/LUCENE-2878 is due this should be possible with a boolean query, I think. To work around this you need to write a SpanOR query with a minShouldMatch functionality though. Simon On Wed, Jul 13, 2011 at 5:09

RE: Advanced NearSpanQuery

2011-07-13 Thread Jeroen Lauwers
Hi Mike, Thanks for your quick reply, but I cannot seem to find any documentation on "DisjunctionSumQuery" and I'm not familiar with that concept. Could you point me in the right direction? Jeroen -----Original Message----- From: Mike Sokolov [mailto:soko...@ifactory.com] Sent: Wednesday, 13 July

Re: Advanced NearSpanQuery

2011-07-13 Thread Mike Sokolov
Can you wrap a SpanNearQuery around a DisjunctionSumQuery with minNrShouldMatch=8? -Mike On 07/13/2011 08:53 AM, Jeroen Lauwers wrote: Hi, I was wondering if anyone could help me on this: I want to search for: 1. a set of words (e.g. 10) 2. only a couple of words may come in be

Advanced NearSpanQuery

2011-07-13 Thread Jeroen Lauwers
Hi, I was wondering if anyone could help me on this: I want to search for: 1. a set of words (e.g. 10) 2. only a couple of words may come in between (e.g. 3) in the result document 3. of the supplied set of (10) words, at least 8 must be present (or in other words: 2 of the s
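
The three requirements above can be pinned down with a small brute-force matcher. This is only a plain-Python sketch to make the semantics concrete, not Lucene code and not the approach anyone in the thread proposed. The function name `matches` and its parameters `min_match` and `max_gaps` are hypothetical, and it assumes that "words in between" means the total number of non-query tokens inside the matching window, and that word order does not matter.

```python
def matches(tokens, query_words, min_match=8, max_gaps=3):
    """Return True if some window of `tokens` starts and ends on a query word,
    contains at least `min_match` distinct query words, and holds at most
    `max_gaps` tokens that are not query words.

    Assumption (not from the thread): gaps are counted in total over the
    whole window, and the query words may appear in any order.
    """
    wanted = set(query_words)
    for start, first in enumerate(tokens):
        if first not in wanted:
            continue  # a window must begin on a query word
        seen = set()
        gaps = 0
        for tok in tokens[start:]:
            if tok in wanted:
                seen.add(tok)
                if len(seen) >= min_match:
                    return True  # window ends here, on a query word
            else:
                gaps += 1
                if gaps > max_gaps:
                    break  # too many intervening words, try the next start
    return False


query = list("abcdefghij")  # 10 stand-in query words
print(matches("a b x c d e y f g h".split(), query))      # 8 words, 2 gaps
print(matches("a b x c d w e y f z g h".split(), query))  # 8 words, 4 gaps
```

The scan restarts from every query-word position, so it is quadratic in the worst case; it is meant for checking expectations on small inputs, not for searching an index.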

Performance question

2011-07-13 Thread Mihai Caraman
Hello, My name is Mihai and I'm trying to write a Java (later I'll need to port it to PyLucene) search on billions of mentions like Twitter statuses. Mentions are grouped by some containing keywords. I'm thinking of partitioning the index for faster results as follows:

Re: searching/filtering by field's value length

2011-07-13 Thread Ian Lea
Store the string length as a NumericField with each document and use NumericRangeQuery. There are examples in the javadocs. -- Ian. On Tue, Jul 12, 2011 at 10:57 PM, Federico Schroder wrote: > Hello, > I need some help doing a search. > Say I have a really simple document structure, just 1 fie
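
The idea in this reply, computing the length once at index time and answering length constraints with a numeric range lookup, can be illustrated without Lucene. The following is a plain-Python sketch under that assumption; the class `LengthIndex` and its methods are hypothetical names and do not correspond to the NumericField or NumericRangeQuery APIs mentioned above.

```python
import bisect


class LengthIndex:
    """Toy index that stores each document's string length at add time,
    so that length filters become range lookups over a sorted list."""

    def __init__(self):
        self._lengths = []  # sorted string lengths
        self._doc_ids = []  # document ids, parallel to _lengths

    def add(self, doc_id, text):
        # Compute the length once, when the document is indexed.
        n = len(text)
        pos = bisect.bisect_right(self._lengths, n)
        self._lengths.insert(pos, n)
        self._doc_ids.insert(pos, doc_id)

    def range_query(self, min_len, max_len):
        # Inclusive on both ends.
        lo = bisect.bisect_left(self._lengths, min_len)
        hi = bisect.bisect_right(self._lengths, max_len)
        return self._doc_ids[lo:hi]


index = LengthIndex()
for doc_id, text in [(1, "abc"), (2, "abcdef"), (3, "a"), (4, "abcd")]:
    index.add(doc_id, text)
print(index.range_query(3, 4))  # ids of documents whose text is 3 or 4 characters long
```

The point of the design is that the search never inspects the stored strings themselves: the length is an ordinary numeric value attached to each document, so "length between 3 and 4" is answered the same way as any other numeric range.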

Snowball analyzers and stop words?

2011-07-13 Thread Christof Schablinski
Hello, is there a reason why the Snowball analyzers don't have a default set of stop words although there are stop word files in the Snowball repository? At least for English as posted in an earlier message here: http://svn.tartarus.org/snowball/trunk/website/algorithms/english/stop.txt?view=

Re: AutomatonQuery Caching

2011-07-13 Thread René Hackl
Hi Simon, Thanks for the tip! Unfortunately I am running this on Windows. Tried a couple of command line tools / memory clearing apps without success. Will transfer stuff to another machine and see how that pans out. Thanks again, Rene Original Message > Date: Tue, 12 Jul 201