Re: Look for strange encodings -- tokenization

2007-09-05 Thread poeta simbolista
contains many unseen terms that may mean encoding problems? Also, what I would like is to at least be able to measure the impact of such problems, so I can decide whether the effort will pay off :) Cheers P Steven Rowe wrote: > > poeta simbolista wrote: >> I'd want to kno

Look for strange encodings -- tokenization

2007-09-04 Thread poeta simbolista
Hi all, I'd like to know the best way to look for strange encodings in a Lucene index. I have several inputs that may have been encoded in different character sets. I do not always know whether my guess about the encoding was right. Hence, I thought of querying the index for some typical strings that wo
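A minimal sketch of the "typical strings" idea, assuming the index terms have already been dumped into a plain list of strings (how that dump is produced is not shown here). The pattern and the helper names `suspicious_terms` and `suspicious_ratio` are hypothetical, not part of Lucene. The heuristic only catches two common symptoms: the Unicode replacement character, and UTF-8 bytes that were decoded as Latin-1.

```python
import re

# Heuristic patterns (assumptions, not exhaustive):
#  - U+FFFD, the replacement character emitted when decoding fails
#  - U+00C2 or U+00C3 followed by a character in U+0080..U+00BF, the
#    typical result of UTF-8 bytes decoded as Latin-1
SUSPICIOUS = re.compile("\ufffd|[\u00c2\u00c3][\u0080-\u00bf]")


def suspicious_terms(terms):
    """Return the terms that match one of the heuristic patterns."""
    return [t for t in terms if SUSPICIOUS.search(t)]


def suspicious_ratio(terms):
    """Fraction of terms that look mis-decoded (0.0 for an empty list)."""
    terms = list(terms)
    if not terms:
        return 0.0
    return len(suspicious_terms(terms)) / len(terms)
```

The ratio gives a rough measure of how much of the index is affected, which may help decide whether re-indexing with a different encoding guess is worth the effort.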

Re: FieldCacheImpl mistake?

2007-02-15 Thread poeta simbolista
Chris Hostetter wrote: > > > : i suggest it could be defined a (e.g.) NOT_FOUND_FLOAT_VALUE, > modifiable, > : so it does not necessarily have to be 0. And used for initialise the > array > : of floats before processing (or do it on the loop if better performance > can > : be reached). > :

Re: Extending Query, Weight, Scorer

2007-02-15 Thread poeta simbolista
Chris Hostetter wrote: > > your Scorer.next method should be skipping over documents that you do not > consider "matches" to your search. > But since the value in the float[] is the same whether a document has the field with value '0' or has no such field, how can I tell one from the other? Chris Hostet

Re: FieldCacheImpl mistake?

2007-02-14 Thread poeta simbolista
this (faster) comparison can be done. > > Otis > . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . > . . . > Simpy -- http://www.simpy.com/ - Tag - Search - Share > > - Original Message > From: poeta simbolista <[EMAIL PROTECTED]>

FieldCacheImpl mistake?

2007-02-14 Thread poeta simbolista
Hi guys, I have been diving into the FieldCacheImpl code. I have seen something in the current version (Revision 488908, modified Wed Dec 20 03:47:09 2006 UTC by yonik, file length 13425 bytes) that I wonder if it's not totally right, or if

FieldCacheImpl not "extendible"

2007-02-14 Thread poeta simbolista
Hi, I have been diving into the code and I don't see why the class FieldCacheImpl is not extendible. It is not defined as a public class, though I would like to be able to subclass it to change a small part. Why is it defined like that? Thanks -- View this message in context: http://www.nab

Extending Query, Weight, Scorer

2007-02-14 Thread poeta simbolista
Hi, I have created a Query that works for numerical max-min ranges and should work for any Field specified. I have done that by extending Query and creating my own Weight and Scorer subclasses as well. So it works ... but I have problems when setting the min or max boundary to 0: In this case, those ent

Re: How to return results with null values?

2007-02-08 Thread poeta simbolista
Chris Hostetter wrote: > > I'm not sure whether this question is about docs that have no value for a > field, or docs where the value of the field is null - > The former. Chris Hostetter wrote: > > adding a filter on that Field that requires *some* value might help. > Yep, that is what I

Re: How to return results with null values?

2007-02-07 Thread poeta simbolista
g > > > To > java-user@lucene.apache.org > cc > > Subject > Re: How to return results with null values? > > > > > > > My guess is that if the field has a null empty, don't put the field in the > > index. > > > >

How to return results with null values?

2007-02-07 Thread poeta simbolista
Hi, I would like to run a query and retrieve the results that have null values in a specific field. How can I do that? Thanks -- View this message in context: http://www.nabble.com/How-to-return-results-with-null-values--tf3186799.html#a8844779 Sent from the Lucene - Java Users mailing list archive at Nabble.com. -

Re: deviations

2007-02-07 Thread poeta simbolista
> > You will have to calculate your deviations separately, but the field > size info should help. > > -Grant > > > On Feb 6, 2007, at 12:13 PM, poeta simbolista wrote: > >> >> Hi, >> I would like to query results, regarding results. >> For examp

deviations

2007-02-06 Thread poeta simbolista
Hi, I would like to query results based on statistics of the results themselves. For example, get those results whose field "size" is 2 standard deviations from the median. It looks like something I should do myself: process the results first, get the median, then calculate that threshold and perform a new query. Is it h
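A minimal sketch of the two-pass approach described above, assuming the "size" values from a first query have already been collected into a plain list. The function names `deviation_bounds` and `outliers` are hypothetical and not part of Lucene. The standard deviation here is the population standard deviation (computed around the mean), which is one possible reading of the request.

```python
import statistics


def deviation_bounds(sizes, k=2.0):
    """Return (low, high) bounds lying k standard deviations around the median."""
    median = statistics.median(sizes)
    # Population standard deviation of the collected values.
    spread = statistics.pstdev(sizes)
    return median - k * spread, median + k * spread


def outliers(sizes, k=2.0):
    """Return the values falling outside the bounds computed above."""
    low, high = deviation_bounds(sizes, k)
    return [s for s in sizes if s < low or s > high]
```

The two bounds could then be used as the limits of a second, range-style query on the same field, so the statistics are computed outside the index and only the threshold is passed back in.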

query problem: grouping "with same field"

2007-02-06 Thread poeta simbolista
Hi all, First of all, thanks for this forum; I have read stuff here that helped me a lot. Second, I would like to query a Lucene index in this way: I want to do operations on groups of entries. I want these entries grouped by a field (called websiteid), so the queries will only go to those

Use of only a prohibit search

2007-02-01 Thread poeta simbolista
Hi guys, I've read on this forum (http://www.nabble.com/Using-NOT-queries-inside-parentheses-tf1234775.html#a3272973) about the problem of using only a prohibit search, for example -description:plot. I would really like to get this done properly; I am constructing queries where e

Why this query is not correct?

2007-01-30 Thread poeta simbolista
Hi guys, I have been through the docs and I can't see why the parser does not parse this correctly: description:*sql is not correct: Lexical error at line 1, column 16. Encountered: "*" (42), after : "" However, the following: description: sql* is correct. Any idea why you can't use wildca

Re: Problem with lucene.

2007-01-30 Thread poeta simbolista
> allow you to examine your index, parse queries through the GUI, examine > the > effects of different analyzers on input etc. It's a great tool and one > that'll make your life much easier. > > Best > Erick > > > On 1/29/07, poeta simbolista <[EM

Problem with lucene.

2007-01-29 Thread poeta simbolista
Hi there, this is my very first post on this forum... please be considerate :) Well, I have a problem when sending a query such as: +description:< Once the query is parsed, it returns an empty String, which means the String "<" that I want to search for in the field description is ignored.