Hi Andrzej,

On 10/17/05 10:59 AM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote:

> Chris Mattmann wrote:
> 
>> I still get no hits. Does anybody have any clue as to what I'm doing wrong?
> 
> I have a clue (which is not the same as a solution ;-) ). Please use
> Luke and check how the terms look like in your index. The best way to do
> it is to open the index, then go to one of the documents and press
> "Reconstruct & Edit". In the dialog that pops up you will have all
> fields content, and also how they were tokenized (which is more
> important). It's possible that NutchAnalyzer swallowed some of the text
> you are looking for... you should see that in the tokenized field
> content. If your query plugin returns the clause as you wrote it, i.e.
> with at sign, dots and whatever, then a corresponding token needs to
> show up in the tokenized content - and I bet it doesn't, because it was
> broken into parts by the tokenizer...
> 

I downloaded Luke from the getopt site during the peaks of my frustration,
and then browsed my small index of 3 documents (which I can send to you in a
separate email if you want to look at it, it's real small). I  looked up the
field for "contactemail" for one of the documents in the index. I also
verified as I mentioned, that my query was being captured by the filter
correctly. For instance a query for
"contactemail:[EMAIL PROTECTED]" correctly shows up as:
"contactemail:[EMAIL PROTECTED]". When I used Luke to look up the
doc in the index, and its corresponding contactemail field, here is what it
appeared as under the "tokenized" tab:

"[EMAIL PROTECTED]"

Which is the exact same way that it was stored, and the same way that I
queried on it. So, not really sure what the problem is here. Thanks for the
suggestion, however. Any other ideas? :-)


Take care,
  Chris


______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED]
Staff Member
Modeling and Data Management Systems Section (387)
Data Management Systems and Technologies Group
 
_________________________________________________
Jet Propulsion Laboratory            Pasadena, CA
Office: 171-266B                        Mailstop:  171-246
_______________________________________________________
 
Disclaimer:  The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.
 
 



Reply via email to