Hi Andrzej,
On 10/17/05 10:59 AM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote: > Chris Mattmann wrote: > >> I still get no hits. Does anybody have any clue as to what I'm doing wrong? > > I have a clue (which is not the same as a solution ;-) ). Please use > Luke and check how the terms look like in your index. The best way to do > it is to open the index, then go to one of the documents and press > "Reconstruct & Edit". In the dialog that pops up you will have all > fields content, and also how they were tokenized (which is more > important). It's possible that NutchAnalyzer swallowed some of the text > you are looking for... you should see that in the tokenized field > content. If your query plugin returns the clause as you wrote it, i.e. > with at sign, dots and whatever, then a corresponding token needs to > show up in the tokenized content - and I bet it doesn't, because it was > broken into parts by the tokenizer... > I downloaded Luke from the getopt site during the peaks of my frustration, and then browsed my small index of 3 documents (which I can send to you in a separate email if you want to look at it, it's real small). I looked up the field for "contactemail" for one of the documents in the index. I also verified as I mentioned, that my query was being captured by the filter correctly. For instance a query for "contactemail:[EMAIL PROTECTED]" correctly shows up as: "contactemail:[EMAIL PROTECTED]". When I used Luke to look up the doc in the index, and its corresponding contactemail field, here is what it appeared as under the "tokenized" tab: "[EMAIL PROTECTED]" Which is the exact same way that it was stored, and the same way that I queried on it. So, not really sure what the problem is here. Thanks for the suggestion, however. Any other ideas? :-) Take care, Chris ______________________________________________ Chris A. Mattmann [EMAIL PROTECTED] Staff Member Modeling and Data Management Systems Section (387) Data Management Systems and Technologies Group _________________________________________________ Jet Propulsion Laboratory Pasadena, CA Office: 171-266B Mailstop: 171-246 _______________________________________________________ Disclaimer: The opinions presented within are my own and do not reflect those of either NASA, JPL, or the California Institute of Technology.