Hi,

How do we query for terms like "C++" or "[EMAIL PROTECTED]" in Nutch? I tried
modifying NutchAnalysis.jj so that a quoted string is no longer
stripped of characters such as +, @ and .

With this change, <query object>.toString() gives me:
 +((+url:"c++"^4.0) (+anchor:"c++"^2.0) (+content:"c++"))

However, this query does not return any hits, even though my index contains
documents with the word C++.
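
To illustrate one thing I suspect might be going on (this is only an assumption, I have not verified it against the Nutch source): if the tokenizer used at index time still drops characters like +, then the indexed term would be "c" and a query term "c++" could never match it. Below is a small self-contained sketch of that mismatch. It does not use any Nutch or Lucene classes; the class and method names (AnalysisMismatchSketch, indexTokens, queryTerm, matches) are hypothetical and exist only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class AnalysisMismatchSketch {

    // Hypothetical index-time tokenizer (an assumption, not Nutch's actual one):
    // lowercases the text and splits on anything that is not a letter or digit.
    static List<String> indexTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Hypothetical query-side term after the grammar change:
    // the quoted text is lowercased but its punctuation is kept.
    static String queryTerm(String quoted) {
        return quoted.toLowerCase(Locale.ROOT);
    }

    // A document "matches" only if the exact query term is among its indexed tokens.
    static boolean matches(String documentText, String quoted) {
        return indexTokens(documentText).contains(queryTerm(quoted));
    }

    public static void main(String[] args) {
        String doc = "Homework for the C++ course";
        System.out.println(indexTokens(doc));          // tokens as they would be indexed
        System.out.println(matches(doc, "C++"));       // punctuation kept only on the query side
        System.out.println(matches(doc, "homework"));  // plain word, same on both sides
    }
}
```

In this sketch the plain word still matches, so it only models the C++ case and says nothing about why the quoted "homework" query below fails.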

Similarly, if I search for the word homework without quotes, I get some hits.
The resulting query is:

 +((+url:homework^4.0) (+anchor:homework^2.0) (+content:homework))

However, when I search for "homework" (with quotes), I do not get any hits.
The resulting query is:

 +((+url:"homework"^4.0) (+anchor:"homework"^2.0) (+content:"homework"))

I would appreciate any pointers on how to solve this problem.

TIA,
Praveen.



_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers