Hello everyone,

I am very new to Nutch. I am using Nutch 0.7.2 to crawl URLs on a Unix
system.

It is working fine, and I get the crawled data as output.

Now I need to modify the code to fit my requirements.

For example, I want to write a Java program that uses PruneIndexTool.

When I import packages that ship with Nutch into my sample Java program,
for example

"import org.apache.lucene.search.Query;", the compiler reports errors.

I thought importing these packages would work the same way as "import
java.io.*".


I copied the sample code from Nabble, and when I tried to compile it, it
threw errors such as "package import org.apache.lucene.search.Query does not
exist". (None of the Nutch packages were found. I got 57 errors when I
compiled the sample program!)

I just copied the code and compiled it the same way I compile a normal
HelloWorld program.

Am I compiling it correctly? If not, please tell me the correct way to
compile the code.

Is there a way to compile code that imports the Nutch packages or Nutch
classes?

These are some of the imports used in the program:

import org.apache.nutch.io.UTF8;
import org.apache.nutch.util.LogFormatter;
import org.apache.lucene.search.Query;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;



Thanks in advance,
Franklin.S

Siddharth Jonathan wrote:
> 
> Do you mean how to get rid of some of the fields that are indexed by
> default, e.g. content, anchor text, etc.?
> 
> Jonathan
> On 4/2/07, Ratnesh,V2Solutions India 
> wrote:
>>
>>
>> Hi,
>> I have written a plugin that finds the number of object tags in an HTML
>> page and the corresponding URLs.
>> I am storing "objects" as fields and the page URL as their values.
>>
>> Finally, I am interested in seeing search results related only to the
>> "objects" indexed fields, not to the fields that are already stored as
>> indexed fields.
>>
>> So how do I delete the index fields that are already stored?
>>
>> Looking forward to your reply (valuable inputs).
>>
>> Thanks to the Nutch community
>> --
>> View this message in context:
>> http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a9786377
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>
>>
> 
> 

-- 
View this message in context: 
http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a10115413
Sent from the Nutch - User mailing list archive at Nabble.com.
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
