Hi,
The following is a configuration of whether and how nutch store the fields(I
got this from jobConf). My question is how do I configure a field is indexed or
not?
ThanksDennis
{lucene.field.vector.cache=NO, lucene.field.index.tstamp=NO,
lucene.field.index.content=TOKENIZED, lucene.field.store.content=NO,
lucene.field.vector.tstamp=NO, mapred.task.id=attempt_local_0001_r_000000_0,
lucene.field.store.cache=YES, lucene.field.store.site=NO,
lucene.field.store.host=NO, lucene.field.index.url=TOKENIZED,
mapred.task.partition=0, lucene.field.index.title=TOKENIZED,
lucene.field.vector.url=NO, mapred.tip.id=task_local_0001_r_000000,
lucene.field.vector.title=NO, mapred.map.tasks=10,
lucene.field.index.site=UNTOKENIZED, lucene.field.index.anchor=TOKENIZED,
lucene.field.index.host=TOKENIZED, lucene.field.store.tstamp=YES,
lucene.field.store.anchor=NO, lucene.field.vector.anchor=NO,
lucene.field.store.title=YES, lucene.field.vector.site=NO,
lucene.field.vector.host=NO, mapred.job.id=job_local_0001,
lucene.field.store.url=YES,
mapred.work.output.dir=file:/home/bill/workspacecloud2/nutch-1.2/crawl/indexes/_temporary/_attempt_local_0001_r_000000_0,
mapred.skip.on=false, lucene.field.index.cache=NO,
lucene.field.vector.content=NO, mapred.task.is.map=false}