Hi, I am using XPath to index different parts of HTML pages into different fields. Now I have some pure text documents that contain no HTML, so I can't use XPath. How do I index this pure text into different fields of the index? How do I make Nutch/Solr understand that the different parts belong to different fields? Maybe I can use the existing content of the fields in my index? Thanks.
- How to Index Pure Text into Separate Fields? Savannah Beckett
- Re: How to Index Pure Text into Separate Fields? Scott Gonyea
- Re: How to Index Pure Text into Separate Fields? Savannah Beckett
- Re: How to Index Pure Text into Separate Fields? Erick Erickson
- Re: How to Index Pure Text into Separate Fields? Savannah Beckett
- Re: How to Index Pure Text into Separate Fie... Lance Norskog