Re: Can ExtractingRequestHandler ignore documents metadata

2011-05-11 Thread Grant Ingersoll
You can map the attributes to the ignore field. Alternatively, override the SolrContentHandler's newMethod() method to skip adding them. Come to think of it, I'll put up a quick patch that breaks that out a bit more and makes it easier to override. Longer term, a patch to exclude metadata

Can ExtractingRequestHandler ignore documents metadata

2011-05-09 Thread Tod
I'm indexing content from a CMS' database of metadata. The client would prefer that Solr exclude the properties (metadata) of any documents being indexed. Is there a way to tell Tika to only index a document's text and not its properties? Thanks - Tod