Re: Can ExtractingRequestHandler ignore documents metadata

2011-05-11 Thread Grant Ingersoll
You can map the attributes to the ignore field.  Alternatively, override the 
SolrContentHandler's newMethod() method to skip adding them.  Come to think of 
it, I'll put up a quick patch that breaks that out a bit more and makes it 
easier to override.  Longer term, a patch to exclude metadata would be good.


On May 9, 2011, at 9:47 AM, Tod wrote:

 I'm indexing content from a CMS' database of metadata.  The client would 
 prefer that Solr exclude the properties (metadata) of any documents being 
 indexed.  Is there a way to tell Tika to only index a document's text and not 
 its properties?
 
 Thanks - Tod

--
Grant Ingersoll
http://www.lucenerevolution.org



Can ExtractingRequestHandler ignore documents metadata

2011-05-09 Thread Tod
I'm indexing content from a CMS' database of metadata.  The client would 
prefer that Solr exclude the properties (metadata) of any documents 
being indexed.  Is there a way to tell Tika to only index a document's 
text and not its properties?


Thanks - Tod