On Wed, 27 Feb 2013, eShard wrote:
Here's my quandary: I'm using manifoldcf v1.1.1 to crawl non standard (IBM) RSS feeds and custom RSS feeds. There's additional metadata in each item that we need to capture. I added the additional fields to the Solr schema (4.0 final) but the additional fields are nowhere to be found.
Does Tika extract this metadata? Maybe try using the tika-app with --metadata to check. That'll let us know if the problem is with getting the metadata out of the rss feed, or with how the SOLR plugin handles the data
Nick