All, Over the weekend, Shawn Heisey very kindly drafted a wikipage about the challenges of using Solr's ExtractingRequestHandler and the guidance to avoid it in production.
I completely agree with this point, and I think that Shawn did a very nice job of capturing some of the challenges. If you have any feedback or would like to make edits, see: https://wiki.apache.org/solr/RecommendCustomIndexingWithTika Cheers, Tim