On Aug 9, 2012, at 5:44pm, Jukka Zitting wrote: > Hi, > > On Thu, Aug 9, 2012 at 10:56 PM, Ken Krugler > <[email protected]> wrote: >> You made a note in Changes.txt that this was deprecated, so I'm assuming >> that you >> think we should hold off on fixing the abuse of CONTENT_ENCODING until after >> the >> 1.2 release, right? > > Right, there might still be clients out there that expect this > information to be present as CONTENT_ENCODING. > > In fact, unless the abuse of that field is actively harmful (i.e. > clients need to add extra workarounds to clean up the metadata), I'd > keep the field in place all the way until Tika 2.0.
Agreed - filed https://issues.apache.org/jira/browse/TIKA-974 to track this. -- Ken -------------------------- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr
