Oops, our emails passed in the ether. Thank you, Jukka! -----Original Message----- From: Jukka Zitting [mailto:jukka.zitt...@gmail.com] Sent: Wednesday, April 22, 2015 12:06 PM To: dev@tika.apache.org Subject: Re: comparing Tika's file detect with other tools?
Hi, Copyright also covers databases, so we'll need to honor the license terms equally when copying file's code or detection patterns. Luckily file (from http://www.darwinsys.com/file/) comes under a BSD license, so reusing the code or data is quite simple from a licensing perspective. In fact we've already done some of that earlier, see https://github.com/apache/tika/commit/f807af0ee947affd34d84b334bbdc32c11576b2e for an example. BR, Jukka