All - I tested Tika with a bunch of miscellaneous text files (shell scripts, etc.), and found that an unknown (or nonexistent) extension results in the failure to get a parser using ParseUtils.getParser(URL, TikaConfig). I think that means that a MIME type could not be determined from the URL. Should an unknown file type default to text/plain and use the text parser?
Also, I believe there was code added to determine the MIME type from the stream of bytes itself, wasn't there? How would that be used? Thanks, - Keith -- View this message in context: http://www.nabble.com/Default-MIME-Type--tf4609978.html#a13164776 Sent from the Apache Tika - Development mailing list archive at Nabble.com.
