Chris - I'm not sure... On the one hand, since Tika is basically a text parsing tool, we might want to make plain text the default MIME type. We couldn't really do anything with an octet stream anyway, right?
On the other hand, we wouldn't want to attempt to parse something that does not have text, so a nonparseable MIME type such as octet stream as default might make more sense. Isn't our framework supposed to determine the MIME type based on the content? Is there perhaps just a configuration or code change that needs to be made? If so, then this is not an issue.

- Keith

Chris Mattmann wrote:
>
> Hi Keith,
>
> The default mime type in TIKA is application/octet-stream. It gets set when
> the mime type can't be determined using 3 main means (url resolution,
> extension resolution, or magic chars). This is in the MimeTypes.java file
> within the mime package. The reason no parser gets called is because there
> is no parser registered to handle that mime type.
>
> Are you suggesting that there is another, more sensible default?
>
> Thanks!
>
> Cheers,
> Chris
>

--
View this message in context: http://www.nabble.com/Default-MIME-Type--tf4609978.html#a13165862
Sent from the Apache Tika - Development mailing list archive at Nabble.com.
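
To make the behaviour Chris describes concrete, here is a rough, self-contained sketch of the fallback. It is not the code in MimeTypes.java: the class name, the extension table, the single magic check and the parser registry are all hypothetical, URL resolution is left out, and the order of the checks is an assumption. It only illustrates that a type is tried by extension, then by magic chars, and that application/octet-stream is what is left when both fail, at which point no parser is found.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical illustration only; none of these names come from Tika.
class MimeFallbackSketch {

    // The default described in the thread.
    static final String DEFAULT_TYPE = "application/octet-stream";

    // Assumed extension table (extension resolution).
    static final Map<String, String> BY_EXTENSION = Map.of(
            "txt", "text/plain",
            "html", "text/html",
            "pdf", "application/pdf");

    // Assumed parser registry: MIME type -> parser name.
    // Note there is deliberately no entry for DEFAULT_TYPE.
    static final Map<String, String> PARSERS = Map.of(
            "text/plain", "PlainTextParser",
            "text/html", "HtmlParser",
            "application/pdf", "PdfParser");

    // Extension resolution: look up whatever follows the last dot.
    static Optional<String> byExtension(String name) {
        int dot = name.lastIndexOf('.');
        if (dot < 0 || dot == name.length() - 1) {
            return Optional.empty();
        }
        String ext = name.substring(dot + 1).toLowerCase(Locale.ROOT);
        return Optional.ofNullable(BY_EXTENSION.get(ext));
    }

    // Magic chars: only the "%PDF-" prefix is checked in this sketch.
    static Optional<String> byMagic(byte[] head) {
        byte[] pdf = "%PDF-".getBytes(StandardCharsets.US_ASCII);
        if (head.length < pdf.length) {
            return Optional.empty();
        }
        for (int i = 0; i < pdf.length; i++) {
            if (head[i] != pdf[i]) {
                return Optional.empty();
            }
        }
        return Optional.of("application/pdf");
    }

    // Try each means in turn; fall back to the default when all fail.
    static String detect(String name, byte[] head) {
        return byExtension(name)
                .or(() -> byMagic(head))
                .orElse(DEFAULT_TYPE);
    }

    // Empty when no parser is registered for the type.
    static Optional<String> parserFor(String mimeType) {
        return Optional.ofNullable(PARSERS.get(mimeType));
    }

    public static void main(String[] args) {
        byte[] unknown = {0x00, 0x01, 0x02};
        String type = detect("blob", unknown);
        System.out.println(type + " -> " + parserFor(type).orElse("no parser registered"));
    }
}
```

In terms of this sketch, the two options discussed above would amount to either changing DEFAULT_TYPE to text/plain or leaving it alone and accepting that undetected content is not parsed.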
