Hi, On 9/20/07, Chris Mattmann <[EMAIL PROTECTED]> wrote: > Here, we have a very similar case. Jerome Charron has clearly used the > shared mime-info DTD format from freedesktop.org as the "data model" for his > mime type db format. However, the entire code-base that reads, interprets, > reformats, and processes the information made available by the mime type > database, was written by Jerome. Also, I'm not sure that Jerome's mime DTD > (and associated mime XML format) provided in TIKA-6 doesn't add some new > capability (e.g., I notice that the Nutch mimes are in this new mime db as > well) that was not present in the freedesktop.org original one.
The problem is not about the code that reads and interprets the database, but about the database (freedesktop.org.xml) and the related database description (freedesktop.org.dtd). If we wanted we could recreate both the database description (by reading the spec and writing our own DTD file) and even the database (by collecting vast amounts of content type information) under the Apache license, but AFAIK the current versions included in the patch are largely based on the GPL-licensed versions from freedesktop.org. So my suggestion would be to drop the xml and dtd files from the patch and replace them with configuration options for pointing the (Apache licensed) code to externally acquired database files. BR, Jukka Zitting
