Hi all,

Issues 895 and 914 both ask that an empty title be written as <title></title> rather than <title/>, which is what Tika currently generates. A fix for this was committed to the XHTMLContentHandler class when TIKA-725 was closed, but it doesn't seem to work.

One idea is to insert a zero-width or non-breaking character between <title> and </title>. The candidates I have in mind are \uFEFF (zero width no-break space), \u200B (zero width space), \u202F (narrow no-break space), and \u2060 (word joiner). Is anyone interested in hearing how well they work (or don't) on my versions of Windows and Linux?
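
To make the idea concrete, here is a rough sketch of the kind of test I mean. It does not touch Tika or XHTMLContentHandler at all; it uses the JDK's own SAX serializer (TransformerHandler) as a stand-in, and the class and method names (EmptyTitleSketch, serializeTitle) are made up for illustration. It sends a title element through the serializer with a filler character between the start and end tags, so the element is no longer empty when it is written out.

```java
import java.io.StringWriter;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.TransformerConfigurationException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.sax.SAXTransformerFactory;
import javax.xml.transform.sax.TransformerHandler;
import javax.xml.transform.stream.StreamResult;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;

// Hypothetical test harness, not part of Tika.
public class EmptyTitleSketch {

    // The four candidate filler characters from the message above.
    static final String[] CANDIDATES = {"\uFEFF", "\u200B", "\u202F", "\u2060"};

    // Serializes a lone <title> element, optionally with filler text inside it.
    static String serializeTitle(String filler) {
        try {
            SAXTransformerFactory factory =
                    (SAXTransformerFactory) TransformerFactory.newInstance();
            TransformerHandler handler = factory.newTransformerHandler();
            handler.getTransformer().setOutputProperty(OutputKeys.METHOD, "xml");
            handler.getTransformer().setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");

            StringWriter out = new StringWriter();
            handler.setResult(new StreamResult(out));

            handler.startDocument();
            handler.startElement("", "title", "title", new AttributesImpl());
            if (!filler.isEmpty()) {
                // With at least one character event, the element has content.
                handler.characters(filler.toCharArray(), 0, filler.length());
            }
            handler.endElement("", "title", "title");
            handler.endDocument();
            return out.toString();
        } catch (TransformerConfigurationException | SAXException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Baseline: no filler, so the serializer is free to collapse the element.
        System.out.println("no filler -> " + serializeTitle(""));
        for (String c : CANDIDATES) {
            System.out.printf("U+%04X -> %s%n", (int) c.charAt(0), serializeTitle(c));
        }
    }
}
```

This only shows what one serializer writes. Whether browsers and other consumers on Windows and Linux treat each filler character as invisible is a separate question, and that is the part I would report on.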

Thanks,
John Mastarone
