[ https://issues.apache.org/jira/browse/TIKA-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363973#comment-14363973 ]
Nick Burch commented on TIKA-1082:
----------------------------------

IIRC, 1601-01-01 is a date value of 0. Tika does check whether the Microsoft Office property value is set, and only applies it to the metadata if so, but it seems that in some cases tools are creating Word files with a value that is set but equal to 0.

I wonder if we need to put in a special check for dates of value 0? While normally we do want Tika to return the values saved in the file, in this case it looks like the value that was written into the file was bogus...

> Incorrect date in Doc metadata
> ------------------------------
>
>                 Key: TIKA-1082
>                 URL: https://issues.apache.org/jira/browse/TIKA-1082
>             Project: Tika
>          Issue Type: Bug
>          Components: metadata
>    Affects Versions: 1.3
>            Reporter: Bernhard Berger
>            Priority: Minor
>         Attachments: EnglishDoc.doc
>
>
> I get the incorrect date "1601-01-01T00:00:00Z" from an MS Word document in
> the Tika 1.3 metadata.
> The same document gives the correct date "2011-10-05T11:32:21Z" with Tika 1.2.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
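
To illustrate the "date value of 0" point in the comment above, here is a minimal sketch, assuming the stored property is a Windows FILETIME (a count of 100-nanosecond intervals since 1601-01-01T00:00:00Z). The class and method names are hypothetical and are not Tika or POI API; the zero check only shows the kind of special case being suggested, not how it would actually be implemented.

```java
import java.time.Instant;

// Hypothetical helper, not part of Tika or POI.
final class FiletimeZeroCheck {
    // Seconds between 1601-01-01T00:00:00Z and 1970-01-01T00:00:00Z.
    static final long EPOCH_DIFF_SECONDS = 11_644_473_600L;
    // Number of 100-nanosecond FILETIME ticks in one second.
    static final long TICKS_PER_SECOND = 10_000_000L;

    // Converts a raw FILETIME value to an Instant, with no special cases.
    static Instant filetimeToInstant(long filetime) {
        long seconds = Math.floorDiv(filetime, TICKS_PER_SECOND) - EPOCH_DIFF_SECONDS;
        long nanos = Math.floorMod(filetime, TICKS_PER_SECOND) * 100L;
        return Instant.ofEpochSecond(seconds, nanos);
    }

    // Assumed special case: treat a raw value of 0 as "no date set"
    // and return null instead of 1601-01-01.
    static Instant filetimeToInstantOrNull(long filetime) {
        return filetime == 0L ? null : filetimeToInstant(filetime);
    }

    public static void main(String[] args) {
        System.out.println(filetimeToInstant(0L));        // 1601-01-01T00:00:00Z
        System.out.println(filetimeToInstantOrNull(0L));  // null
    }
}
```

With a check like this, a caller would skip the metadata field whenever the helper returns null, rather than reporting the 1601 date.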