[ 
https://issues.apache.org/jira/browse/TIKA-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363973#comment-14363973
 ] 

Nick Burch commented on TIKA-1082:
----------------------------------

IIRC, 1601-01-01 is a date value of 0. Tika does check whether the Microsoft 
Office property value is set, and only applies it to the metadata if so, but it 
seems that in some cases tools are creating Word files with the value set to 0.

I wonder if we need to put in a special check for dates of value 0? Normally 
we do want Tika to return the values saved in the file, but in this case it 
looks like the value that was written into the file was bogus...
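
To make the idea concrete, here is a minimal sketch of such a check. It assumes 
the stored date is a Windows FILETIME-style value (a count of 100-nanosecond 
ticks since 1601-01-01T00:00:00Z), which is why 0 comes out as 1601-01-01. The 
class and method names (FiletimeCheck, fromFiletime, toMetadataDate) are 
hypothetical and not part of Tika or POI; this only illustrates the logic, not 
where it would go in the parser.

```java
import java.time.Instant;
import java.util.Optional;

public class FiletimeCheck {
    // Assumed epoch for FILETIME-style values: 1601-01-01T00:00:00Z.
    static final Instant FILETIME_EPOCH = Instant.parse("1601-01-01T00:00:00Z");

    // Number of 100-nanosecond ticks in one second.
    static final long TICKS_PER_SECOND = 10_000_000L;

    // Converts a raw tick count into an Instant. A value of 0 maps to the epoch itself.
    static Instant fromFiletime(long filetime) {
        long seconds = filetime / TICKS_PER_SECOND;
        long nanos = (filetime % TICKS_PER_SECOND) * 100L;
        return FILETIME_EPOCH.plusSeconds(seconds).plusNanos(nanos);
    }

    // Hypothetical guard: treat a raw value of 0 as "not set" instead of
    // reporting 1601-01-01 in the metadata.
    static Optional<Instant> toMetadataDate(long filetime) {
        if (filetime == 0L) {
            return Optional.empty();
        }
        return Optional.of(fromFiletime(filetime));
    }

    public static void main(String[] args) {
        System.out.println(fromFiletime(0L));                 // 1601-01-01T00:00:00Z
        System.out.println(toMetadataDate(0L).isPresent());   // false
    }
}
```

With a guard like this, a file whose date property is present but 0 would 
simply have no date in the metadata, while any non-zero value is still passed 
through unchanged.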

> Incorrect date in Doc metadata
> ------------------------------
>
>                 Key: TIKA-1082
>                 URL: https://issues.apache.org/jira/browse/TIKA-1082
>             Project: Tika
>          Issue Type: Bug
>          Components: metadata
>    Affects Versions: 1.3
>            Reporter: Bernhard Berger
>            Priority: Minor
>         Attachments: EnglishDoc.doc
>
>
> I get the incorrect date "1601-01-01T00:00:00Z" from an MS Word document in 
> the Tika 1.3 metadata.
> The same document gives the correct date "2011-10-05T11:32:21Z" with Tika 1.2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
