[ https://issues.apache.org/jira/browse/CONNECTORS-1617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17036165#comment-17036165 ]

Karl Wright commented on CONNECTORS-1617:
-----------------------------------------

Jira -> Create -> pull down "TIKA" in the "Project" pulldown.

> Date format extraction problem in XLS/XLSX
> ------------------------------------------
>
>                 Key: CONNECTORS-1617
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1617
>             Project: ManifoldCF
>          Issue Type: Task
>          Components: Tika extractor, Tika service connector
>    Affects Versions: ManifoldCF 2.10
>            Reporter: Zoltan Farago
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF 2.16
>
>         Attachments: exceldatum.xlsx
>
>
> Currently TIKA/ManifoldCF 2.10 extracts dates from the attached file this way:
> 2018.05.10 -> 10/05/18
> 2002.02.02 -> 2/2/2
> We need this format:
> 2018.05.10 -> 2018-05-10
> 2002.02.02 -> 2002-02-02
> This occurs only when the field type is date. When the field type is text,
> the output is fine.
>
> Please help us with a recommendation for any settings in the pipeline (Tika
> configs, Excel settings, OS local settings, etc.), or provide a fix.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)