liuxiaoyu created ORC-1287:
------------------------------
Summary: C++ read timestamp value is different from java read when
using csv-import tool converter CSV to ORC files
Key: ORC-1287
URL: https://issues.apache.org/jira/browse/ORC-1287
Project: ORC
Issue Type: Bug
Components: C++
Affects Versions: 1.7.3
Environment: centos7
Reporter: liuxiaoyu
Attachments: test.csv, test.orc
I have a csv file. Convert to orc files with the c++ csv-import tool.
ORC Version is v1.7.3
Command
```
csv-import struct<a:timestamp> ./test.csv ./test.orc
java -jar orc-tools-1.7.3-uber.jar data test.orc
orc-contents ./test.orc
```
CSV File
```
0001-01-01 00:00:00.000000
0001-10-19 10:23:54.123456
0099-10-19 10:23:54.123456
1900-10-19 10:23:54.123456
1969-12-31 23:59:59.001
1969-12-31 23:59:59.999999
1970-01-01 00:00:00.000
1970-01-01 00:00:00.001
1970-01-01 23:59:59.999999
```
c++ read orc file
```
{"a": "1-01-01 00:00:00.0"}
{"a": "1-10-19 10:23:54.123456"}
{"a": "99-10-19 10:23:54.123456"}
{"a": "1900-10-19 10:23:54.123456"}
{"a": "1970-01-01 00:00:00.001"}
{"a": "1970-01-01 00:00:00.999999"}
{"a": "1970-01-01 00:00:00.0"}
{"a": "1970-01-01 00:00:00.001"}
{"a": "1970-01-01 23:59:59.999999"}
```
java read orc file
```
{"a":"0001-01-03 08:00:00.0"}
{"a":"0001-10-21 18:23:54.123456"}
{"a":"0099-10-21 18:23:54.123456"}
{"a":"1900-10-19 18:29:37.123456"}
{"a":"1970-01-01 08:00:00.001"}
{"a":"1970-01-01 08:00:00.999999"}
{"a":"1970-01-01 08:00:00.0"}
{"a":"1970-01-01 08:00:00.001"}
{"a":"1970-01-02 07:59:59.999999"}
```
`0001-01-01 00:00:00.000000` java and c++ show timestamp are different
Tried the version orc main branch is the same results.
this issue looks similar to this issue
https://issues.apache.org/jira/browse/ORC-1055
--
This message was sent by Atlassian Jira
(v8.20.10#820010)