[
https://issues.apache.org/jira/browse/ORC-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17450870#comment-17450870
]
Dongjoon Hyun commented on ORC-1054:
------------------------------------
BTW, it would be great if you can upload the ORC file into this JIRA.
> Unable to compare data (generated using CSV to ORC converter) on timestamp
> column
> ---------------------------------------------------------------------------------
>
> Key: ORC-1054
> URL: https://issues.apache.org/jira/browse/ORC-1054
> Project: ORC
> Issue Type: Bug
> Components: C++, Java
> Reporter: Varun Raval
> Priority: Major
>
> I have a CSV file with timestamp columns. Then I convert CSV file to ORC file
> using CSV to ORC converter and place the ORC file in a hive table backed by
> ORC files. I am not able to query the data using timestamp column on Apache
> Hive beeline. If timestamp is present in the select query, the corresponding
> rows are not retrieved.
> For example, table csvtest has single column (t) as timestamp datatype. It
> has a row '2021-11-10 01:02:15'. Query "select * from csvtest where t >
> '2021-11-10 00:00:00'" does not return any result. Query "select * from
> csvtest" returns the correct row.
> However, the same query "select * from csvtest where t > '2021-11-10
> 00:00:00'" works with Spark SQL and rows are retrieved correctly.
> Is this issue with how ORC file is created or is it some hive configuration
> issue?
> I have tested it on the master branch and results are same for both cpp and
> java csv to orc converters.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)