hudi-bot opened a new issue, #14781: URL: https://github.com/apache/hudi/issues/14781
Implement HoodieInternalRowOrcWriter for Spark datasource integration with ORC.

## JIRA info

- Link: https://issues.apache.org/jira/browse/HUDI-1824
- Type: Task
- Epic: https://issues.apache.org/jira/browse/HUDI-57
- Fix version(s):
  - 0.16.0
  - 1.1.0

---

## Comments

31/May/21 17:55;manasaks;[~Teresa] Looking at the ORC API, we don't seem to have a counterpart to ParquetWriteSupport, or an ORC write method that accepts InternalRow. Instead, ORC exposes the VectorizedRowBatch abstraction, so I presume we would have to convert explicitly from InternalRow -> VectorizedRowBatch. Also, is there an alternative to ParquetWriteSupport on the ORC side for implementing the bloom filter functionality?;;;

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
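For readers following the InternalRow -> VectorizedRowBatch question raised in the comment, here is a minimal, dependency-free sketch of the row-to-columnar buffering pattern such a writer would likely need. Everything in it is an assumption made for illustration: `ColumnBatch` and `RowToBatchWriter` are hypothetical stand-ins, not ORC or Hudi classes, an `Object[]` row stands in for Spark's `InternalRow`, and the two-column schema (a long `id` and a nullable string `name`) is invented. A real implementation would presumably fill ORC's column vectors and hand each full batch to the ORC writer instead of recording its size.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a columnar batch; NOT ORC's VectorizedRowBatch. */
final class ColumnBatch {
    final long[] ids;            // fixed-width column
    final byte[][] names;        // variable-length column stored as UTF-8 bytes
    final boolean[] nameIsNull;  // null mask for the names column
    int size = 0;                // number of rows currently buffered

    ColumnBatch(int capacity) {
        ids = new long[capacity];
        names = new byte[capacity][];
        nameIsNull = new boolean[capacity];
    }

    boolean isFull() {
        return size == ids.length;
    }

    void reset() {
        size = 0;
        Arrays.fill(nameIsNull, false);
    }
}

/** Hypothetical row-oriented facade that pivots incoming rows into a ColumnBatch. */
final class RowToBatchWriter {
    private final ColumnBatch batch;
    // Records the size of each flushed batch; stands in for the underlying file writer.
    final List<Integer> flushedSizes = new ArrayList<>();

    RowToBatchWriter(int capacity) {
        batch = new ColumnBatch(capacity);
    }

    /** Assumed row layout: {Long id, String name or null}. */
    void write(Object[] row) {
        int i = batch.size++;
        batch.ids[i] = (Long) row[0];
        if (row[1] == null) {
            batch.nameIsNull[i] = true;
            batch.names[i] = null;
        } else {
            batch.names[i] = ((String) row[1]).getBytes(StandardCharsets.UTF_8);
        }
        if (batch.isFull()) {
            flush();
        }
    }

    /** Emits the buffered rows as one batch, then clears the buffer. */
    void flush() {
        if (batch.size == 0) {
            return;
        }
        flushedSizes.add(batch.size); // a real writer would pass the batch downstream here
        batch.reset();
    }

    /** Flushes any partially filled batch. */
    void close() {
        flush();
    }
}
```

The point of the sketch is the shape of the write path: each row is scattered into per-column arrays at the current batch index, a full batch is flushed and reset, and `close()` flushes the remainder. Per-type conversion and null handling are where most of the real work would sit.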
