[ https://issues.apache.org/jira/browse/ARROW-7808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17033357#comment-17033357 ]
Hongze Zhang commented on ARROW-7808: ------------------------------------- The major goal is to adopt existing C++ file formats in Java code (maybe via JNI bridge). To have the Datasets APIs implemented in Java is the most reasonable approach as users would be able to access any of the layers defined in C++ Datasets API. Could others in the community please reconfirm this proposal (I recall that we have a discussion for this months ago)? And I can file PRs after then. > [Java][Dataset] Implement Datasets Java API > -------------------------------------------- > > Key: ARROW-7808 > URL: https://issues.apache.org/jira/browse/ARROW-7808 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ - Dataset, Java > Reporter: Hongze Zhang > Priority: Major > Labels: dataset > > Porting following C++ Datasets APIs to Java: > * DataSource > * DataSourceDiscovery > * DataFragment > * Dataset > * Scanner > * ScanTask > * ScanOptions -- This message was sent by Atlassian Jira (v8.3.4#803005)