[ https://issues.apache.org/jira/browse/HUDI-1296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17486166#comment-17486166 ]
Alexey Kudinkin commented on HUDI-1296: --------------------------------------- I've done high-level scoping of this effort, on a high-level we'd need to: # Implement base HFile (Spark-compatible) reader ## Similar to ParquetFileFormat.buildReaderWithPartitionValues ## Used in MergeOnRead\{Snapshot|Incremental}Relation, passed to HoodieMergeOnReadRDD # Modify MergeOnReadSnapshotRelation to not assume the base file format and instead deduce it based on extension > Implement Spark DataSource using range metadata for file/partition pruning > -------------------------------------------------------------------------- > > Key: HUDI-1296 > URL: https://issues.apache.org/jira/browse/HUDI-1296 > Project: Apache Hudi > Issue Type: Task > Components: spark > Affects Versions: 0.9.0 > Reporter: Vinoth Chandar > Assignee: Alexey Kudinkin > Priority: Blocker > Fix For: 0.11.0 > > -- This message was sent by Atlassian Jira (v8.20.1#820001)