[jira] [Commented] (HUDI-1332) Introduce FlinkHoodieBloomIndex to hudi-flink-client

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17247621#comment-17247621 ] Gary Li commented on HUDI-1332: --- need to switch priority, unassign myself for now. Anyone in

[jira] [Assigned] (HUDI-1404) Make flink engine support bulkinsert operation

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li reassigned HUDI-1404: - Assignee: Gary Li > Make flink engine support bulkinsert operation >

[jira] [Updated] (HUDI-1337) Deduplicate data in one batch for flink engine

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1337: -- Status: Open (was: New) > Deduplicate data in one batch for flink engine >

[jira] [Updated] (HUDI-1418) Set up unit test infra for flink client

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1418: -- Fix Version/s: 0.7.0 > Set up unit test infra for flink client > --- > >

[jira] [Updated] (HUDI-1418) Set up unit test infra for flink client

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1418: -- Status: Open (was: New) > Set up unit test infra for flink client > --- > >

[jira] [Updated] (HUDI-1418) Set up unit test infra for flink client

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1418: -- Status: Patch Available (was: In Progress) > Set up unit test infra for flink client >

[jira] [Updated] (HUDI-1418) Set up unit test infra for flink client

2020-12-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1418: -- Status: In Progress (was: Open) > Set up unit test infra for flink client > ---

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-05 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Fix Version/s: 0.7.0 > MOR table commit metadata has wrong log path > --

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-05 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Status: In Progress (was: Open) > MOR table commit metadata has wrong log path > --

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-04 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Description: When writing 3 delta commits in sequence, commit 1 write parquet files, commit 2 write log files wi

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-04 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Status: Open (was: New) > MOR table commit metadata has wrong log path > --

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-04 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Attachment: Screen Shot 2020-12-05 at 2.02.28 PM.png > MOR table commit metadata has wrong log path > --

[jira] [Updated] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-04 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1434: -- Attachment: Screen Shot 2020-12-05 at 2.00.12 PM.png > MOR table commit metadata has wrong log path > --

[jira] [Created] (HUDI-1434) MOR table commit metadata has wrong log path

2020-12-04 Thread Gary Li (Jira)
Gary Li created HUDI-1434: - Summary: MOR table commit metadata has wrong log path Key: HUDI-1434 URL: https://issues.apache.org/jira/browse/HUDI-1434 Project: Apache Hudi Issue Type: Bug

[jira] [Commented] (HUDI-1332) Introduce FlinkHoodieBloomIndex to hudi-flink-client

2020-11-26 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17239114#comment-17239114 ] Gary Li commented on HUDI-1332: --- hi [~wangxianghu], are you currently working on this issue?

[jira] [Updated] (HUDI-1332) Introduce FlinkHoodieBloomIndex to hudi-flink-client

2020-11-26 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1332: -- Status: Open (was: New) > Introduce FlinkHoodieBloomIndex to hudi-flink-client > --

[jira] [Resolved] (HUDI-1392) lose partition info when using spark parameter "basePath"

2020-11-25 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li resolved HUDI-1392. --- Resolution: Fixed > lose partition info when using spark parameter "basePath" > -

[jira] [Created] (HUDI-1418) Set up unit test infra for flink client

2020-11-25 Thread Gary Li (Jira)
Gary Li created HUDI-1418: - Summary: Set up unit test infra for flink client Key: HUDI-1418 URL: https://issues.apache.org/jira/browse/HUDI-1418 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-1392) lose partition info when using spark parameter "basePath"

2020-11-23 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Li updated HUDI-1392: -- Status: Open (was: New) > lose partition info when using spark parameter "basePath" >

[jira] [Commented] (HUDI-1397) Different behavior between RealtimeCompactedRecordReader and HoodieMergeOnReadRDD

2020-11-17 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234186#comment-17234186 ] Gary Li commented on HUDI-1397: --- Awesome! > Different behavior between RealtimeCompactedRec

[jira] [Commented] (HUDI-1397) Different behavior between RealtimeCompactedRecordReader and HoodieMergeOnReadRDD

2020-11-16 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232896#comment-17232896 ] Gary Li commented on HUDI-1397: --- [~advancedxy] thanks for reporting. We need a serializer he

[jira] [Commented] (HUDI-791) Replace null by Option in Delta Streamer

2020-11-10 Thread Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17229208#comment-17229208 ] Gary Li commented on HUDI-791: -- [~rxu] please close. not sure why but this apache account does

[jira] [Created] (HUDI-1270) NoSuchMethod PartitionedFile on AWS EMR Spark 2.4.5

2020-09-04 Thread Gary Li (Jira)
Gary Li created HUDI-1270: - Summary: NoSuchMethod PartitionedFile on AWS EMR Spark 2.4.5 Key: HUDI-1270 URL: https://issues.apache.org/jira/browse/HUDI-1270 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-08-19 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1205: - Status: Open (was: New) > Serialization fail when log file is larger than 2GB > -

[jira] [Updated] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-08-19 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1205: - Description: When scanning the log file, if the log file(or log file group) is larger than 2GB, s

[jira] [Created] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-08-19 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1205: Summary: Serialization fail when log file is larger than 2GB Key: HUDI-1205 URL: https://issues.apache.org/jira/browse/HUDI-1205 Project: Apache Hudi Issue T

[jira] [Commented] (HUDI-920) Incremental view on MOR table using Spark Datasource

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173594#comment-17173594 ] Yanjia Gary Li commented on HUDI-920: - The most challenging thing of the incremental qu

[jira] [Resolved] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-69. Resolution: Fixed > Support realtime view in Spark datasource #136 > -

[jira] [Reopened] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reopened HUDI-69: > Support realtime view in Spark datasource #136 > -- > >

[jira] [Resolved] (HUDI-1051) Improve MOR datasource reader file listing and path handling

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1051. -- Resolution: Fixed > Improve MOR datasource reader file listing and path handling > -

[jira] [Updated] (HUDI-69) Support realtime view in Spark datasource #136

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-69?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-69: --- Status: Closed (was: Patch Available) > Support realtime view in Spark datasource #136 > --

[jira] [Resolved] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1052. -- Resolution: Fixed > Support vectorized reader for MOR datasource reader > --

[jira] [Resolved] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-1050. -- Resolution: Fixed > Support filter pushdown and column pruning for MOR table on Spark Datasource

[jira] [Updated] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-08-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1052: - Status: In Progress (was: Open) > Support vectorized reader for MOR datasource reader > -

[jira] [Updated] (HUDI-1141) Serialization fail when loading two log files

2020-07-31 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1141: - Summary: Serialization fail when loading two log files (was: Serialization fail when loading larg

[jira] [Created] (HUDI-1141) Serialization fail when loading large log files

2020-07-31 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1141: Summary: Serialization fail when loading large log files Key: HUDI-1141 URL: https://issues.apache.org/jira/browse/HUDI-1141 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-07-26 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1050: - Status: In Progress (was: Open) > Support filter pushdown and column pruning for MOR table on Spa

[jira] [Updated] (HUDI-1120) Support spotless for scala

2020-07-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1120: - Component/s: Code Cleanup > Support spotless for scala > -- > >

[jira] [Updated] (HUDI-1120) Support spotless for scala

2020-07-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1120: - Fix Version/s: 0.6.0 > Support spotless for scala > -- > >

[jira] [Updated] (HUDI-1120) Support spotless for scala

2020-07-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1120: - Status: In Progress (was: Open) > Support spotless for scala > -- > >

[jira] [Updated] (HUDI-1120) Support spotless for scala

2020-07-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1120: - Status: Open (was: New) > Support spotless for scala > -- > >

[jira] [Created] (HUDI-1120) Support spotless for scala

2020-07-22 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1120: Summary: Support spotless for scala Key: HUDI-1120 URL: https://issues.apache.org/jira/browse/HUDI-1120 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-07-21 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1050: - Fix Version/s: (was: 0.6.1) 0.6.0 > Support filter pushdown and column prun

[jira] [Updated] (HUDI-1114) Explore Spark Structure Streaming for Hudi Dataset

2020-07-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1114: - Status: Open (was: New) > Explore Spark Structure Streaming for Hudi Dataset > --

[jira] [Created] (HUDI-1114) Explore Spark Structure Streaming for Hudi Dataset

2020-07-20 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1114: Summary: Explore Spark Structure Streaming for Hudi Dataset Key: HUDI-1114 URL: https://issues.apache.org/jira/browse/HUDI-1114 Project: Apache Hudi Issue Ty

[jira] [Created] (HUDI-1101) Decouple Hive dependencies from hudi-spark and hudi-utilities

2020-07-16 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1101: Summary: Decouple Hive dependencies from hudi-spark and hudi-utilities Key: HUDI-1101 URL: https://issues.apache.org/jira/browse/HUDI-1101 Project: Apache Hudi

[jira] [Updated] (HUDI-1051) Improve MOR datasource reader file listing and path handling

2020-06-24 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1051: - Status: Open (was: New) > Improve MOR datasource reader file listing and path handling >

[jira] [Updated] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-06-24 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1052: - Status: Open (was: New) > Support vectorized reader for MOR datasource reader > -

[jira] [Updated] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-06-24 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1050: - Status: Open (was: New) > Support filter pushdown and column pruning for MOR table on Spark Datas

[jira] [Created] (HUDI-1052) Support vectorized reader for MOR datasource reader

2020-06-24 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1052: Summary: Support vectorized reader for MOR datasource reader Key: HUDI-1052 URL: https://issues.apache.org/jira/browse/HUDI-1052 Project: Apache Hudi Issue T

[jira] [Created] (HUDI-1051) Improve MOR datasource reader file listing and path handling

2020-06-24 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1051: Summary: Improve MOR datasource reader file listing and path handling Key: HUDI-1051 URL: https://issues.apache.org/jira/browse/HUDI-1051 Project: Apache Hudi

[jira] [Created] (HUDI-1050) Support filter pushdown and column pruning for MOR table on Spark Datasource

2020-06-24 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1050: Summary: Support filter pushdown and column pruning for MOR table on Spark Datasource Key: HUDI-1050 URL: https://issues.apache.org/jira/browse/HUDI-1050 Project: Apa

[jira] [Updated] (HUDI-1028) Hudi write job stuck when start EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1028: - Description: With "hoodie.embed.timeline.server" set to "true" as default in 0.5.3, I deployed a

[jira] [Updated] (HUDI-1028) Hudi write job stuck when start EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1028: - Summary: Hudi write job stuck when start EmbeddedTimelineService failed (was: Hudi write job stuc

[jira] [Commented] (HUDI-1028) Hudi write job stuck when start EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17138910#comment-17138910 ] Yanjia Gary Li commented on HUDI-1028: -- Hi [~xleesf], have you seen similar things ha

[jira] [Updated] (HUDI-1028) Hudi write job stuck when create EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1028: - Description: With "hoodie.embed.timeline.server" set to "true" as default in 0.5.3, I deployed a

[jira] [Updated] (HUDI-1028) Hudi write job stuck when create EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1028: - Attachment: stack_trace.txt > Hudi write job stuck when create EmbeddedTimelineService failed > --

[jira] [Updated] (HUDI-1028) Hudi write job stuck when create EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1028: - Status: Open (was: New) > Hudi write job stuck when create EmbeddedTimelineService failed > -

[jira] [Created] (HUDI-1028) Hudi write job stuck when create EmbeddedTimelineService failed

2020-06-17 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1028: Summary: Hudi write job stuck when create EmbeddedTimelineService failed Key: HUDI-1028 URL: https://issues.apache.org/jira/browse/HUDI-1028 Project: Apache Hudi

[jira] [Commented] (HUDI-1018) Handle empty checkpoint better in delta streamer

2020-06-12 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17134533#comment-17134533 ] Yanjia Gary Li commented on HUDI-1018: -- [~Litianye] since we solve this ticket togeth

[jira] [Assigned] (HUDI-1018) Handle empty checkpoint better in delta streamer

2020-06-12 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reassigned HUDI-1018: Assignee: Tianye Li > Handle empty checkpoint better in delta streamer > --

[jira] [Updated] (HUDI-1018) Handle empty checkpoint better in delta streamer

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1018: - Component/s: DeltaStreamer > Handle empty checkpoint better in delta streamer > --

[jira] [Updated] (HUDI-1018) Handle empty checkpoint better in delta streamer

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1018: - Status: Open (was: New) > Handle empty checkpoint better in delta streamer >

[jira] [Created] (HUDI-1018) Handle empty checkpoint better in delta streamer

2020-06-09 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1018: Summary: Handle empty checkpoint better in delta streamer Key: HUDI-1018 URL: https://issues.apache.org/jira/browse/HUDI-1018 Project: Apache Hudi Issue Type

[jira] [Closed] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-905. --- Resolution: Not A Problem TableScan already supported filter and projection pushdown. > Support Pruned

[jira] [Updated] (HUDI-610) MOR table Impala read support

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-610: Summary: MOR table Impala read support (was: Impala nea real time table support) > MOR table Impala

[jira] [Assigned] (HUDI-610) Impala nea real time table support

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reassigned HUDI-610: --- Assignee: (was: Yanjia Gary Li) > Impala nea real time table support > ---

[jira] [Resolved] (HUDI-494) [DEBUGGING] Huge amount of tasks when writing files into HDFS

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-494. - Resolution: Fixed > [DEBUGGING] Huge amount of tasks when writing files into HDFS > ---

[jira] [Closed] (HUDI-494) [DEBUGGING] Huge amount of tasks when writing files into HDFS

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-494. --- > [DEBUGGING] Huge amount of tasks when writing files into HDFS > -

[jira] [Resolved] (HUDI-822) Decouple hoodie related methods with Hoodie Input Formats

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-822. - Resolution: Fixed > Decouple hoodie related methods with Hoodie Input Formats > ---

[jira] [Closed] (HUDI-822) Decouple hoodie related methods with Hoodie Input Formats

2020-06-09 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-822. --- > Decouple hoodie related methods with Hoodie Input Formats > -

[jira] [Updated] (HUDI-1011) Refactor hudi-client unit tests structure

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1011: - Status: Open (was: New) > Refactor hudi-client unit tests structure > ---

[jira] [Updated] (HUDI-1011) Refactor hudi-client unit tests structure

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1011: - Component/s: Testing > Refactor hudi-client unit tests structure > ---

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1010: - Status: Open (was: New) > Fix the memory leak for hudi-client unit tests > --

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1010: - Component/s: Testing > Fix the memory leak for hudi-client unit tests > --

[jira] [Updated] (HUDI-1011) Refactor hudi-client unit tests structure

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1011: - Labels: help-wanted (was: ) > Refactor hudi-client unit tests structure > ---

[jira] [Created] (HUDI-1011) Refactor hudi-client unit tests structure

2020-06-08 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1011: Summary: Refactor hudi-client unit tests structure Key: HUDI-1011 URL: https://issues.apache.org/jira/browse/HUDI-1011 Project: Apache Hudi Issue Type: Impro

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1010: - Description: hudi-client unit test has a memory leak, which could be some resources are not prope

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1010: - Description: hudi-client unit test has a memory leak, which could be some resources are not prope

[jira] [Updated] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-1010: - Labels: help-wanted (was: ) > Fix the memory leak for hudi-client unit tests > --

[jira] [Created] (HUDI-1010) Fix the memory leak for hudi-client unit tests

2020-06-08 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-1010: Summary: Fix the memory leak for hudi-client unit tests Key: HUDI-1010 URL: https://issues.apache.org/jira/browse/HUDI-1010 Project: Apache Hudi Issue Type:

[jira] [Resolved] (HUDI-773) Hudi On Azure Data Lake Storage V2

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-773. - Resolution: Fixed Azure info was added to the docs. > Hudi On Azure Data Lake Storage V2 > ---

[jira] [Closed] (HUDI-773) Hudi On Azure Data Lake Storage V2

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-773. --- > Hudi On Azure Data Lake Storage V2 > -- > > Key: HUDI-773

[jira] [Resolved] (HUDI-804) Add Azure Support to Hudi Doc

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-804. - Resolution: Fixed > Add Azure Support to Hudi Doc > - > >

[jira] [Closed] (HUDI-804) Add Azure Support to Hudi Doc

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-804. --- > Add Azure Support to Hudi Doc > - > > Key: HUDI-804 >

[jira] [Resolved] (HUDI-805) Verify which types of Azure storage support Hudi

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li resolved HUDI-805. - Resolution: Fixed Azure Data Lake Storage Gen 2 and Azure Blob Storage support Hudi. > Verify whic

[jira] [Closed] (HUDI-805) Verify which types of Azure storage support Hudi

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li closed HUDI-805. --- > Verify which types of Azure storage support Hudi > > >

[jira] [Updated] (HUDI-805) Verify which types of Azure storage support Hudi

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-805: Status: Open (was: New) > Verify which types of Azure storage support Hudi > ---

[jira] [Updated] (HUDI-805) Verify which types of Azure storage support Hudi

2020-05-27 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-805: Status: In Progress (was: Open) > Verify which types of Azure storage support Hudi > ---

[jira] [Updated] (HUDI-804) Add Azure Support to Hudi Doc

2020-05-25 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-804: Status: In Progress (was: Open) > Add Azure Support to Hudi Doc > - > >

[jira] [Updated] (HUDI-804) Add Azure Support to Hudi Doc

2020-05-25 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-804: Status: Open (was: New) > Add Azure Support to Hudi Doc > - > >

[jira] [Commented] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer

2020-05-23 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17114980#comment-17114980 ] Yanjia Gary Li commented on HUDI-110: - [~shivnarayan] no, the PR is not related to this

[jira] [Updated] (HUDI-110) Better defaults for Partition extractor for Spark DataSOurce and DeltaStreamer

2020-05-23 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-110: Labels: (was: bug-bash-0.6.0 pull-request-available) > Better defaults for Partition extractor for

[jira] [Assigned] (HUDI-494) [DEBUGGING] Huge amount of tasks when writing files into HDFS

2020-05-23 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reassigned HUDI-494: --- Assignee: Yanjia Gary Li (was: lamber-ken) > [DEBUGGING] Huge amount of tasks when writing fi

[jira] [Commented] (HUDI-494) [DEBUGGING] Huge amount of tasks when writing files into HDFS

2020-05-23 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17114967#comment-17114967 ] Yanjia Gary Li commented on HUDI-494: - [~shivnarayan] this is still under review. [htt

[jira] [Updated] (HUDI-920) Incremental view on MOR table using Spark Datasource

2020-05-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-920: Fix Version/s: 0.6.0 > Incremental view on MOR table using Spark Datasource > ---

[jira] [Updated] (HUDI-920) Incremental view on MOR table using Spark Datasource

2020-05-22 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-920: Status: Open (was: New) > Incremental view on MOR table using Spark Datasource > ---

[jira] [Created] (HUDI-920) Incremental view on MOR table using Spark Datasource

2020-05-22 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-920: --- Summary: Incremental view on MOR table using Spark Datasource Key: HUDI-920 URL: https://issues.apache.org/jira/browse/HUDI-920 Project: Apache Hudi (incubating)

[jira] [Assigned] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-21 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li reassigned HUDI-905: --- Assignee: Yanjia Gary Li > Support PrunedFilteredScan for Spark Datasource > -

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Status: Open (was: New) > Support PrunedFilteredScan for Spark Datasource >

<    2   3   4   5   6   7   8   9   >