[jira] [Assigned] (HUDI-5232) Add flushTasks config in StreamWriteFunction in hudi-flink

2022-11-17 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5232: Assignee: JinxinTang > Add flushTasks config in StreamWriteFunction in hudi-flink >

[jira] [Created] (HUDI-5232) Add flushTasks config in StreamWriteFunction in hudi-flink

2022-11-17 Thread JinxinTang (Jira)
JinxinTang created HUDI-5232: Summary: Add flushTasks config in StreamWriteFunction in hudi-flink Key: HUDI-5232 URL: https://issues.apache.org/jira/browse/HUDI-5232 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-5230) Lazy init secondaryView in PriorityBasedFileSystemView

2022-11-16 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-5230: - Issue Type: Improvement (was: Bug) > Lazy init secondaryView in PriorityBasedFileSystemView >

[jira] [Assigned] (HUDI-5230) Lazy init secondaryView in PriorityBasedFileSystemView

2022-11-16 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5230: Assignee: JinxinTang > Lazy init secondaryView in PriorityBasedFileSystemView >

[jira] [Created] (HUDI-5230) Lazy init secondaryView in PriorityBasedFileSystemView

2022-11-16 Thread JinxinTang (Jira)
JinxinTang created HUDI-5230: Summary: Lazy init secondaryView in PriorityBasedFileSystemView Key: HUDI-5230 URL: https://issues.apache.org/jira/browse/HUDI-5230 Project: Apache Hudi Issue Type:

[jira] [Assigned] (HUDI-5128) Fix getFileSystem way in FileSystemBackedTableMetadata, DatePartitionPathSelector and BootstrapUtils not consistent issue

2022-11-01 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5128: Assignee: JinxinTang > Fix getFileSystem way in FileSystemBackedTableMetadata, >

[jira] [Updated] (HUDI-5128) Fix getFileSystem way in FileSystemBackedTableMetadata, DatePartitionPathSelector and BootstrapUtils not consistent issue

2022-11-01 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-5128: - Summary: Fix getFileSystem way in FileSystemBackedTableMetadata, DatePartitionPathSelector and

[jira] [Created] (HUDI-5128) Unify getFileSystem way in FileSystemBackedTableMetadata, DatePartitionPathSelector and BootstrapUtils

2022-11-01 Thread JinxinTang (Jira)
JinxinTang created HUDI-5128: Summary: Unify getFileSystem way in FileSystemBackedTableMetadata, DatePartitionPathSelector and BootstrapUtils Key: HUDI-5128 URL: https://issues.apache.org/jira/browse/HUDI-5128

[jira] [Created] (HUDI-5107) Fix hadoop config in DirectWriteMarkers, HoodieFlinkEngineContext and StreamerUtil are not consistent issue

2022-10-31 Thread JinxinTang (Jira)
JinxinTang created HUDI-5107: Summary: Fix hadoop config in DirectWriteMarkers, HoodieFlinkEngineContext and StreamerUtil are not consistent issue Key: HUDI-5107 URL: https://issues.apache.org/jira/browse/HUDI-5107

[jira] [Assigned] (HUDI-5107) Fix hadoop config in DirectWriteMarkers, HoodieFlinkEngineContext and StreamerUtil are not consistent issue

2022-10-31 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5107: Assignee: JinxinTang > Fix hadoop config in DirectWriteMarkers, HoodieFlinkEngineContext and >

[jira] [Created] (HUDI-5086) The doc of org.apache.hudi.sink.meta.CkpMetadata#bootstrap is not correct

2022-10-24 Thread JinxinTang (Jira)
JinxinTang created HUDI-5086: Summary: The doc of org.apache.hudi.sink.meta.CkpMetadata#bootstrap is not correct Key: HUDI-5086 URL: https://issues.apache.org/jira/browse/HUDI-5086 Project: Apache Hudi

[jira] [Assigned] (HUDI-5086) The doc of org.apache.hudi.sink.meta.CkpMetadata#bootstrap is not correct

2022-10-24 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5086: Assignee: JinxinTang > The doc of org.apache.hudi.sink.meta.CkpMetadata#bootstrap is not correct >

[jira] [Updated] (HUDI-5005) flink stream write reuse abort instant will lead to coordinator delete file not right.

2022-10-10 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-5005: - Description: # When stream write reuse aborted instant, there is chance this one is older than instant

[jira] [Assigned] (HUDI-5005) flink stream write reuse abort instant will lead to coordinator delete file not right.

2022-10-10 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-5005: Assignee: JinxinTang > flink stream write reuse abort instant will lead to coordinator delete file

[jira] [Created] (HUDI-5005) flink stream write reuse abort instant will lead to coordinator delete file not right.

2022-10-10 Thread JinxinTang (Jira)
JinxinTang created HUDI-5005: Summary: flink stream write reuse abort instant will lead to coordinator delete file not right. Key: HUDI-5005 URL: https://issues.apache.org/jira/browse/HUDI-5005 Project:

[jira] [Assigned] (HUDI-4950) Use HoodieHiveCatalog to infer by read log file will exit due to oom

2022-09-29 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang reassigned HUDI-4950: Assignee: JinxinTang > Use HoodieHiveCatalog to infer by read log file will exit due to oom >

[jira] [Created] (HUDI-4950) Use HoodieHiveCatalog to infer by read log file will exit due to oom

2022-09-29 Thread JinxinTang (Jira)
JinxinTang created HUDI-4950: Summary: Use HoodieHiveCatalog to infer by read log file will exit due to oom Key: HUDI-4950 URL: https://issues.apache.org/jira/browse/HUDI-4950 Project: Apache Hudi

[jira] [Created] (HUDI-4877) org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct

2022-09-19 Thread JinxinTang (Jira)
JinxinTang created HUDI-4877: Summary: org.apache.hudi.index.bucket.TestHoodieSimpleBucketIndex#testTagLocation not work correct Key: HUDI-4877 URL: https://issues.apache.org/jira/browse/HUDI-4877

[jira] [Created] (HUDI-4813) Infer keygen not work in sparksql side

2022-09-08 Thread JinxinTang (Jira)
JinxinTang created HUDI-4813: Summary: Infer keygen not work in sparksql side Key: HUDI-4813 URL: https://issues.apache.org/jira/browse/HUDI-4813 Project: Apache Hudi Issue Type: Bug

[jira] [Created] (HUDI-4808) HoodieSimpleBucketIndex should also consider bucket num in log file not in base file which written by flink mor table

2022-09-07 Thread JinxinTang (Jira)
JinxinTang created HUDI-4808: Summary: HoodieSimpleBucketIndex should also consider bucket num in log file not in base file which written by flink mor table Key: HUDI-4808 URL:

[jira] [Updated] (HUDI-4777) Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2022-09-05 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4777: - Summary: Flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

[jira] [Created] (HUDI-4777) flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue

2022-09-05 Thread JinxinTang (Jira)
JinxinTang created HUDI-4777: Summary: flink gen bucket index of mor table not consistent with spark lead to duplicate bucket issue Key: HUDI-4777 URL: https://issues.apache.org/jira/browse/HUDI-4777

[jira] [Updated] (HUDI-4767) non partition table in hudi-filnk module should also respect KEYGEN_CLASS_NAME in conf

2022-09-01 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4767: - Summary: non partition table in hudi-filnk module should also respect KEYGEN_CLASS_NAME in conf (was:

[jira] [Created] (HUDI-4767) non partition table in hudi-filnk module should also KEYGEN_CLASS_NAME in conf

2022-09-01 Thread JinxinTang (Jira)
JinxinTang created HUDI-4767: Summary: non partition table in hudi-filnk module should also KEYGEN_CLASS_NAME in conf Key: HUDI-4767 URL: https://issues.apache.org/jira/browse/HUDI-4767 Project: Apache

[jira] [Created] (HUDI-4628) hudi-flink support GLOBAL_BLOOM,GLOBAL_SIMPLE,BUCKET index type

2022-08-16 Thread JinxinTang (Jira)
JinxinTang created HUDI-4628: Summary: hudi-flink support GLOBAL_BLOOM,GLOBAL_SIMPLE,BUCKET index type Key: HUDI-4628 URL: https://issues.apache.org/jira/browse/HUDI-4628 Project: Apache Hudi

[jira] [Updated] (HUDI-4461) org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists

2022-07-25 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4461: - Description: org.apache.hudi.exception.HoodieException: Error while checking whether table exists under

[jira] [Updated] (HUDI-4461) org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists

2022-07-25 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4461: - Description: (was: org.apache.hudi.exception.HoodieException: Error while checking whether table

[jira] [Updated] (HUDI-4461) org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists

2022-07-25 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4461: - Description: org.apache.hudi.exception.HoodieException: Error while checking whether table exists under

[jira] [Created] (HUDI-4461) org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists

2022-07-25 Thread JinxinTang (Jira)
JinxinTang created HUDI-4461: Summary: org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists Key: HUDI-4461 URL: https://issues.apache.org/jira/browse/HUDI-4461 Project:

[jira] [Created] (HUDI-4460) org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists

2022-07-25 Thread JinxinTang (Jira)
JinxinTang created HUDI-4460: Summary: org.apache.hudi.sink.TestWriteCopyOnWrite will failed when local hadoop env exists Key: HUDI-4460 URL: https://issues.apache.org/jira/browse/HUDI-4460 Project:

[jira] [Updated] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file

2022-07-19 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4422: - Description: Caused by: java.lang.RuntimeException: 

[jira] [Commented] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file

2022-07-19 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568442#comment-17568442 ] JinxinTang commented on HUDI-4422: -- Please assign to me, I can fix it. > read parquet failed due to

[jira] [Updated] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file

2022-07-19 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4422: - Description: Caused by: java.lang.RuntimeException: 

[jira] [Updated] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file

2022-07-19 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] JinxinTang updated HUDI-4422: - Description: Caused by: java.lang.RuntimeException: 

[jira] [Created] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file

2022-07-19 Thread JinxinTang (Jira)
JinxinTang created HUDI-4422: Summary: read parquet failed due to length is 0 or corrupt parquet file Key: HUDI-4422 URL: https://issues.apache.org/jira/browse/HUDI-4422 Project: Apache Hudi

[jira] [Commented] (HUDI-4397) Flink Inline Cluster and Compact plan distribute strategy changed from rebalance to hash to avoid potential multiple threads accessing the same file

2022-07-19 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568409#comment-17568409 ] JinxinTang commented on HUDI-4397: -- great ~ > Flink Inline Cluster and Compact plan distribute strategy