[jira] [Updated] (HUDI-7622) Add sanity check for HoodieTableSource

2024-04-27 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-7622: Summary: Add sanity check for HoodieTableSource (was: Optimization function MergeOnReadTableState#g

[jira] [Created] (HUDI-7622) Optimization function MergeOnReadTableState#getRequiredPositions

2024-04-16 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7622: --- Summary: Optimization function MergeOnReadTableState#getRequiredPositions Key: HUDI-7622 URL: https://issues.apache.org/jira/browse/HUDI-7622 Project: Apache Hudi

[jira] [Created] (HUDI-7529) Resolve hotspots in stream read

2024-03-22 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7529: --- Summary: Resolve hotspots in stream read Key: HUDI-7529 URL: https://issues.apache.org/jira/browse/HUDI-7529 Project: Apache Hudi Issue Type: Improvement

[jira] [Created] (HUDI-7477) Optimize print write error msg in StreamWriteOperatorCoordinator#doCommit

2024-03-04 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7477: --- Summary: Optimize print write error msg in StreamWriteOperatorCoordinator#doCommit Key: HUDI-7477 URL: https://issues.apache.org/jira/browse/HUDI-7477 Project: Apache H

[jira] [Created] (HUDI-7425) StreamerUtil prints wrong table path

2024-02-19 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7425: --- Summary: StreamerUtil prints wrong table path Key: HUDI-7425 URL: https://issues.apache.org/jira/browse/HUDI-7425 Project: Apache Hudi Issue Type: Improvement

[jira] [Created] (HUDI-7248) add log for HoodieActiveTimeline.transitionPendingState to check instant file

2023-12-21 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7248: --- Summary: add log for HoodieActiveTimeline.transitionPendingState to check instant file Key: HUDI-7248 URL: https://issues.apache.org/jira/browse/HUDI-7248 Project: Apac

[jira] [Created] (HUDI-7230) stream read supports skipping insert overwrite instant

2023-12-14 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7230: --- Summary: stream read supports skipping insert overwrite instant Key: HUDI-7230 URL: https://issues.apache.org/jira/browse/HUDI-7230 Project: Apache Hudi Issue

[jira] [Created] (HUDI-7156) Abstract an independent hoodie table filesystem view lock

2023-11-28 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7156: --- Summary: Abstract an independent hoodie table filesystem view lock Key: HUDI-7156 URL: https://issues.apache.org/jira/browse/HUDI-7156 Project: Apache Hudi Iss

[jira] [Updated] (HUDI-7155) Add log to print wrong number of instant metadata files

2023-11-28 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-7155: Summary: Add log to print wrong number of instant metadata files (was: Add LOG to print wrong numbe

[jira] [Created] (HUDI-7155) Add LOG to print wrong number of instant metadata files

2023-11-28 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7155: --- Summary: Add LOG to print wrong number of instant metadata files Key: HUDI-7155 URL: https://issues.apache.org/jira/browse/HUDI-7155 Project: Apache Hudi Issue

[jira] [Created] (HUDI-7041) Optimize the mem usage of partitionToFileGroupsMap during the cleaning

2023-11-07 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-7041: --- Summary: Optimize the mem usage of partitionToFileGroupsMap during the cleaning Key: HUDI-7041 URL: https://issues.apache.org/jira/browse/HUDI-7041 Project: Apache Hudi

[jira] [Created] (HUDI-6992) IncrementalInputSplits incorrectly set the latestCommit attr

2023-10-26 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6992: --- Summary: IncrementalInputSplits incorrectly set the latestCommit attr Key: HUDI-6992 URL: https://issues.apache.org/jira/browse/HUDI-6992 Project: Apache Hudi

[jira] [Created] (HUDI-6976) Add table name and range msg for streaming reads logs

2023-10-24 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6976: --- Summary: Add table name and range msg for streaming reads logs Key: HUDI-6976 URL: https://issues.apache.org/jira/browse/HUDI-6976 Project: Apache Hudi Issue T

[jira] [Created] (HUDI-6971) OOM caused by configuring read.start_commit as earliest in stream reading

2023-10-23 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6971: --- Summary: OOM caused by configuring read.start_commit as earliest in stream reading Key: HUDI-6971 URL: https://issues.apache.org/jira/browse/HUDI-6971 Project: Apache H

[jira] [Created] (HUDI-6970) Stream read allows skipping archived commits

2023-10-23 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6970: --- Summary: Stream read allows skipping archived commits Key: HUDI-6970 URL: https://issues.apache.org/jira/browse/HUDI-6970 Project: Apache Hudi Issue Type: Impr

[jira] [Created] (HUDI-6969) Add speed limit for stream read

2023-10-23 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6969: --- Summary: Add speed limit for stream read Key: HUDI-6969 URL: https://issues.apache.org/jira/browse/HUDI-6969 Project: Apache Hudi Issue Type: Improvement

[jira] [Created] (HUDI-6953) Optimizing hudi sink operators generation

2023-10-17 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6953: --- Summary: Optimizing hudi sink operators generation Key: HUDI-6953 URL: https://issues.apache.org/jira/browse/HUDI-6953 Project: Apache Hudi Issue Type: Improve

[jira] [Created] (HUDI-6927) CDC file clean not work

2023-10-09 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6927: --- Summary: CDC file clean not work Key: HUDI-6927 URL: https://issues.apache.org/jira/browse/HUDI-6927 Project: Apache Hudi Issue Type: Improvement Com

[jira] [Created] (HUDI-6894) ReflectionUtils is not thread safe

2023-09-26 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6894: --- Summary: ReflectionUtils is not thread safe Key: HUDI-6894 URL: https://issues.apache.org/jira/browse/HUDI-6894 Project: Apache Hudi Issue Type: Improvement

[jira] [Created] (HUDI-6848) fix non-unique uid for hudi operators

2023-09-11 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6848: --- Summary: fix non-unique uid for hudi operators Key: HUDI-6848 URL: https://issues.apache.org/jira/browse/HUDI-6848 Project: Apache Hudi Issue Type: Improvement

[jira] [Created] (HUDI-6809) Optimizing the judgment of generating clustering plans

2023-08-30 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6809: --- Summary: Optimizing the judgment of generating clustering plans Key: HUDI-6809 URL: https://issues.apache.org/jira/browse/HUDI-6809 Project: Apache Hudi Issue

[jira] [Created] (HUDI-6808) SkipCompaction Config should not affect the stream read of the cow table

2023-08-30 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6808: --- Summary: SkipCompaction Config should not affect the stream read of the cow table Key: HUDI-6808 URL: https://issues.apache.org/jira/browse/HUDI-6808 Project: Apache H

[jira] [Created] (HUDI-6581) Remove unnecessary validations in function getOldestInstantToRetainForClustering

2023-07-23 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6581: --- Summary: Remove unnecessary validations in function getOldestInstantToRetainForClustering Key: HUDI-6581 URL: https://issues.apache.org/jira/browse/HUDI-6581 Project: A

[jira] [Updated] (HUDI-6580) Duplicate calculation of earliestInstantToRetain when generating a cleanplan

2023-07-23 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-6580: Priority: Minor (was: Major) > Duplicate calculation of earliestInstantToRetain when generating a c

[jira] [Created] (HUDI-6580) Duplicate calculation of earliestInstantToRetain when generating a cleanplan

2023-07-23 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6580: --- Summary: Duplicate calculation of earliestInstantToRetain when generating a cleanplan Key: HUDI-6580 URL: https://issues.apache.org/jira/browse/HUDI-6580 Project: Apach

[jira] [Created] (HUDI-6573) Flink supports disable log append in mor table

2023-07-19 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6573: --- Summary: Flink supports disable log append in mor table Key: HUDI-6573 URL: https://issues.apache.org/jira/browse/HUDI-6573 Project: Apache Hudi Issue Type: Im

[jira] [Created] (HUDI-6424) getOldestInstantToRetainForCompaction needs to add clean validation

2023-06-22 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6424: --- Summary: getOldestInstantToRetainForCompaction needs to add clean validation Key: HUDI-6424 URL: https://issues.apache.org/jira/browse/HUDI-6424 Project: Apache Hudi

[jira] [Created] (HUDI-6423) Incremental cleaning should consider inflight compaction instant

2023-06-22 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6423: --- Summary: Incremental cleaning should consider inflight compaction instant Key: HUDI-6423 URL: https://issues.apache.org/jira/browse/HUDI-6423 Project: Apache Hudi

[jira] [Created] (HUDI-6422) Solve the issue of compiling based on Hadoop 3.1.1

2023-06-22 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6422: --- Summary: Solve the issue of compiling based on Hadoop 3.1.1 Key: HUDI-6422 URL: https://issues.apache.org/jira/browse/HUDI-6422 Project: Apache Hudi Issue Type

[jira] [Created] (HUDI-6361) retain the commit before inflignt compaction to support schema Evolution

2023-06-12 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6361: --- Summary: retain the commit before inflignt compaction to support schema Evolution Key: HUDI-6361 URL: https://issues.apache.org/jira/browse/HUDI-6361 Project: Apache Hu

[jira] [Created] (HUDI-6318) The skip merge config for incremental-read ensures consistency in both stream and batch scenarios

2023-06-05 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6318: --- Summary: The skip merge config for incremental-read ensures consistency in both stream and batch scenarios Key: HUDI-6318 URL: https://issues.apache.org/jira/browse/HUDI-6318

[jira] [Created] (HUDI-6106) Spark offline compaction/Clustering Job will do clean like Flink job

2023-04-19 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6106: --- Summary: Spark offline compaction/Clustering Job will do clean like Flink job Key: HUDI-6106 URL: https://issues.apache.org/jira/browse/HUDI-6106 Project: Apache Hudi

[jira] [Closed] (HUDI-6046) Remove clean operator for comapaction turn off in mor table upsert

2023-04-19 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 closed HUDI-6046. --- Resolution: Fixed > Remove clean operator for comapaction turn off in mor table upsert > -

[jira] [Created] (HUDI-6046) Remove clean operator for comapaction turn off in mor table upsert

2023-04-06 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6046: --- Summary: Remove clean operator for comapaction turn off in mor table upsert Key: HUDI-6046 URL: https://issues.apache.org/jira/browse/HUDI-6046 Project: Apache Hudi

[jira] [Created] (HUDI-6045) Adjust HoodieTableSink for sink operator generation

2023-04-06 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-6045: --- Summary: Adjust HoodieTableSink for sink operator generation Key: HUDI-6045 URL: https://issues.apache.org/jira/browse/HUDI-6045 Project: Apache Hudi Issue Typ

[jira] [Updated] (HUDI-5341) Incremental cleaning should consider later completed clustering instant

2022-12-06 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-5341: Fix Version/s: 0.13.0 Labels: pull-request-available (was: ) > Incremental cleaning shou

[jira] [Created] (HUDI-5341) Incremental cleaning should consider later completed clustering instant

2022-12-06 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-5341: --- Summary: Incremental cleaning should consider later completed clustering instant Key: HUDI-5341 URL: https://issues.apache.org/jira/browse/HUDI-5341 Project: Apache Hud

[jira] [Created] (HUDI-5235) Add parameters-check between CLUSTERING_PLAN_STRATEGY_TARGET_FILE_MAX_BYTES and CLUSTERING_PLAN_STRATEGY_SMALL_FILE_LIMIT

2022-11-17 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-5235: --- Summary: Add parameters-check between CLUSTERING_PLAN_STRATEGY_TARGET_FILE_MAX_BYTES and CLUSTERING_PLAN_STRATEGY_SMALL_FILE_LIMIT Key: HUDI-5235 URL: https://issues.apache.org/jir

[jira] [Created] (HUDI-5234) Streaming read skip clustering instants Configurable

2022-11-17 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-5234: --- Summary: Streaming read skip clustering instants Configurable Key: HUDI-5234 URL: https://issues.apache.org/jira/browse/HUDI-5234 Project: Apache Hudi Issue Ty

[jira] [Reopened] (HUDI-5173) Skip if there is only one file in clusteringGroup

2022-11-07 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 reopened HUDI-5173: - > Skip if there is only one file in clusteringGroup >

[jira] [Resolved] (HUDI-5173) Skip if there is only one file in clusteringGroup

2022-11-07 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 resolved HUDI-5173. - > Skip if there is only one file in clusteringGroup >

[jira] [Updated] (HUDI-5173) Skip if there is only one file in clusteringGroup

2022-11-07 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-5173: Labels: pull-request-available (was: ) > Skip if there is only one file in clusteringGroup > --

[jira] [Updated] (HUDI-5173) Skip if there is only one file in clusteringGroup

2022-11-07 Thread zhuanshenbsj1 (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhuanshenbsj1 updated HUDI-5173: Summary: Skip if there is only one file in clusteringGroup (was: Skip if there is only on file in c

[jira] [Created] (HUDI-5173) Skip if there is only on file in clusteringGroup

2022-11-07 Thread zhuanshenbsj1 (Jira)
zhuanshenbsj1 created HUDI-5173: --- Summary: Skip if there is only on file in clusteringGroup Key: HUDI-5173 URL: https://issues.apache.org/jira/browse/HUDI-5173 Project: Apache Hudi Issue Type: