[jira] [Created] (HUDI-7936) Add CI check for import ordering

2024-06-27 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7936: - Summary: Add CI check for import ordering Key: HUDI-7936 URL: https://issues.apache.org/jira/browse/HUDI-7936 Project: Apache Hudi Issue Type: Test

[jira] [Created] (HUDI-7935) Update developer setup on website to apply import style rules correctly

2024-06-27 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7935: - Summary: Update developer setup on website to apply import style rules correctly Key: HUDI-7935 URL: https://issues.apache.org/jira/browse/HUDI-7935 Project:

[jira] [Commented] (HUDI-7932) Fix the import ordering

2024-06-27 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17860522#comment-17860522 ] Jonathan Vexler commented on HUDI-7932: --- [https://github.com/apache/hudi/pull/11524] is the fix.

[jira] [Updated] (HUDI-7932) Fix the import ordering

2024-06-27 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7932: -- Status: In Progress (was: Open) > Fix the import ordering > --- > >

[jira] [Resolved] (HUDI-7932) Fix the import ordering

2024-06-27 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler resolved HUDI-7932. --- > Fix the import ordering > --- > > Key: HUDI-7932 >

[jira] [Updated] (HUDI-7932) Fix the import ordering

2024-06-27 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7932: -- Status: Patch Available (was: In Progress) > Fix the import ordering > ---

[jira] [Created] (HUDI-7932) Fix the import ordering

2024-06-26 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7932: - Summary: Fix the import ordering Key: HUDI-7932 URL: https://issues.apache.org/jira/browse/HUDI-7932 Project: Apache Hudi Issue Type: Task

[jira] [Updated] (HUDI-7918) Remove support of Spark 2, 3.0, 3.1, and 3.2

2024-06-26 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7918: -- Status: In Progress (was: Open) > Remove support of Spark 2, 3.0, 3.1, and 3.2 >

[jira] [Created] (HUDI-7873) Remove getStorage method from HoodieReaderContext

2024-06-13 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7873: - Summary: Remove getStorage method from HoodieReaderContext Key: HUDI-7873 URL: https://issues.apache.org/jira/browse/HUDI-7873 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-7876) Use TypedProperties to store the spillable map configs for the FG reader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7876: -- Status: In Progress (was: Open) > Use TypedProperties to store the spillable map configs for

[jira] [Updated] (HUDI-7875) Remove tablePath from HoodieFileGroupReader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7875: -- Status: Patch Available (was: In Progress) > Remove tablePath from HoodieFileGroupReader >

[jira] [Updated] (HUDI-7875) Remove tablePath from HoodieFileGroupReader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7875: -- Status: In Progress (was: Open) > Remove tablePath from HoodieFileGroupReader >

[jira] [Updated] (HUDI-7873) Remove getStorage method from HoodieReaderContext

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7873: -- Status: In Progress (was: Open) > Remove getStorage method from HoodieReaderContext >

[jira] [Updated] (HUDI-7876) Use TypedProperties to store the spillable map configs for the FG reader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7876: -- Status: Patch Available (was: In Progress) > Use TypedProperties to store the spillable map

[jira] [Updated] (HUDI-7873) Remove getStorage method from HoodieReaderContext

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7873: -- Status: Patch Available (was: In Progress) > Remove getStorage method from HoodieReaderContext

[jira] [Closed] (HUDI-7840) Add position merging back to file group reader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7840. - Resolution: Fixed > Add position merging back to file group reader >

[jira] [Closed] (HUDI-7869) Ensure properties are copied when modifying schema

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7869. - Resolution: Fixed > Ensure properties are copied when modifying schema >

[jira] [Updated] (HUDI-7840) Add position merging back to file group reader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7840: -- Status: In Progress (was: Open) > Add position merging back to file group reader >

[jira] [Updated] (HUDI-7840) Add position merging back to file group reader

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7840: -- Status: Patch Available (was: In Progress) > Add position merging back to file group reader >

[jira] [Created] (HUDI-7876) Use TypedProperties to store the spillable map configs for the FG reader

2024-06-13 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7876: - Summary: Use TypedProperties to store the spillable map configs for the FG reader Key: HUDI-7876 URL: https://issues.apache.org/jira/browse/HUDI-7876 Project:

[jira] [Created] (HUDI-7875) Remove tablePath from HoodieFileGroupReader

2024-06-13 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7875: - Summary: Remove tablePath from HoodieFileGroupReader Key: HUDI-7875 URL: https://issues.apache.org/jira/browse/HUDI-7875 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-7871) Remove tableconfig from HoodieFilegroupReader params

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7871: -- Status: Patch Available (was: In Progress) > Remove tableconfig from HoodieFilegroupReader

[jira] [Updated] (HUDI-7871) Remove tableconfig from HoodieFilegroupReader params

2024-06-13 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7871: -- Status: In Progress (was: Open) > Remove tableconfig from HoodieFilegroupReader params >

[jira] [Created] (HUDI-7871) Remove tableconfig from HoodieFilegroupReader params

2024-06-13 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7871: - Summary: Remove tableconfig from HoodieFilegroupReader params Key: HUDI-7871 URL: https://issues.apache.org/jira/browse/HUDI-7871 Project: Apache Hudi

[jira] [Updated] (HUDI-7869) Ensure properties are copied when modifying schema

2024-06-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7869: -- Status: In Progress (was: Open) > Ensure properties are copied when modifying schema >

[jira] [Updated] (HUDI-7869) Ensure properties are copied when modifying schema

2024-06-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7869: -- Status: Patch Available (was: In Progress) > Ensure properties are copied when modifying

[jira] [Created] (HUDI-7869) Ensure properties are copied when modifying schema

2024-06-12 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7869: - Summary: Ensure properties are copied when modifying schema Key: HUDI-7869 URL: https://issues.apache.org/jira/browse/HUDI-7869 Project: Apache Hudi Issue

[jira] [Closed] (HUDI-6792) Integrate FileGroupReader with NewHoodieParquetFileFormat for Spark Incremental Query

2024-06-11 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-6792. - Resolution: Duplicate > Integrate FileGroupReader with NewHoodieParquetFileFormat for Spark >

[jira] [Closed] (HUDI-6787) Hive Integrate FileGroupReader with HoodieMergeOnReadSnapshotReader and RealtimeCompactedRecordReader for Hive

2024-06-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-6787. - Resolution: Fixed > Hive Integrate FileGroupReader with HoodieMergeOnReadSnapshotReader and >

[jira] [Updated] (HUDI-7693) Allow Vectorized Reading for bootstrap in the new fg reader under some conditions

2024-06-07 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7693: -- Description: Vectorized reading can be used for bootstrap if we don't need to do any merging.

[jira] [Created] (HUDI-7840) Add position merging back to file group reader

2024-06-06 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7840: - Summary: Add position merging back to file group reader Key: HUDI-7840 URL: https://issues.apache.org/jira/browse/HUDI-7840 Project: Apache Hudi Issue

[jira] [Created] (HUDI-7838) Use Config hoodie.schema.cache.enable in HoodieBaseFileGroupRecordBuffer and AbstractHoodieLogRecordReader

2024-06-06 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7838: - Summary: Use Config hoodie.schema.cache.enable in HoodieBaseFileGroupRecordBuffer and AbstractHoodieLogRecordReader Key: HUDI-7838 URL:

[jira] [Created] (HUDI-7833) Validate that fg reader works with nested column as record key

2024-06-05 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7833: - Summary: Validate that fg reader works with nested column as record key Key: HUDI-7833 URL: https://issues.apache.org/jira/browse/HUDI-7833 Project: Apache Hudi

[jira] [Created] (HUDI-7813) Hive Style partitioning on a bootstrap table is not configurable

2024-05-29 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7813: - Summary: Hive Style partitioning on a bootstrap table is not configurable Key: HUDI-7813 URL: https://issues.apache.org/jira/browse/HUDI-7813 Project: Apache Hudi

[jira] [Created] (HUDI-7770) Bootstrap read tries to parse partition from the bootstrap base path

2024-05-15 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7770: - Summary: Bootstrap read tries to parse partition from the bootstrap base path Key: HUDI-7770 URL: https://issues.apache.org/jira/browse/HUDI-7770 Project: Apache

[jira] [Commented] (HUDI-7764) DefaultHoodieRecordPayload should be projection compatible

2024-05-15 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846745#comment-17846745 ] Jonathan Vexler commented on HUDI-7764: --- Changing this leads to OOM issues with spark payload >

[jira] [Closed] (HUDI-7764) DefaultHoodieRecordPayload should be projection compatible

2024-05-15 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7764. - Resolution: Not A Bug > DefaultHoodieRecordPayload should be projection compatible >

[jira] [Updated] (HUDI-7764) DefaultHoodieRecordPayload should be projection compatible

2024-05-15 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7764: -- Status: Patch Available (was: In Progress) > DefaultHoodieRecordPayload should be projection

[jira] [Updated] (HUDI-7764) DefaultHoodieRecordPayload should be projection compatible

2024-05-15 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7764: -- Status: In Progress (was: Open) > DefaultHoodieRecordPayload should be projection compatible >

[jira] [Created] (HUDI-7764) DefaultHoodieRecordPayload should be projection compatible

2024-05-15 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7764: - Summary: DefaultHoodieRecordPayload should be projection compatible Key: HUDI-7764 URL: https://issues.apache.org/jira/browse/HUDI-7764 Project: Apache Hudi

[jira] [Created] (HUDI-7760) Row Writer Clustering should use fg reader

2024-05-14 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7760: - Summary: Row Writer Clustering should use fg reader Key: HUDI-7760 URL: https://issues.apache.org/jira/browse/HUDI-7760 Project: Apache Hudi Issue Type:

[jira] [Created] (HUDI-7754) Remove AvroWriteSupport and ParquetReaderIterator from hudi-common

2024-05-13 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7754: - Summary: Remove AvroWriteSupport and ParquetReaderIterator from hudi-common Key: HUDI-7754 URL: https://issues.apache.org/jira/browse/HUDI-7754 Project: Apache

[jira] [Created] (HUDI-7747) In MetaClient remove getBasePathV2() and return StoragePath from getBasePath()

2024-05-11 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7747: - Summary: In MetaClient remove getBasePathV2() and return StoragePath from getBasePath() Key: HUDI-7747 URL: https://issues.apache.org/jira/browse/HUDI-7747

[jira] [Created] (HUDI-7746) HadoopConf loses set values when HoodieStorage.getConf is called

2024-05-10 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7746: - Summary: HadoopConf loses set values when HoodieStorage.getConf is called Key: HUDI-7746 URL: https://issues.apache.org/jira/browse/HUDI-7746 Project: Apache Hudi

[jira] [Created] (HUDI-7744) Create HoodieIOFactory and config to set it

2024-05-10 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7744: - Summary: Create HoodieIOFactory and config to set it Key: HUDI-7744 URL: https://issues.apache.org/jira/browse/HUDI-7744 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-7731) Fix usage of new Configuration() in production code

2024-05-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7731: -- Status: Patch Available (was: In Progress) > Fix usage of new Configuration() in production

[jira] [Updated] (HUDI-7731) Fix usage of new Configuration() in production code

2024-05-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7731: -- Status: In Progress (was: Open) > Fix usage of new Configuration() in production code >

[jira] [Updated] (HUDI-7743) Fix simple mistakes with StoragePath in production code.

2024-05-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7743: -- Status: Patch Available (was: In Progress) > Fix simple mistakes with StoragePath in

[jira] [Updated] (HUDI-7743) Fix simple mistakes with StoragePath in production code.

2024-05-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7743: -- Status: In Progress (was: Open) > Fix simple mistakes with StoragePath in production code. >

[jira] [Created] (HUDI-7743) Fix simple mistakes with StoragePath in production code.

2024-05-10 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7743: - Summary: Fix simple mistakes with StoragePath in production code. Key: HUDI-7743 URL: https://issues.apache.org/jira/browse/HUDI-7743 Project: Apache Hudi

[jira] [Created] (HUDI-7741) Implement methods in HFileUtils extends BaseFileUtils

2024-05-09 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7741: - Summary: Implement methods in HFileUtils extends BaseFileUtils Key: HUDI-7741 URL: https://issues.apache.org/jira/browse/HUDI-7741 Project: Apache Hudi

[jira] [Assigned] (HUDI-7731) Fix usage of new Configuration() in production code

2024-05-09 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler reassigned HUDI-7731: - Assignee: Jonathan Vexler > Fix usage of new Configuration() in production code >

[jira] [Closed] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-09 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7350. - Resolution: Fixed > Introduce HoodieIOFactory to abstract the reader and writer implementation >

[jira] [Created] (HUDI-7733) Reduce number of constructors in HadoopStorageConfig

2024-05-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7733: - Summary: Reduce number of constructors in HadoopStorageConfig Key: HUDI-7733 URL: https://issues.apache.org/jira/browse/HUDI-7733 Project: Apache Hudi

[jira] [Created] (HUDI-7732) Reduce number of constructors in HoodieHadoopStorage

2024-05-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7732: - Summary: Reduce number of constructors in HoodieHadoopStorage Key: HUDI-7732 URL: https://issues.apache.org/jira/browse/HUDI-7732 Project: Apache Hudi

[jira] [Created] (HUDI-7731) Fix usage of new Configuration() in production code

2024-05-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7731: - Summary: Fix usage of new Configuration() in production code Key: HUDI-7731 URL: https://issues.apache.org/jira/browse/HUDI-7731 Project: Apache Hudi

[jira] [Created] (HUDI-7730) HoodieStorage.openSeekable should not have wrapStream param

2024-05-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7730: - Summary: HoodieStorage.openSeekable should not have wrapStream param Key: HUDI-7730 URL: https://issues.apache.org/jira/browse/HUDI-7730 Project: Apache Hudi

[jira] [Updated] (HUDI-7729) Move ParquetUtils to hudi-hadoop-common

2024-05-08 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7729: -- Issue Type: Task (was: Improvement) > Move ParquetUtils to hudi-hadoop-common >

[jira] [Created] (HUDI-7729) Move ParquetUtils to hudi-hadoop-common

2024-05-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7729: - Summary: Move ParquetUtils to hudi-hadoop-common Key: HUDI-7729 URL: https://issues.apache.org/jira/browse/HUDI-7729 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-7725) Restructure HFileBootstrapIndex to separate Hadoop-dependent logic

2024-05-07 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7725: -- Status: Patch Available (was: In Progress) > Restructure HFileBootstrapIndex to separate

[jira] [Updated] (HUDI-7726) Restructure TableSchemaResolver to separate Hadoop logic and use BaseFileUtils

2024-05-07 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7726: -- Status: In Progress (was: Open) > Restructure TableSchemaResolver to separate Hadoop logic and

[jira] [Updated] (HUDI-7725) Restructure HFileBootstrapIndex to separate Hadoop-dependent logic

2024-05-07 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7725: -- Status: In Progress (was: Open) > Restructure HFileBootstrapIndex to separate Hadoop-dependent

[jira] [Created] (HUDI-7721) Fix broken build on master

2024-05-06 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7721: - Summary: Fix broken build on master Key: HUDI-7721 URL: https://issues.apache.org/jira/browse/HUDI-7721 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-06 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7350: -- Status: Patch Available (was: In Progress) > Introduce HoodieIOFactory to abstract the reader

[jira] [Updated] (HUDI-7350) Introduce HoodieIOFactory to abstract the reader and writer implementation

2024-05-06 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7350: -- Status: In Progress (was: Open) > Introduce HoodieIOFactory to abstract the reader and writer

[jira] [Updated] (HUDI-7587) Move hadoop-dependent reader and writer implementation to hudi-hadoop-common module

2024-05-02 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7587: -- Status: Patch Available (was: In Progress) > Move hadoop-dependent reader and writer

[jira] [Created] (HUDI-7704) Unify test client storage classes with duplicate code

2024-05-02 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7704: - Summary: Unify test client storage classes with duplicate code Key: HUDI-7704 URL: https://issues.apache.org/jira/browse/HUDI-7704 Project: Apache Hudi

[jira] [Created] (HUDI-7693) Allow Vectorized Reading for bootstrap in the new fg reader under some conditions

2024-04-30 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7693: - Summary: Allow Vectorized Reading for bootstrap in the new fg reader under some conditions Key: HUDI-7693 URL: https://issues.apache.org/jira/browse/HUDI-7693

[jira] [Updated] (HUDI-7658) Log time taken when meta sync fails in stream sync

2024-04-23 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7658: -- Status: Patch Available (was: In Progress) > Log time taken when meta sync fails in stream

[jira] [Created] (HUDI-7658) Log time taken when meta sync fails in stream sync

2024-04-23 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7658: - Summary: Log time taken when meta sync fails in stream sync Key: HUDI-7658 URL: https://issues.apache.org/jira/browse/HUDI-7658 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-7658) Log time taken when meta sync fails in stream sync

2024-04-23 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7658: -- Status: In Progress (was: Open) > Log time taken when meta sync fails in stream sync >

[jira] [Assigned] (HUDI-7658) Log time taken when meta sync fails in stream sync

2024-04-23 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler reassigned HUDI-7658: - Assignee: Jonathan Vexler > Log time taken when meta sync fails in stream sync >

[jira] [Updated] (HUDI-7269) Fallback to key-based merging if there is no positions in log header

2024-04-17 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7269: -- Status: Patch Available (was: In Progress) > Fallback to key-based merging if there is no

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-16 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17837761#comment-17837761 ] Jonathan Vexler commented on HUDI-7610: --- delete precombine less than insert is consistent though:

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-16 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17837759#comment-17837759 ] Jonathan Vexler commented on HUDI-7610: --- Tested with compaction and that isn't even consistent with

[jira] [Closed] (HUDI-7566) Break-up schema evolution: add schema evolution changes to ported spark

2024-04-16 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7566. - Resolution: Fixed > Break-up schema evolution: add schema evolution changes to ported spark >

[jira] [Comment Edited] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836740#comment-17836740 ] Jonathan Vexler edited comment on HUDI-7610 at 4/12/24 8:40 PM: retest

[jira] [Comment Edited] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836738#comment-17836738 ] Jonathan Vexler edited comment on HUDI-7610 at 4/12/24 8:40 PM: retest

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836740#comment-17836740 ] Jonathan Vexler commented on HUDI-7610: --- retest delete where delete precombine is less than insert

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836738#comment-17836738 ] Jonathan Vexler commented on HUDI-7610: --- retest because default payload changed in the last few days 

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836737#comment-17836737 ] Jonathan Vexler commented on HUDI-7610: --- use hoodie is deleted where delete precombine is less than

[jira] [Comment Edited] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836735#comment-17836735 ] Jonathan Vexler edited comment on HUDI-7610 at 4/12/24 7:58 PM: use hoodie

[jira] [Commented] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836735#comment-17836735 ] Jonathan Vexler commented on HUDI-7610: --- use hoodie is deleted: {code:java} @Test def

[jira] [Created] (HUDI-7612) HoodieSparkRecordMerger does not handle deletes based on the preCombine/ordering field

2024-04-12 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7612: - Summary: HoodieSparkRecordMerger does not handle deletes based on the preCombine/ordering field Key: HUDI-7612 URL: https://issues.apache.org/jira/browse/HUDI-7612

[jira] [Created] (HUDI-7611) DELETE operation does not route preCombine/ordering field values to the delete records

2024-04-12 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7611: - Summary: DELETE operation does not route preCombine/ordering field values to the delete records Key: HUDI-7611 URL: https://issues.apache.org/jira/browse/HUDI-7611

[jira] [Updated] (HUDI-7611) DELETE operation does not route preCombine/ordering field values to the delete records

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7611: -- Fix Version/s: 1.0.0 > DELETE operation does not route preCombine/ordering field values to the

[jira] [Closed] (HUDI-7565) Break-up schema evolution: port spark code to file readers

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler closed HUDI-7565. - Resolution: Fixed > Break-up schema evolution: port spark code to file readers >

[jira] [Updated] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7610: -- Description: Here is a test that can be run on master: {code:java} @Test def

[jira] [Updated] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7610: -- Description: Here is a test that can be run on master: {code:java} @Test def

[jira] [Updated] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7610: -- Description: Here is a test that can be run on master:   {code:java} @Test def

[jira] [Updated] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7610: -- Description: Here is a test that can be run on master:   {code:java} @Test def

[jira] [Created] (HUDI-7610) Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled

2024-04-12 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7610: - Summary: Delete records are inconsistent depending on MOR/COW, Avro/Spark record merger, new filegroup reader enabled/disabled Key: HUDI-7610 URL:

[jira] [Updated] (HUDI-7607) Test with timestamp based key generator

2024-04-11 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7607: -- Fix Version/s: 1.0.0 > Test with timestamp based key generator >

[jira] [Created] (HUDI-7607) Test with timestamp based key generator

2024-04-11 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7607: - Summary: Test with timestamp based key generator Key: HUDI-7607 URL: https://issues.apache.org/jira/browse/HUDI-7607 Project: Apache Hudi Issue Type:

[jira] [Created] (HUDI-7605) Unable to set merger strategy with DataSourceWriteOptions.RECORD_MERGER_STRATEGY

2024-04-11 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7605: - Summary: Unable to set merger strategy with DataSourceWriteOptions.RECORD_MERGER_STRATEGY Key: HUDI-7605 URL: https://issues.apache.org/jira/browse/HUDI-7605

[jira] [Updated] (HUDI-7604) DataSourceWriteOptions.TABLE_NAME() does not work

2024-04-11 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7604: -- Status: Patch Available (was: In Progress) > DataSourceWriteOptions.TABLE_NAME() does not work

[jira] [Updated] (HUDI-7604) DataSourceWriteOptions.TABLE_NAME() does not work

2024-04-11 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Vexler updated HUDI-7604: -- Status: In Progress (was: Open) > DataSourceWriteOptions.TABLE_NAME() does not work >

[jira] [Created] (HUDI-7604) DataSourceWriteOptions.TABLE_NAME() does not work

2024-04-11 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-7604: - Summary: DataSourceWriteOptions.TABLE_NAME() does not work Key: HUDI-7604 URL: https://issues.apache.org/jira/browse/HUDI-7604 Project: Apache Hudi Issue

[jira] [Comment Edited] (HUDI-7269) Fallback to key-based merging if there is no positions in log header

2024-04-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17835888#comment-17835888 ] Jonathan Vexler edited comment on HUDI-7269 at 4/10/24 9:05 PM: Did an

[jira] [Comment Edited] (HUDI-7269) Fallback to key-based merging if there is no positions in log header

2024-04-10 Thread Jonathan Vexler (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17835888#comment-17835888 ] Jonathan Vexler edited comment on HUDI-7269 at 4/10/24 9:03 PM: Did an
