Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


stream2000 merged PR #10515:
URL: https://github.com/apache/hudi/pull/10515


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


stream2000 commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1905223014

   @boneanxs Thanks for your review!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1904102413

   
   ## CI report:
   
   * 1e86412ee7476d56cc66e597f8c30157569b06cf Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22093)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1903813522

   
   ## CI report:
   
   * 7e2d36fc73a2143afdda0f6d1d088fed0bee5367 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22068)
 
   * 1e86412ee7476d56cc66e597f8c30157569b06cf Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22093)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1903800451

   
   ## CI report:
   
   * 7e2d36fc73a2143afdda0f6d1d088fed0bee5367 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22068)
 
   * 1e86412ee7476d56cc66e597f8c30157569b06cf UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-22 Thread via GitHub


stream2000 commented on code in PR #10515:
URL: https://github.com/apache/hudi/pull/10515#discussion_r1461688651


##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))

Review Comment:
   > For the second issue, partitionBy + sortWithinPartitions also does shuffle 
only once?
   
   Yes, I was mistaken. I've made the changes that use spark native API and 
remove custom comparators. PTAL~ 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-21 Thread via GitHub


boneanxs commented on code in PR #10515:
URL: https://github.com/apache/hudi/pull/10515#discussion_r1461279717


##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))

Review Comment:
   Don't get the point here.
   For key, you only need bucketId, right?
   I mean we can change `.mapToPair(row -> new Tuple2<>(row, row))` to 
`.mapToPair(row -> new Tuple2<>(row, row))`
   
   For the second issue, partitionBy + sortWithinPartitions also does shuffle 
only once?
   
   I'm trying to avoid customized comparators here like 
`CustomRowColumnsComparator`



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))
+  .repartitionAndSortWithinPartitions(partitioner, new 
CustomRowColumnsComparator())
+  .values(),
+  rows.schema());
+} else if (table.requireSortedRecords() || 
table.getConfig().getBulkInsertSortMode() != BulkInsertSortMode.NONE) {

Review Comment:
   I'm fine with the current behavior. Different sort modes are rarely set from 
user side, and bucket index + partition sort is already one special kind of 
`PARTITION_PATH_REPARTITION`.
   
   I'm ok automatically switching to `PARTITION_SORT` to not annoying users.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-18 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1899877102

   
   ## CI report:
   
   * 7e2d36fc73a2143afdda0f6d1d088fed0bee5367 Azure: 
[SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22068)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-18 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1899729127

   
   ## CI report:
   
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   * 7e2d36fc73a2143afdda0f6d1d088fed0bee5367 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22068)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-18 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1899696732

   
   ## CI report:
   
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   * 7e2d36fc73a2143afdda0f6d1d088fed0bee5367 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-18 Thread via GitHub


stream2000 commented on code in PR #10515:
URL: https://github.com/apache/hudi/pull/10515#discussion_r1458254203


##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/clustering/run/strategy/SparkConsistentBucketClusteringExecutionStrategy.java:
##
@@ -72,7 +72,7 @@ public HoodieData 
performClusteringWithRecordsAsRow(Dataset in
 
 HoodieWriteConfig newConfig = 
HoodieWriteConfig.newBuilder().withProps(props).build();
 
-ConsistentBucketIndexBulkInsertPartitionerWithRows partitioner = new 
ConsistentBucketIndexBulkInsertPartitionerWithRows(getHoodieTable(), 
shouldPreserveHoodieMetadata);
+ConsistentBucketIndexBulkInsertPartitionerWithRows partitioner = new 
ConsistentBucketIndexBulkInsertPartitionerWithRows(getHoodieTable(), 
strategyParams, shouldPreserveHoodieMetadata);

Review Comment:
   sure and done.



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -142,10 +203,11 @@ public void addHashingChildrenNodes(String partition, 
List 0)
+|| table.requireSortedRecords() || 
table.getConfig().getBulkInsertSortMode() != BulkInsertSortMode.NONE;
   }
 
-  private int getBucketId(Row row) {
+  private Integer getBucketId(Row row) {

Review Comment:
   reverted it.



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))
+  .repartitionAndSortWithinPartitions(partitioner, new 
CustomRowColumnsComparator())
+  .values(),
+  rows.schema());
+} else if (table.requireSortedRecords() || 
table.getConfig().getBulkInsertSortMode() != BulkInsertSortMode.NONE) {

Review Comment:
   Yes we are actually implementing `PARTITION_SORT`, I'm just wondering for 
sort modes other than PARTITION_SORT, should we default to a 'no sort' behavior 
similar to `BulkInsertSortMode=NONE`, automatically switch to `PARTITION_SORT`, 
or should we throw an exception to indicate that the sort mode is not 
supported? 
   
   Hope for your opinion, or we can keep the current behavior that switch to 
`PARTITION_SORT` automatically.



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))

Review Comment:
   We will still need the row for comparing and sort it, so keep this line `  
.mapToPair(row -> new Tuple2<>(row, row))` is OK. 
   
   Also comparing with partitionBy + sortWithinPartitions, 
repartitionAndSortWithinPartitions will be more efficient because it performs 
the shuffle operation only once, with both repartitioning and sorting happening 
in the same step. What do you think? 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-18 Thread via GitHub


boneanxs commented on code in PR #10515:
URL: https://github.com/apache/hudi/pull/10515#discussion_r1457269006


##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))

Review Comment:
   getBucketId(row) more suitable? Unless we'll double records size here.
   
   Besides, we can move this map out to reduce duplicates.



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -66,7 +74,7 @@ public class 
ConsistentBucketIndexBulkInsertPartitionerWithRows
 
   private final RowRecordKeyExtractor extractor;
 
-  public ConsistentBucketIndexBulkInsertPartitionerWithRows(HoodieTable table, 
boolean populateMetaFields) {
+  public ConsistentBucketIndexBulkInsertPartitionerWithRows(HoodieTable table, 
Map strategyParams, boolean populateMetaFields) {

Review Comment:
   ditto



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))
+  .repartitionAndSortWithinPartitions(partitioner, new 
CustomRowColumnsComparator())
+  .values(),
+  rows.schema());
+} else if (table.requireSortedRecords() || 
table.getConfig().getBulkInsertSortMode() != BulkInsertSortMode.NONE) {

Review Comment:
   Here we should only support `PARTITION_SORT`?



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -142,10 +203,11 @@ public void addHashingChildrenNodes(String partition, 
List 0)
+|| table.requireSortedRecords() || 
table.getConfig().getBulkInsertSortMode() != BulkInsertSortMode.NONE;
   }
 
-  private int getBucketId(Row row) {
+  private Integer getBucketId(Row row) {

Review Comment:
   Any reason change here?



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/clustering/run/strategy/SparkConsistentBucketClusteringExecutionStrategy.java:
##
@@ -72,7 +72,7 @@ public HoodieData 
performClusteringWithRecordsAsRow(Dataset in
 
 HoodieWriteConfig newConfig = 
HoodieWriteConfig.newBuilder().withProps(props).build();
 
-ConsistentBucketIndexBulkInsertPartitionerWithRows partitioner = new 
ConsistentBucketIndexBulkInsertPartitionerWithRows(getHoodieTable(), 
shouldPreserveHoodieMetadata);
+ConsistentBucketIndexBulkInsertPartitionerWithRows partitioner = new 
ConsistentBucketIndexBulkInsertPartitionerWithRows(getHoodieTable(), 
strategyParams, shouldPreserveHoodieMetadata);

Review Comment:
   nit: Start a new line to enhance readability



##
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/execution/bulkinsert/ConsistentBucketIndexBulkInsertPartitionerWithRows.java:
##
@@ -105,10 +121,55 @@ public int numPartitions() {
   }
 };
 
-return rows.sparkSession().createDataFrame(rowJavaRDD
-.mapToPair(row -> new Tuple2<>(getBucketId(row), row))
-.partitionBy(partitioner)
-.values(), rows.schema());
+if (sortColumnNames != null && sortColumnNames.length > 0) {
+  return rows.sparkSession().createDataFrame(rowJavaRDD
+  .mapToPair(row -> new Tuple2<>(row, row))
+  .repartitionAndSortWithinPartitions(partitioner, new 
CustomRowColumnsComparator())

Review Comment:
   Prefer use spark native api here, e.g.
   
   ```scala
   rows.sparkSession().createDataFrame(rowJavaRDD
 .mapToPair(row -> new Tuple2<>(getBucketId(row), row))
 .partitionBy(partitioner)
 .values(), rows.schema())
 .sortWithinPartitions(...);
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this serv

Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


stream2000 commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1897743885

   > Can we elaborate a little why we need the sort for consistent hashing?
   
   When we read Parquet files, sorting will help skip data using parquet row 
group metadata.  Some users may need this I think. And it align with the 
original implementation of `RDDConsistentBucketBulkInsertPartitioner`. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


danny0405 commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1897721096

   Can we elaborate a little why we need the sort for consistent hashing?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895587560

   
   ## CI report:
   
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895575239

   
   ## CI report:
   
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


stream2000 commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895571431

   @hudi-bot run azure


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895460941

   
   ## CI report:
   
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895298811

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21994)
 
   * 6d7150a24ab2169d780e5a98193144f5a16ad230 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21996)
 
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=22000)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-17 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895288017

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21994)
 
   * 6d7150a24ab2169d780e5a98193144f5a16ad230 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21996)
 
   * 21509bc638de40df8ddaebbb4544c002aabe0bd2 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-16 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1895004422

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21994)
 
   * 6d7150a24ab2169d780e5a98193144f5a16ad230 Azure: 
[PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21996)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-16 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1894997004

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21994)
 
   * 6d7150a24ab2169d780e5a98193144f5a16ad230 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-16 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1894955959

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc Azure: 
[FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21994)
 
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] [HUDI-7302] Consistent hashing row writer support sorting [hudi]

2024-01-16 Thread via GitHub


hudi-bot commented on PR #10515:
URL: https://github.com/apache/hudi/pull/10515#issuecomment-1894950233

   
   ## CI report:
   
   * b4df6b857e79dfb636e3af695d305e8ea50077cc UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org