pan3793 commented on code in PR #36: URL: https://github.com/apache/incubator-celeborn-website/pull/36#discussion_r1479160064
########## docs/community/release_notes/release_note_0.4.0.md: ########## @@ -0,0 +1,268 @@ +--- +hide: + - navigation + +license: | + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + http://www.apache.org/licenses/LICENSE-2.0 + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--- + +# Apache Celeborn(Incubating) 0.4.0 Release Notes + +## Highlight + +- Rerun Spark Stage for Celeborn Shuffle Fetch Failure +- Added support for Hadoop MapReduce +- Added support for Flink 1.18 +- Implemented JVM monitoring in Celeborn Worker using JVMQuake +- Added support for SBT build system + +### IMPOROVEMENT + +- [CELEBORN-1052] Introduce dynamic ConfigService at SystemLevel and TenantLevel +- [CELEBORN-977] Support RocksDB as recover DB backend +- [CELEBORN-851] Mention Celeborn 0.4 server requires 0.3 or above clients +- [CELEBORN-808] Remove unnecessary RssShuffleManager in 0.4.0 +- [CELEBORN-980] Asynchronously delete original files to fix `ReusedExchange` bug +- [CELEBORN-1149] Improve replica selection when rack aware +- [CELEBORN-448] Support exclude worker manually +- [CELEBORN-1236][METRICS] Celeborn add metrics about thread pool +- [CELEBORN-1242] Unify celeborn thread name format +- [CELEBORN-1226][FOLLOWUP] Unify creation of thread using ThreadUtils +- [CELEBORN-1226][BRANCH-0.4.0] Unify creation of thread using ThreadUtils (#2245) +- [CELEBORN-1238] deviceCheckThreadPool is only initialized when diskCheck is enabled +- [CELEBORN-1225][FOLLOWUP] Worker should build replicate factory to get client for sending replicate data +- [CELEBORN-1233] Treat unfound PartitionLocation as failed in Controller#commitFiles +- [CELEBORN-1218] Optimize dataPusher to get partitionLocationMap only once +- [CELEBORN-1225] Worker should build replicate factory to get client for sending replicate data +- [CELEBORN-1228] Format the timestamp when recording worker failure +- [CELEBORN-1224] Make TransportMessage#type transient for backward compatibility +- [CELEBORN-1219] takeBuffer() avoid checking source.metricsCollectCriticalEnabled twice +- [CELEBORN-1220][IMPROVEMENT] Make trim logic more robust +- [CELEBORN-1177] OpenStream should register stream via ChunkStreamManager to close stream for ReusedExchange +- [CELEBORN-1217] Improve exception message of loadFileGroup for ShuffleClientImpl +- [CELEBORN-1215] Introduce PausePushDataAndReplicateTime metric to record time for a worker to stop receiving pushData from clients and other workers +- [CELEBORN-1216] Resolve error occurring during distribution creation with profile -Pspark-2.4 +- [CELEBORN-1214] Introduce WriteDataHardSplitCount metric to record HARD_SPLIT partitions of PushData and PushMergedData +- [CELEBORN-891] Remove pipeline feature for sort based writer +- [CELEBORN-1210] Fix potential memory leak in PartitionFilesCleaner +- [CELEBORN-1100] Introduce ChunkStreamCount, OpenStreamFailCount metrics about opening stream of FetchHandler +- [CELEBORN-1211] Add extension for celeborn shuffle handler +- [CELEBORN-1201] Optimize memory usage of cache in partition sorter +- [CELEBORN-1190][FOLLOWUP] Apply error prone patch and suppress some problems +- [CELEBORN-1252] Fix resource consumption of worker does not update when update interval is greater than heartbeat interval +- [CELEBORN-1253] Improve exception message of fetching chunk failure for WorkerPartitionReader +- [CELEBORN-1246][FOLLOWUP] Introduce OpenStreamSuccessCount, FetchChunkSuccessCount and WriteDataSuccessCount metric in Grafana dashboard +- [CELEBORN-1189] Introduce RunningApplicationCount metric and /applications API to record running applications of worker +- [CELEBORN-1187][FOLLOWUP] Unify the size and file count of active shuffle metrics for master and worker +- [CELEBORN-1187] Unify the size and file count of active shuffle metrics for master and worker +- [CELEBORN-1196] Slots allocator will increment disk index repeatedly +- [CELEBORN-1193] ResettableSlidingWindowReservoir should reset `full` to false +- [CELEBORN-1188][TEST] Using JUnit function instead of java assert +- [CELEBORN-1190] Apply error prone patch and suppress some problems +- [CELEBORN-1036][FOLLOWUP] totalInflightReqs should decrement when batchIdSet contains the batchId to avoid duplicate caller of removeBatch +- [CELEBORN-1150] support io encryption for spark +- [CELEBORN-1176] Server side support for Sasl Auth +- [CELEBORN-1180] Changed the version of Sasl Auth related config to 0.5 +- [CELEBORN-1157] Add client-side support for Sasl Authentication in the transport layer +- [CELEBORN-1164] Introduce FetchChunkFailCount metric to expose the count of fetching chunk failed in current worker +- [CELEBORN-1162][BUG] Fix refCnt 0 Exception in FetchHandler#handleChunkFetchRequest +- [CELEBORN-1151] Request slots when register shuffle should filter the workers excluded by application +- [CELEBORN-1152] fix GetShuffleId RPC NPE for empty shuffle +- [CELEBORN-1147] Added a dedicated API for RPC messages which also accepts an RpcResponseCallback instance +- [MINOR] Update log level of ChunkFetchSuccess failed for `FetchHandler#handleChunkFetchRequest` from error to warn +- [CELEBORN-1127] Add JVM classloader metrics +- [CELEBORN-1122] Metrics supports json format +- [CELEBORN-1052][FOLLOWUP] Introduce dynamic ConfigService at SystemLevel and TenantLevel +- [CELEBORN-1145] Separate clientPushBufferMaxSize from CelebornInputStreamImpl +- [CELEBORN-1131] Add Client/Server bootstrap framework to transport layer +- [CELEBORN-1142] clear shuffleIdCache in shutdown method of ShuffleClientImpl +- [CELEBORN-1081][FOLLOWUP] Remove UNKNOWN_DISK and allocate all slots to disk +- [CELEBORN-1125][FOLLOWUP] Add failureaccess shade +- [CELEBORN-1140] Use try-with-resources to avoid FSDataInputStream not being closed +- [CELEBORN-1135] Added tests for the RpcEnv and related classes +- [CELEBORN-856] Add mapreduce integration test +- [CELEBORN-1123] Support fallback to non-columnar shuffle for schema that cannot be obtained from shuffle dependency +- [CELEBORN-1134] Celeborn Flink client should validate whether execution.batch-shuffle-mode is ALL_EXCHANGES_BLOCKING +- [CELEBORN-1106] Ensure data is written into flush buffer before sending message to client +- [CELEBORN-1110][FOLLOWUP] Support celeborn.worker.storage.disk.reserve.ratio to configure worker reserved ratio for each disk +- [CELEBORN-1108][FOLLOWUP] Use rat plugin check Flink 1.18 +- [CELEBORN-1048][FOLLOWUP] MR module compile +- [CELEBORN-1108] Rat plugin check for more modules +- [CELEBORN-247][FOLLOWUP] Add metrics for each user's quota usage of Celeborn Worker +- [CELEBORN-1095] Support configuration of fastest available XXHashFactory instance for checksum of Lz4Decompressor +- [CELEBORN-1087] Remove SimpleStateMachineStorageUtil in master module +- Revert "[CELEBORN-255] Add counter of outstandingFetches, outstanding… +- [CELEBORN-1082] Fixing partition sorter task failures due to duplicate sorting +- [CELEBORN-1070] Add error-prone to pom.xml +- [CELEBORN-1079] Fix use of GuardedBy in client-flink/common +- [MINOR] Remove unnecessary increment index of Master#timeoutDeadWorkers +- [CELEBORN-1066] Skip looping streamimg sets in numShuffleSteams of ChunkStreamManager +- [CELEBORN-1065] Prevent the local variable 'time' declared in one 'switch' branch and used in another +- [CELEBORN-1069] Avoid double brace initialization +- [CELEBORN-1072] Fix misc error prone reports found +- [CELEBORN-1071] Ensure guardedBy is satisfied, fix DCL bugs as well +- [CELEBORN-1068] Fix hashCode computation +- [CELEBORN-1060] Fix the master's http port conflicts with rpc port in celeborn default template file +- [MINOR] Delete redundant Loggers +- [CELEBORN-1047] Remove conf `celeborn.worker.sortPartition.eagerlyRemoveOriginalFiles.enabled` +- [CELEBORN-1044] Enhance the check of parameter array length +- [CELEBORN-772] Convert StreamChunkSlice, ChunkFetchRequest, TransportableError to PB +- [CELEBORN-299] Deprecate `celeborn.worker.storage.baseDir.prefix` and `celeborn.worker.storage.baseDir.number` +- [CELEBORN-829] Improve response message of invalid HTTP request +- [CELEBORN-983] Rename PrometheusMetric configuration +- [CELEBORN-1038] Clean up deprecated api +- [CELEBORN-973] Improve HttpRequestHandler handle HTTP request with base, master and worker +- [CELEBORN-999] MR deps check +- [CELEBORN-1028] Make prometheus path configurable +- [CELEBORN-1018] Fix throw exception when exec create-package.sh script +- [CELEBORN-313] Add rest endpoint to show master group info +- [CELEBORN-985] Change default value of numConnectionsPerPeer to 1 +- [MINOR] Remove unexpected $ symbol +- [CELEBORN-1006] Add support for Apache Hadoop 2.x in Celeborn build +- [MINOR] Fix incorrect default resume ratio in trafficcontrol doc +- [CELEBORN-1000] MR module style check +- [CELEBORN-951][FOLLOWUP] IssueNavigationLink adapts to early Github Issues +- [CELEBORN-978] Improve dependency.sh replacement mode +- [CELEBORN-953] Remove unused-imports in Utils.scala +- [CELEBORN-913] Implement method ShuffleDriverComponents#supportsReliableStorage +- [CELEBORN-929][INFRA] Add dependencies check CI +- [CELEBORN-951] Add IssueNavigationLink and icon for IDEA +- [CELEBORN-937][INFRA] Improve branch suggestion for backporting +- [CELEBORN-939] Change register to unregister in Log +- [CELEBORN-940] Make the number of arguments and placeholders consistent +- [CELEBORN-919][FOLLOWUP] Put map index args after partition index args in CelebornShuffleReader constructor +- [CELEBORN-936] Shuffle master urls to avoid always connect first mast… +- [CELEBORN-931][INFRA] Fix merged pull requests resolution +- [CELEBORN-930][INFRA][FOLLOWUP] Fix environment variable naming +- [CELEBORN-930][INFRA] Eagerly check if the token is valid to align with the behavior of username/password auth +- [CELEBORN-919][FOLLOWUP] Unify the order of map index args and partit… +- [CELEBORN-912] Support build with Spark 3.5 +- [CELEBORN-919] Move Columnar Shuffle code into an individual module +- [CELEBORN-918][INFRA] Auto Assign First-time contributor with Contributors role +- [CELEBORN-879] Add `dev/dependencies.sh` for audit dependencies +- [CELEBORN-910][INFRA] Support JIRA_ACCESS_TOKEN for merging script +- [CELEBORN-907][INFRA] The Jira Python misses our assignee when it searches users again +- [CELEBORN-903][INFRA] Fix list index out of range for JIRA resolution in merge_pr +- [CELEBORN-887] Option --config should take effect in celeborn-daemon.sh script +- [CELEBORN-885][SPARK] Shade RoaringBitmap to avoid dependency conflicts +- [CELEBORN-875][FOLLOWUP] Enhance `DataPushQueueSuiteJ` for thread safety and prevent `NullPointerException` +- [CELEBORN-868][MASTER] Adjust logic of SlotsAllocator#offerSlotsLoadAware fallback to roundrobin +- [CELEBORN-835] Format specifiers should be used instead of string concatenation +- [CELEBORN-833] Remove unused code +- [CELEBORN-815] Remove unused ShuffleClient.readPartition +- [CELEBORN-794] Fix link of CONFIGURATIONS in README +- [CELEBORN-781] Refactor RPC message type name +- [CELEBORN-780] Change SPARK_SHUFFLE_FORCE_FALLBACK_PARTITION_THRESHOLD default to Int.MaxValue since slot's is not a bottleneck +- [CELEBORN-733] Clean unused GetBlacklist & GetBlacklistResponse +- [CELEBORN-655][SPARK] Rename newAppId to appUniqueId + +### BUILD + +- [CELEBORN-1205] Disable Maven local caches to improve SBT building stability +- [CELEBORN-1203] Add `LicenseAndNoticeMergeStrategy` to resolve inner project LICENSE/NOTICE conflict for shaded client packaging +- [CELEBORN-1199][FOLLOWUP] Disabled the plugin `AddMetaInfLicenseFiles` for shaded clients +- [CELEBORN-1199] Add LICENSE and NOTICE files for service related sub-projects +- [CELEBORN-1198] Keep debug info when use SBT build +- [CELEBORN-1194] Add sbt-pgp plugin for publishing signed artifacts +- [CELEBORN-1191][FOLLOWUP] Migrate the release script from Maven to SBT +- [CELEBORN-1191] Migrate the release script from Maven to SBT +- [CELEBORN-1156][BUILD] SBT publish support +- [CELEBORN-1159][BUILD] Update the scope of the `protobuf-java` dependency from `protobuf` to `runtime` +- [CELEBORN-1031] SBT correct the LICENSE and NOTICE for shaded client jars +- [CELEBORN-1002] Add SBT MRClientProject +- [CELEBORN-989] Add support for making distribution package via SBT +- [CELEBORN-913][FOLLOWUP] Recover SBT CI jobs skipped due to last commit +- [CELEBORN-919][FOLLOWUP] Add SBT project `sparkColumnarShuffle` to `sparkGroup` +- [CELEBORN-906][FOLLOWUP] Removal of redundant dependency `log4j-slf4j2-impl` from SBT profile `spark-3.4` +- [CELEBORN-906][BUILD] Aligning dependencies between SBT and Maven +- [CELEBORN-921] Upgrade sbt to 1.9.4 +- [CELEBORN-898][INFRA] Fix java.lang.NoClassDefFoundError: org/hamcrest/SelfDescribing for SBT CI +- [CELEBORN-884][BUILD] Consolidate all dependencies into a global object `Dependencies` +- [CELEBORN-880] Remove sbt compiler plugin `genjavadoc-plugin` +- [CELEBORN-850][INFRA] Add SBT CI +- [CELEBORN-843][BUILD] `sbt` support flink-related module build/test +- [CELEBORN-867][BUILD] Add maven local repository to sbt respositories +- [CELEBORN-836][BUILD] Initial support sbt + +### Documentation + +- [CELEBORN-1207] SBT http repository documentation +- [CELEBORN-1223] Align master and worker metrics of document with MasterSource and WorkerSource +- [CELEBORN-1247] Output config's alternatives to doc +- Add the Celeborn Helm Chart doc Review Comment: for such cases, let's add a [MINOR] prefix ########## docs/community/release_notes/release_note_0.4.0.md: ########## @@ -0,0 +1,269 @@ +--- +hide: + - navigation + +license: | + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + https://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--- + +# Apache Celeborn(Incubating) 0.4.0 Release Notes + +## Highlight + +- Rerun Spark Stage for Celeborn Shuffle Fetch Failure +- Added support for Apache Hadoop MapReduce +- Added support for Apache Flink 1.18 +- Implemented JVM monitoring in Celeborn Worker using JVMQuake +- Added support for SBT build system + +### IMPOROVEMENT + +- [CELEBORN-1052] Introduce dynamic ConfigService at SystemLevel and TenantLevel +- [CELEBORN-977] Support RocksDB as recover DB backend +- [CELEBORN-851] Mention Celeborn 0.4 server requires 0.3 or above clients +- [CELEBORN-808] Remove unnecessary RssShuffleManager in 0.4.0 +- [CELEBORN-980] Asynchronously delete original files to fix `ReusedExchange` bug +- [CELEBORN-1149] Improve replica selection when rack aware +- [CELEBORN-448] Support exclude worker manually +- [CELEBORN-1236][METRICS] Celeborn add metrics about thread pool +- [CELEBORN-1242] Unify celeborn thread name format +- [CELEBORN-1226][FOLLOWUP] Unify creation of thread using ThreadUtils +- [CELEBORN-1226][BRANCH-0.4.0] Unify creation of thread using ThreadUtils (#2245) +- [CELEBORN-1238] deviceCheckThreadPool is only initialized when diskCheck is enabled +- [CELEBORN-1225][FOLLOWUP] Worker should build replicate factory to get client for sending replicate data +- [CELEBORN-1233] Treat unfound PartitionLocation as failed in Controller#commitFiles +- [CELEBORN-1218] Optimize dataPusher to get partitionLocationMap only once +- [CELEBORN-1225] Worker should build replicate factory to get client for sending replicate data +- [CELEBORN-1228] Format the timestamp when recording worker failure +- [CELEBORN-1224] Make TransportMessage#type transient for backward compatibility +- [CELEBORN-1219] takeBuffer() avoid checking source.metricsCollectCriticalEnabled twice +- [CELEBORN-1220][IMPROVEMENT] Make trim logic more robust +- [CELEBORN-1177] OpenStream should register stream via ChunkStreamManager to close stream for ReusedExchange +- [CELEBORN-1217] Improve exception message of loadFileGroup for ShuffleClientImpl +- [CELEBORN-1215] Introduce PausePushDataAndReplicateTime metric to record time for a worker to stop receiving pushData from clients and other workers +- [CELEBORN-1216] Resolve error occurring during distribution creation with profile -Pspark-2.4 +- [CELEBORN-1214] Introduce WriteDataHardSplitCount metric to record HARD_SPLIT partitions of PushData and PushMergedData +- [CELEBORN-891] Remove pipeline feature for sort based writer +- [CELEBORN-1210] Fix potential memory leak in PartitionFilesCleaner +- [CELEBORN-1100] Introduce ChunkStreamCount, OpenStreamFailCount metrics about opening stream of FetchHandler +- [CELEBORN-1211] Add extension for celeborn shuffle handler +- [CELEBORN-1201] Optimize memory usage of cache in partition sorter +- [CELEBORN-1190][FOLLOWUP] Apply error prone patch and suppress some problems +- [CELEBORN-1252] Fix resource consumption of worker does not update when update interval is greater than heartbeat interval +- [CELEBORN-1253] Improve exception message of fetching chunk failure for WorkerPartitionReader +- [CELEBORN-1246][FOLLOWUP] Introduce OpenStreamSuccessCount, FetchChunkSuccessCount and WriteDataSuccessCount metric in Grafana dashboard +- [CELEBORN-1189] Introduce RunningApplicationCount metric and /applications API to record running applications of worker +- [CELEBORN-1187][FOLLOWUP] Unify the size and file count of active shuffle metrics for master and worker +- [CELEBORN-1187] Unify the size and file count of active shuffle metrics for master and worker +- [CELEBORN-1196] Slots allocator will increment disk index repeatedly +- [CELEBORN-1193] ResettableSlidingWindowReservoir should reset `full` to false +- [CELEBORN-1188][TEST] Using JUnit function instead of java assert +- [CELEBORN-1190] Apply error prone patch and suppress some problems +- [CELEBORN-1036][FOLLOWUP] totalInflightReqs should decrement when batchIdSet contains the batchId to avoid duplicate caller of removeBatch +- [CELEBORN-1150] support io encryption for spark +- [CELEBORN-1176] Server side support for Sasl Auth +- [CELEBORN-1180] Changed the version of Sasl Auth related config to 0.5 +- [CELEBORN-1157] Add client-side support for Sasl Authentication in the transport layer +- [CELEBORN-1164] Introduce FetchChunkFailCount metric to expose the count of fetching chunk failed in current worker +- [CELEBORN-1162][BUG] Fix refCnt 0 Exception in FetchHandler#handleChunkFetchRequest +- [CELEBORN-1151] Request slots when register shuffle should filter the workers excluded by application +- [CELEBORN-1152] fix GetShuffleId RPC NPE for empty shuffle +- [CELEBORN-1147] Added a dedicated API for RPC messages which also accepts an RpcResponseCallback instance +- [MINOR] Update log level of ChunkFetchSuccess failed for `FetchHandler#handleChunkFetchRequest` from error to warn +- [CELEBORN-1127] Add JVM classloader metrics +- [CELEBORN-1122] Metrics supports json format +- [CELEBORN-1052][FOLLOWUP] Introduce dynamic ConfigService at SystemLevel and TenantLevel +- [CELEBORN-1145] Separate clientPushBufferMaxSize from CelebornInputStreamImpl +- [CELEBORN-1131] Add Client/Server bootstrap framework to transport layer +- [CELEBORN-1142] clear shuffleIdCache in shutdown method of ShuffleClientImpl +- [CELEBORN-1081][FOLLOWUP] Remove UNKNOWN_DISK and allocate all slots to disk +- [CELEBORN-1125][FOLLOWUP] Add failureaccess shade +- [CELEBORN-1140] Use try-with-resources to avoid FSDataInputStream not being closed +- [CELEBORN-1135] Added tests for the RpcEnv and related classes +- [CELEBORN-856] Add mapreduce integration test +- [CELEBORN-1123] Support fallback to non-columnar shuffle for schema that cannot be obtained from shuffle dependency +- [CELEBORN-1134] Celeborn Flink client should validate whether execution.batch-shuffle-mode is ALL_EXCHANGES_BLOCKING +- [CELEBORN-1106] Ensure data is written into flush buffer before sending message to client +- [CELEBORN-1110][FOLLOWUP] Support celeborn.worker.storage.disk.reserve.ratio to configure worker reserved ratio for each disk +- [CELEBORN-1108][FOLLOWUP] Use rat plugin check Flink 1.18 +- [CELEBORN-1048][FOLLOWUP] MR module compile +- [CELEBORN-1108] Rat plugin check for more modules +- [CELEBORN-247][FOLLOWUP] Add metrics for each user's quota usage of Celeborn Worker +- [CELEBORN-1095] Support configuration of fastest available XXHashFactory instance for checksum of Lz4Decompressor +- [CELEBORN-1087] Remove SimpleStateMachineStorageUtil in master module +- [CELEBORN-1082] Fixing partition sorter task failures due to duplicate sorting +- [CELEBORN-1070] Add error-prone to pom.xml +- [CELEBORN-1079] Fix use of GuardedBy in client-flink/common +- [MINOR] Remove unnecessary increment index of Master#timeoutDeadWorkers +- [CELEBORN-1066] Skip looping streamimg sets in numShuffleSteams of ChunkStreamManager +- [CELEBORN-1065] Prevent the local variable 'time' declared in one 'switch' branch and used in another +- [CELEBORN-1069] Avoid double brace initialization +- [CELEBORN-1072] Fix misc error prone reports found +- [CELEBORN-1071] Ensure guardedBy is satisfied, fix DCL bugs as well +- [CELEBORN-1068] Fix hashCode computation +- [CELEBORN-1060] Fix the master's http port conflicts with rpc port in celeborn default template file +- [MINOR] Delete redundant Loggers +- [CELEBORN-1047] Remove conf `celeborn.worker.sortPartition.eagerlyRemoveOriginalFiles.enabled` +- [CELEBORN-1044] Enhance the check of parameter array length +- [CELEBORN-772] Convert StreamChunkSlice, ChunkFetchRequest, TransportableError to PB +- [CELEBORN-299] Deprecate `celeborn.worker.storage.baseDir.prefix` and `celeborn.worker.storage.baseDir.number` +- [CELEBORN-829] Improve response message of invalid HTTP request +- [CELEBORN-983] Rename PrometheusMetric configuration +- [CELEBORN-1038] Clean up deprecated api +- [CELEBORN-973] Improve HttpRequestHandler handle HTTP request with base, master and worker +- [CELEBORN-999] MR deps check +- [CELEBORN-1028] Make prometheus path configurable +- [CELEBORN-1018] Fix throw exception when exec create-package.sh script +- [CELEBORN-313] Add rest endpoint to show master group info +- [CELEBORN-985] Change default value of numConnectionsPerPeer to 1 +- [MINOR] Remove unexpected $ symbol +- [CELEBORN-1006] Add support for Apache Hadoop 2.x in Celeborn build +- [MINOR] Fix incorrect default resume ratio in trafficcontrol doc +- [CELEBORN-1000] MR module style check +- [CELEBORN-951][FOLLOWUP] IssueNavigationLink adapts to early Github Issues +- [CELEBORN-978] Improve dependency.sh replacement mode +- [CELEBORN-953] Remove unused-imports in Utils.scala +- [CELEBORN-913] Implement method ShuffleDriverComponents#supportsReliableStorage +- [CELEBORN-929][INFRA] Add dependencies check CI +- [CELEBORN-951] Add IssueNavigationLink and icon for IDEA +- [CELEBORN-937][INFRA] Improve branch suggestion for backporting +- [CELEBORN-939] Change register to unregister in Log +- [CELEBORN-940] Make the number of arguments and placeholders consistent +- [CELEBORN-919][FOLLOWUP] Put map index args after partition index args in CelebornShuffleReader constructor +- [CELEBORN-936] Shuffle master urls to avoid always connect first mast… +- [CELEBORN-931][INFRA] Fix merged pull requests resolution +- [CELEBORN-930][INFRA][FOLLOWUP] Fix environment variable naming +- [CELEBORN-930][INFRA] Eagerly check if the token is valid to align with the behavior of username/password auth +- [CELEBORN-919][FOLLOWUP] Unify the order of map index args and partit… +- [CELEBORN-912] Support build with Spark 3.5 +- [CELEBORN-919] Move Columnar Shuffle code into an individual module +- [CELEBORN-918][INFRA] Auto Assign First-time contributor with Contributors role +- [CELEBORN-879] Add `dev/dependencies.sh` for audit dependencies +- [CELEBORN-910][INFRA] Support JIRA_ACCESS_TOKEN for merging script +- [CELEBORN-907][INFRA] The Jira Python misses our assignee when it searches users again +- [CELEBORN-903][INFRA] Fix list index out of range for JIRA resolution in merge_pr +- [CELEBORN-887] Option --config should take effect in celeborn-daemon.sh script +- [CELEBORN-885][SPARK] Shade RoaringBitmap to avoid dependency conflicts +- [CELEBORN-875][FOLLOWUP] Enhance `DataPushQueueSuiteJ` for thread safety and prevent `NullPointerException` +- [CELEBORN-868][MASTER] Adjust logic of SlotsAllocator#offerSlotsLoadAware fallback to roundrobin +- [CELEBORN-835] Format specifiers should be used instead of string concatenation +- [CELEBORN-833] Remove unused code +- [CELEBORN-815] Remove unused ShuffleClient.readPartition +- [CELEBORN-794] Fix link of CONFIGURATIONS in README +- [CELEBORN-781] Refactor RPC message type name +- [CELEBORN-780] Change SPARK_SHUFFLE_FORCE_FALLBACK_PARTITION_THRESHOLD default to Int.MaxValue since slot's is not a bottleneck +- [CELEBORN-733] Clean unused GetBlacklist & GetBlacklistResponse +- [CELEBORN-655][SPARK] Rename newAppId to appUniqueId + +### BUILD + +- [CELEBORN-1205] Disable Maven local caches to improve SBT building stability +- [CELEBORN-1203] Add `LicenseAndNoticeMergeStrategy` to resolve inner project LICENSE/NOTICE conflict for shaded client packaging +- [CELEBORN-1199][FOLLOWUP] Disabled the plugin `AddMetaInfLicenseFiles` for shaded clients +- [CELEBORN-1199] Add LICENSE and NOTICE files for service related sub-projects +- [CELEBORN-1198] Keep debug info when use SBT build +- [CELEBORN-1194] Add sbt-pgp plugin for publishing signed artifacts +- [CELEBORN-1191][FOLLOWUP] Migrate the release script from Maven to SBT +- [CELEBORN-1191] Migrate the release script from Maven to SBT +- [CELEBORN-1156][BUILD] SBT publish support +- [CELEBORN-1159][BUILD] Update the scope of the `protobuf-java` dependency from `protobuf` to `runtime` +- [CELEBORN-1031] SBT correct the LICENSE and NOTICE for shaded client jars +- [CELEBORN-1002] Add SBT MRClientProject +- [CELEBORN-989] Add support for making distribution package via SBT +- [CELEBORN-913][FOLLOWUP] Recover SBT CI jobs skipped due to last commit +- [CELEBORN-919][FOLLOWUP] Add SBT project `sparkColumnarShuffle` to `sparkGroup` +- [CELEBORN-906][FOLLOWUP] Removal of redundant dependency `log4j-slf4j2-impl` from SBT profile `spark-3.4` +- [CELEBORN-906][BUILD] Aligning dependencies between SBT and Maven +- [CELEBORN-921] Upgrade sbt to 1.9.4 +- [CELEBORN-898][INFRA] Fix java.lang.NoClassDefFoundError: org/hamcrest/SelfDescribing for SBT CI +- [CELEBORN-884][BUILD] Consolidate all dependencies into a global object `Dependencies` +- [CELEBORN-880] Remove sbt compiler plugin `genjavadoc-plugin` +- [CELEBORN-850][INFRA] Add SBT CI +- [CELEBORN-843][BUILD] `sbt` support flink-related module build/test +- [CELEBORN-867][BUILD] Add maven local repository to sbt respositories +- [CELEBORN-836][BUILD] Initial support sbt + +### Documentation + +- [CELEBORN-1207] SBT http repository documentation +- [CELEBORN-1223] Align master and worker metrics of document with MasterSource and WorkerSource +- [CELEBORN-1247] Output config's alternatives to doc +- Add the Celeborn Helm Chart doc +- [MINOR] Updated sbt.md documentation to be consistent with description +- [CELEBORN-1104][DOC] Fix SBT documentation incorrect command +- [CELEBORN-1101] Update metrics doc to add necessary configuration steps +- [CELEBORN-1091] Update docs about how to use error-prone plugin in IDEA +- [CELEBORN-1085] Update metrics doc +- [CELEBORN-987][FOLLOWUP][DOC] README#Build and sbt#System Requirements should extend to Scala 2.13 and Spark 3.5 +- [CELEBORN-987][DOC] README#Build should extend to Java8/11/17 +- [CELEBORN-1028][FOLLOWUP][DOCS] Make prometheus path configurable +- [CELEBORN-1010] Update docs about `spark.shuffle.service.enabled` +- [CELEBORN-954] Add documentation about reliable shuffle data storage +- [CELEBORN-997][DOC] Fix Rolling upgrade broken link +- [CELEBORN-991][DOC] Remove incorrect `spark.metrics.conf` +- [CELEBORN-974] Add quick start guide about using MapReduce with Celeborn +- [CELEBORN-948][DOC] fix quick start doc about failed to submit flink wordcount +- [CELEBORN-822][DOC] Introduce a quick start guide for running Apache Flink with Apache Celeborn +- [CELEBORN-927][DOC] Correct celeborn.metrics.conf.*.sink.csv.class configuration example for a CSV sink +- [CELEBORN-927][DOC] Run dev/reformat before you create a new pull request for code style +- [CELEBORN-927][DOC] Guideline to add new RPC messages +- [CELEBORN-909][DOC] Mention `celeborn.worker.directMemoryRatioToResume` default value changed in main/0.4 +- [CELEBORN-893][DOC] Fix Spark patch list text in Readme +- [CELEBORN-877][FOLLOWUP][DOC] Expand 'note' blocks by default in the docs sbt.md +- [CELEBORN-877][DOC] Document on SBT +- [CELEBORN-870][FOLLOWUP][DOC] Document on usage together with Gluten (#1793) +- [CELEBORN-870][DOC] Document on usage together with Gluten +- [CELEBORN-869][FOLLOWUP][DOC] Document on Integrating Celeborn +- [CELEBORN-869][DOC] Document on Integrating Celeborn +- [CELEBORN-864][DOC] Document on blacklist +- [CELEBORN-860][DOC] Document on ShuffleClient +- [CELEBORN-853][DOC] Document on LifecycleManager +- [CELEBORN-834][DOC] Add fault tolerant document +- [CELEBORN-849][DOC] Document on Master +- [CELEBORN-822][FOLLOWUP] Format the example code in the docs/README.md +- [CELEBORN-831][DOC] Add traffic control document +- [CELEBORN-826][DOC] Add storage document +- [CELEBORN-828] Merge Monitoring to Development doc +- [CELEBORN-824][DOC] Add PushData document +- [CELEBORN-823][DOC] Add Celeborn architecture document +- [CELEBORN-822][DOC] Add quick start guide + +### Dependencies + +- [CELEBORN-1184] Update the snakeyaml version from 1.33 to 2.2 +- [CELEBORN-1173] Upgrade netty version from 4.1.93.Final to 4.1.101.Final +- [CELEBORN-1163] Upgrade protobuf from 3.19.2 to 3.21.7 +- [CELEBORN-1169] Bump Spark from 3.4.1 to 3.4.2 +- [CELEBORN-1170] Upgrade snappy-java from 1.1.8.2 to 1.1.10.5 +- upgrade snappy-java from 1.1.8.2 to 1.1.10.5 Review Comment: ditto -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@celeborn.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org