commits
Messages by Thread
(spark) branch master updated: [SPARK-48056][CONNECT][PYTHON] Re-execute plan if a SESSION_NOT_FOUND error is raised and no partial response was received
gurwls223
(spark) branch branch-3.5 updated: [SPARK-45988][SPARK-45989][PYTHON] Fix typehints to handle `list` GenericAlias in Python 3.11+
gurwls223
(spark) branch master updated: [SPARK-48075][SS] Add type checking for PySpark avro functions
gurwls223
(spark) branch master updated: [SPARK-48080][K8S] Promote `*MainAppResource` and `NonJVMResource` to `DeveloperApi`
dongjoon
(spark) branch master updated: [SPARK-48064][SQL] Update error messages for routine related error classes
gurwls223
(spark) branch master updated: [SPARK-47683][PYTHON][BUILD][FOLLOW-UP] Exclude `lib/py4j*zip` in `pyspark-connect` package
gurwls223
(spark) branch master updated (4b16238784e0 -> 2fbfb21896bb)
gurwls223
(spark) branch master updated: [SPARK-48078][K8S] Promote `o.a.s.d.k8s.Constants` to `DeveloperApi`
dongjoon
(spark) branch master updated: [SPARK-46894][PYTHON][FOLLOW-UP] Includes `error-conditions.json` into PyPI package
gurwls223
(spark) branch master updated: [SPARK-48077][K8S] Promote `KubernetesClientUtils` to `DeveloperApi`
dongjoon
(spark) branch master updated (04f3a938895c -> 0fc7c4a29c46)
dongjoon
(spark) branch master updated (e521d3c1f357 -> 04f3a938895c)
dongjoon
(spark) branch master updated: [MINOR] Fix the grammar of some comments on renaming error classes
gurwls223
(spark) branch master updated (fd57c3493af7 -> f86a51921f73)
gurwls223
(spark) branch master updated (69ea082fc69a -> fd57c3493af7)
dongjoon
(spark) branch master updated (5ac803079b30 -> 69ea082fc69a)
dongjoon
(spark) branch master updated (35767bb09fe1 -> 5ac803079b30)
dongjoon
(spark) branch branch-3.5 updated: [SPARK-48016][SQL][3.5] Fix a bug in try_divide function when with decimals
gengliang
(spark) branch master updated: [SPARK-48070][SQL][TESTS] Support `AdaptiveQueryExecSuite.runAdaptiveAndVerifyResult` to skip check results
dongjoon
(spark) branch master updated: [SPARK-46009][SQL][FOLLOWUP] Remove unused PERCENTILE_CONT and PERCENTILE_DISC in g4
dongjoon
(spark) branch branch-3.5 updated: Revert "[SPARK-48016][SQL] Fix a bug in try_divide function when with decimals"
dongjoon
(spark) branch branch-3.4 updated: [SPARK-48068][PYTHON] `mypy` should have `--python-executable` parameter
dongjoon
(spark) branch branch-3.5 updated: [SPARK-48068][PYTHON] `mypy` should have `--python-executable` parameter
dongjoon
(spark) branch master updated: [SPARK-48068][PYTHON] `mypy` should have `--python-executable` parameter
dongjoon
(spark) branch master updated: [SPARK-48069][INFRA] Handle `PEP-632` by checking `ModuleNotFoundError` on `setuptools` in Python 3.12
dongjoon
(spark) branch master updated: [SPARK-48016][SQL][TESTS][FOLLOWUP] Update Java 21 golden file
dongjoon
(spark) branch master updated: [SPARK-48047][SQL] Reduce memory pressure of empty TreeNode tags
dongjoon
(spark) branch master updated (c71d02ab7c80 -> 991763c2cdf8)
gurwls223
(spark) branch master updated: [SPARK-48063][CORE] Enable `spark.stage.ignoreDecommissionFetchFailure` by default
dongjoon
(spark) branch master updated: [SPARK-48060][SS][TESTS] Fix `StreamingQueryHashPartitionVerifySuite` to update golden files correctly
dongjoon
(spark) branch master updated: [SPARK-48057][PYTHON][CONNECT][TESTS] Enable `GroupedApplyInPandasTests.test_grouped_with_empty_partition`
dongjoon
(spark) branch master updated (0329479acb67 -> 9caa6f7f8b8e)
dongjoon
(spark) branch master updated: [SPARK-47359][SQL] Support TRANSLATE function to work with collated strings
wenchen
(spark) branch master updated: [SPARK-47793][SS][PYTHON] Implement SimpleDataSourceStreamReader for python streaming data source
kabhwan
(spark) branch master updated: [SPARK-48003][SQL] Add collation support for hll sketch aggregate
wenchen
(spark) branch master updated (332570f42203 -> 94763438943e)
kabhwan
(spark) branch master updated (12a507464f10 -> 332570f42203)
gurwls223
(spark) branch master updated: [SPARK-47566][SQL] Support SubstringIndex function to work with collated strings
wenchen
(spark) branch master updated: [SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default
dongjoon
(spark) branch master updated: [SPARK-48055][PYTHON][CONNECT][TESTS] Enable `PandasUDFScalarParityTests.{test_vectorized_udf_empty_partition, test_vectorized_udf_struct_with_empty_partition}`
ruifengz
(spark) branch branch-3.4 updated: [SPARK-47129][CONNECT][SQL][3.4] Make ResolveRelations cache connect plan properly
ruifengz
(spark) branch master updated (87b20b166c41 -> e0af82497607)
gurwls223
(spark) branch master updated: [SPARK-47585][SQL] SQL core: Migrate logInfo with variables to structured logging framework
gengliang
(spark) branch branch-3.4 updated: [SPARK-48016][SQL][3.4] Fix a bug in try_divide function when with decimals
gengliang
(spark) branch branch-3.5 updated: [SPARK-47129][CONNECT][SQL][3.5] Make ResolveRelations cache connect plan properly
ruifengz
(spark) branch master updated: [SPARK-48030][SQL] SPJ: cache rowOrdering and structType for InternalRowComparableWrapper
sunchao
(spark) branch master updated: [SPARK-48033][SQL] Fix `RuntimeReplaceable` expressions being used in default columns
wenchen
(spark) branch master updated (3fbcb26d8e99 -> fe05eb8fa3b2)
wenchen
(spark) branch branch-3.5 updated: [SPARK-48016][SQL] Fix a bug in try_divide function when with decimals
gengliang
(spark) branch master updated: [SPARK-48016][SQL] Fix a bug in try_divide function when with decimals
gengliang
(spark) branch master updated: [SPARK-48042][SQL] Use a timestamp formatter with timezone at class level instead of making copies at method level
dongjoon
(spark) branch master updated (f781d153a5e4 -> c35a21e5984f)
dongjoon
(spark) branch master updated (d42c10d9411d -> f781d153a5e4)
dongjoon
(spark) branch master updated (ccb0eb699f7c -> d42c10d9411d)
dongjoon
(spark) branch master updated: [SPARK-48038][K8S] Promote driverServiceName to KubernetesDriverConf
dongjoon
(spark) branch master updated (3f15ad40640c -> d913d1b2662c)
wenchen
(spark) branch master updated: [SPARK-47994][SQL] Fix bug with CASE WHEN column filter push down in SQLServer
yao
(spark) branch master updated: [SPARK-48039][PYTHON][CONNECT] Update the error class for `group.apply`
gurwls223
(spark) branch master updated: [SPARK-47567][SQL] Support LOCATE function to work with collated strings
wenchen
(spark) branch master updated: [SPARK-47939][SQL] Implement a new Analyzer rule to move ParameterizedQuery inside ExplainCommand and DescribeQueryCommand
wenchen
(spark) branch master updated: [SPARK-48002][PYTHON][SS][TESTS] Adds sleep before event testing after query termination
gurwls223
(spark) branch master updated: [MINOR][DOCS] Remove space in the middle of configuration name in Arrow-optimized Python UDF page
dongjoon
(spark) branch master updated (9a42610d5ad8 -> e1445e3f1cf5)
dongjoon
(spark) branch master updated: [SPARK-48029][INFRA] Update the packages name removed in building the spark docker image
dongjoon
(spark) branch branch-3.4 updated: [SPARK-48034][TESTS] NullPointerException in MapStatusesSerDeserBenchmark
yao
(spark) branch branch-3.5 updated: [SPARK-48034][TESTS] NullPointerException in MapStatusesSerDeserBenchmark
yao
(spark) branch master updated: [SPARK-48034][TESTS] NullPointerException in MapStatusesSerDeserBenchmark
yao
(spark) branch master updated (3d62dd72a58f -> 8f1634e833ce)
dongjoon
(spark) branch master updated: [SPARK-47730][K8S] Support `APP_ID` and `EXECUTOR_ID` placeholders in labels
dongjoon
(spark) branch master updated (506b2d5eb8d9 -> 8c446f35dc03)
gurwls223
(spark) branch master updated (023f07d845c3 -> 506b2d5eb8d9)
gurwls223
(spark) branch master updated: [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Remove `pyspark.sql.classic` reference in `pyspark.ml.stat`
gurwls223
(spark) branch master updated: [SPARK-48025][SQL][TESTS] Fix org.apache.spark.sql.execution.benchmark.DateTimeBenchmark
yao
(spark) branch master updated: [SPARK-48024][PYTHON][CONNECT][TESTS] Enable `UDFParityTests.test_udf_timestamp_ntz`
gurwls223
(spark) branch master updated: [SPARK-48002][PYTHON][SS] Add test for observed metrics in PySpark StreamingQueryListener
gurwls223
(spark) branch master updated: [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args
yangjie01
(spark) branch branch-3.4 updated: [SPARK-47927][SQL] Fix nullability attribute in UDF decoder
wenchen
(spark) branch branch-3.5 updated: [SPARK-47927][SQL] Fix nullability attribute in UDF decoder
wenchen
(spark) branch master updated: [SPARK-47927][SQL] Fix nullability attribute in UDF decoder
wenchen
(spark) branch branch-3.5 updated: [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls
wenchen
(spark) branch master updated: [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls
wenchen
(spark) branch master updated: [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions`
dongjoon
(spark) branch master updated: [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2'
yao
(spark) branch master updated: [SPARK-47408][SQL] Fix mathExpressions that use StringType
dongjoon
(spark) branch master updated (2b2a33cc35a8 -> d5712cea88cd)
kabhwan
(spark-kubernetes-operator) branch main updated: [SPARK-48015] Update `build.gradle` to fix deprecation warnings
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-47950] Add Java API Module for Spark Operator
dongjoon
(spark) branch master updated: [SPARK-48011][CORE] Store LogKey name as a value to avoid generating new string instances
dongjoon
(spark) branch master updated: [SPARK-48010][SQL] Avoid repeated calls to conf.resolver in resolveExpression
dongjoon
(spark) branch master updated: [SPARK-47818][CONNECT][FOLLOW-UP] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests
hvanhovell
(spark) branch master updated: [SPARK-47818][CONNECT][FOLLOW-UP] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests
hvanhovell
(spark) branch master updated: [SPARK-48005][PS][CONNECT][TESTS] Enable `DefaultIndexParityTests.test_index_distributed_sequence_cleanup`
dongjoon
(spark) branch master updated: [SPARK-47440][SQL][FOLLOWUP] Reenable predicate pushdown for syntax with boolean comparison in MsSqlServer
yao
(spark) branch master updated: [SPARK-47968][SQL] MsSQLServer: Map datatimeoffset to TimestampType
yao
(spark) branch master updated: [SPARK-47476][SQL] Support REPLACE function to work with collated strings
wenchen
(spark) branch master updated: [SPARK-48007][BUILD][TESTS] Upgrade `mssql.jdbc` to `12.6.1.jre11`
dongjoon
(spark) branch master updated: [SPARK-47351][SQL] Add collation support for StringToMap & Mask string expressions
wenchen
(spark) branch master updated: [SPARK-47350][SQL] Add collation support for SplitPart string expression
wenchen
(spark) branch master updated: [SPARK-48004][SQL] Add WriteFilesExecBase trait for v1 write
yao
(spark) branch master updated (e04ac56e645f -> 95d6c615c081)
gurwls223
(spark) branch master updated: [SPARK-45225][SQL][FOLLOW-UP] XML: Fix nested XSD file path resolution
gurwls223
(spark) branch master updated (2e5825fb32c0 -> 3451e66fe71d)
gurwls223
(spark) branch master updated: [SPARK-47858][PYTHON][FOLLOWUP] Excluding Python magic methods from error context target
gurwls223
(spark) branch master updated: [SPARK-48001][CORE] Remove unused `private implicit def arrayToArrayWritable` from `SparkContext`
yao
(spark) branch master updated: [SPARK-47986][CONNECT][PYTHON] Unable to create a new session when the default session is closed by the server
ruifengz
(spark) branch master updated (033ca3e7dd5b -> b0e03a193531)
kabhwan
(spark) branch master updated: [SPARK-47922][SQL] Implement the try_parse_json expression
wenchen
(spark) branch master updated: [SPARK-47991][SQL][TEST] Arrange the test cases for window frames and window functions
dongjoon
(spark) branch master updated: [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Avoid referencing _to_seq in `pyspark-connect`
dongjoon
(spark) branch master updated: [SPARK-47597][STREAMING] Streaming: Migrate logInfo with variables to structured logging framework
gengliang
(spark) branch master updated (e1d021214c61 -> 994775a624f3)
ptoth
(spark) branch master updated: [SPARK-45425][DOCS][FOLLOWUP] Add a migration guide for TINYINT type mapping change
dongjoon
(spark) branch master updated (de5c512e0179 -> 287d02073929)
dongjoon
(spark) branch master updated: [SPARK-47987][PYTHON][CONNECT][TESTS] Enable `ArrowParityTests.test_createDataFrame_empty_partition`
dongjoon
(spark) branch master updated: [SPARK-47990][BUILD] Upgrade `zstd-jni` to 1.5.6-3
dongjoon
(spark) branch master updated: [SPARK-47985][PYTHON] Simplify functions with `lit`
ruifengz
(spark) branch master updated: [SPARK-47982][BUILD] Update some code style's plugins to latest version
yao
(spark) branch master updated: [SPARK-47984][ML][SQL] Change `MetricsAggregate/V2Aggregator#serialize/deserialize` to call `SparkSerDeUtils#serialize/deserialize`
yao
(spark) branch master updated: [SPARK-47981][BUILD] Upgrade `Arrow` to 16.0.0
yao
(spark) branch master updated: [SPARK-47983][SQL] Demote spark.sql.pyspark.legacy.inferArrayTypeFromFirstElement.enabled to internal
yao
(spark) branch master updated (08caa567fb29 -> 775bc54fcd0d)
gengliang
(spark) branch master updated: [SPARK-47980][SQL][TESTS] Reactivate test 'Empty float/double array columns raise EOFException'
yao
(spark) branch master updated (b4624bf4be28 -> dab4a044b647)
gurwls223
(spark) branch master updated (c6aaa18e6cfd -> b4624bf4be28)
wenchen
(spark) branch master updated: Revert "[SPARK-45302][PYTHON] Remove PID communication between Pythonworkers when no demon is used"
gurwls223
(spark) branch master updated (d23389252a7d -> ea37c860a1a8)
gurwls223
(spark) branch master updated (0fcced63be99 -> d23389252a7d)
yao
(spark) branch master updated: [SPARK-47979][SQL][TESTS] Use Hive tables explicitly for Hive table capability tests
dongjoon
(spark) branch branch-3.5 updated: [SPARK-47921][CONNECT] Fix ExecuteJobTag creation in ExecuteHolder
ueshin
(spark) branch master updated (62dd64a5d13d -> 5a1559a7ef03)
ueshin
(spark) branch master updated: [SPARK-47583][CORE] SQL core: Migrate logError with variables to structured logging framework
gengliang
(spark) branch branch-3.5 updated: [SPARK-47633][SQL][3.5] Include right-side plan output in `LateralJoin#allAttributes` for more consistent canonicalization
dongjoon
(spark) branch master updated (09ed09cb18e7 -> 03d4ea6a707c)
dongjoon
(spark) branch branch-3.5 updated: [SPARK-47819][CONNECT][3.5] Use asynchronous callback for execution cleanup
hvanhovell
(spark) branch master updated: [SPARK-47958][TESTS] Change LocalSchedulerBackend to notify scheduler of executor on start
wenchen
(spark) branch master updated: [SPARK-47965][CORE] Avoid orNull in TypedConfigBuilder and OptionalConfigEntry
gurwls223
(spark) branch master updated: [SPARK-47971][PYTHON][CONNECT][TESTS] Reenable `PandasUDFGroupedAggParityTests.test_grouped_with_empty_partition`
ruifengz
(spark) branch master updated: [SPARK-47764][CORE][SQL] Cleanup shuffle dependencies based on ShuffleCleanupMode
wenchen
(spark) branch master updated: [SPARK-47692][SQL] Fix default StringType meaning in implicit casting
wenchen
(spark) branch master updated: [SPARK-47418][SQL] Add hand-crafted implementations for lowercase unicode-aware contains, startsWith and endsWith and optimize UTF8_BINARY_LCASE
wenchen
(spark) branch master updated (fd695be19d3f -> 6f01982094f6)
gurwls223
(spark) branch master updated: [SPARK-47903][PYTHON][FOLLOW-UP] Removed changes relating to try_parse_json
gurwls223
(spark) branch master updated: [SPARK-47969][PYTHON][TESTS] Make `test_creation_index` deterministic
dongjoon
(spark) branch master updated: [SPARK-47771][PYTHON][DOCS][TESTS][FOLLOWUP] Make `max_by, min_by` doctests deterministic
ruifengz
(spark) branch master updated: [SPARK-47933][PYTHON][CONNECT][FOLLOW-UP] Add a check of `__name__` at `_with_origin`
gurwls223
(spark) branch master updated: Revert "Revert "[SPARK-45302][PYTHON] Remove PID communication between Python workers when no demon is used""
gurwls223
(spark) branch master updated (390fb7429029 -> e8f529bb89a6)
gurwls223
(spark) branch master updated (c88fabfee41d -> 390fb7429029)
gurwls223
(spark) branch master updated: [SPARK-47604][CORE] Resource managers: Migrate logInfo with variables to structured logging framework
gengliang
(spark) branch master updated: [SPARK-47864][FOLLOWUP][PYTHON][DOCS] Fix minor typo: "MLLib" -> "MLlib"
xinrong
(spark) branch master updated: [SPARK-47956][SQL] Sanity check for unresolved LCA reference
dongjoon
(spark) branch master updated: [SPARK-47948][PYTHON] Upgrade the minimum `Pandas` version to 2.0.0
dongjoon
(spark) branch master updated (cf5fc0c720ee -> 9c4f12ca04ac)
dongjoon
(spark) branch master updated: [MINOR][DOCS] Fix type hint of 3 functions
dongjoon
(spark) branch master updated (ca916258b991 -> 33fa77cb4868)
dongjoon
(spark) branch master updated: [SPARK-47953][DOCS] MsSQLServer: Document Mapping Spark SQL Data Types to Microsoft SQL Server
dongjoon
(spark) branch master updated: [SPARK-47873][SQL] Write collated strings to Hive metastore using the regular string type
wenchen
(spark) branch master updated: [SPARK-47352][SQL] Fix Upper, Lower, InitCap collation awareness
wenchen
(spark) branch master updated: [SPARK-47805][SS] Implementing TTL for MapState
kabhwan
(spark) branch master updated (d1298e73a8d5 -> e74221e6525e)
yao
(spark) branch master updated (885e98ecbe64 -> d1298e73a8d5)
yao
(spark) branch master updated: [SPARK-47412][SQL] Add Collation Support for LPad/RPad
wenchen
(spark) branch master updated: [SPARK-47633][SQL] Include right-side plan output in `LateralJoin#allAttributes` for more consistent canonicalization
wenchen
(spark-kubernetes-operator) branch main updated: [SPARK-47943] Add `GitHub Action` CI for Java Build and Test
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-47929] Setup Static Analysis for Operator
dongjoon
(spark) branch master updated (9d715ba49171 -> 876c2cf34a35)
dongjoon
(spark) branch master updated: [SPARK-47938][SQL] MsSQLServer: Cannot find data type BYTE error
dongjoon
(spark) branch master updated (e01ac581f46a -> 3c7905e00d2e)
gengliang
(spark) branch master updated (e4fb7dd98219 -> a97e72cfa7d4)
dongjoon
(spark) branch master updated (b335dd366fb1 -> e4fb7dd98219)
dongjoon
(spark) branch master updated (9e7ee7601d38 -> b335dd366fb1)
gurwls223
(spark) branch master updated: [SPARK-47903][PYTHON] Add support for remaining scalar types in the PySpark Variant library
gurwls223
(spark) branch branch-3.5 updated: [SPARK-47904][SQL][3.5] Preserve case in Avro schema when using enableStableIdentifiersForUnionType
dongjoon
(spark) branch master updated: [SPARK-47942][K8S][DOCS] Drop K8s v1.26 Support
dongjoon
(spark) branch master updated (f2d0cf23018f -> fc0c8553ea05)
dongjoon
(spark) branch master updated (86563169eef8 -> f2d0cf23018f)
gengliang
(spark) branch master updated: [SPARK-47940][BUILD][TESTS] Upgrade `guava` dependency to `33.1.0-jre` in Docker IT
dongjoon
(spark) branch master updated (256fc51508e4 -> 676d47ffe091)
dongjoon
(spark) branch master updated: [SPARK-47411][SQL] Support StringInstr & FindInSet functions to work with collated strings
wenchen
(spark) branch master updated: [SPARK-47928][SQL][TEST] Speed up test "Add jar support Ivy URI in SQL"
yangjie01
(spark) branch master updated (2fb31dea1c53 -> b20356ef55a2)
wenchen
(spark) branch master updated: [SPARK-47930][BUILD] Upgrade RoaringBitmap to 1.0.6
dongjoon
(spark) branch master updated (e1432ef6405a -> 79a1fa4b84dd)
gurwls223
(spark) branch master updated (2d0b56c3eac6 -> e1432ef6405a)
wenchen
(spark) branch master updated (458f70bd5213 -> 2d0b56c3eac6)
yangjie01
(spark) branch master updated (9f34b8eca2f3 -> 458f70bd5213)
wenchen
(spark) branch master updated (2d9d444b122d -> 9f34b8eca2f3)
ruifengz
(spark) branch master updated (393a84fb074a -> 2d9d444b122d)
gurwls223
(spark) branch master updated (adf02d38061b -> 393a84fb074a)
gurwls223
(spark) branch master updated: [SPARK-47925][SQL][TESTS] Mark `BloomFilterAggregateQuerySuite` as `ExtendedSQLTest`
dongjoon
(spark) branch master updated: [SPARK-47923][R] Upgrade the minimum version of `arrow` R package to 10.0.0
dongjoon
(spark) branch master updated (3fcc0f7ac142 -> 2613516110a4)
dongjoon
(spark) branch branch-3.5 updated (afd99d19a2b8 -> 6a358ff7d633)
dongjoon
(spark) branch branch-3.4 updated (bcaf61b975d6 -> e7a2e5a196a8)
dongjoon
(spark) branch master updated: [SPARK-47915][BUILD][K8S] Upgrade `kubernetes-client` to 6.12.1
dongjoon
(spark) branch master updated (2bf43460b923 -> 0d553d06fe2f)
ruifengz
(spark) branch master updated: [SPARK-47833][SQL][CORE] Supply caller stackstrace for checkAndGlobPathIfNecessary AnalysisException
yao
(spark) branch master updated: [SPARK-47901][BUILD] Upgrade common-text 1.12.0
yangjie01
(spark) branch master updated (fe47edece059 -> 8aa2dad46b79)
gengliang
(spark) branch master updated: [SPARK-47883][SQL] Make `CollectTailExec.doExecute` lazy with RowQueue
ruifengz