DuckDB provides "PySpark syntax" on top of fast single node engine:
https://duckdb.org/docs/clients/python/spark_api.html
As I remember, DuckDB is much faster than pandas on a single node and it
already provides a spark-compatible API.
On 2/10/25 1:02 PM, José Müller wrote:
Hi all,
I'm new
+1 (non-binding)
On Wed, 2025-04-09 at 07:22 -0700, Sandy Ryza wrote:
> We started to get some votes on the discussion thread, so I'd like to
> move to a formal vote on adding support for declarative pipelines.
>
> *Discussion thread:
> * https://lists.apache.org/thread/lsv8f829ps0bog41fjoqc45xk7
+1 (non-binding)
On April 9, 2025 7:29:40 AM GMT+02:00, Rishab Joshi
wrote:
>+1 Exciting.
>Rishab Joshi
>
>On Tue, Apr 8, 2025, 10:04 PM Ruifeng Zheng wrote:
>
>> +1
>>
>> On Wed, Apr 9, 2025 at 12:57 PM Denny Lee wrote:
>>
>>> +1 (non-binding)
>>>
>>> On Tue, Apr 8, 2025 at 9:53 PM Yuming Wan
> I may not need anything from spark but if I'll declare a dependency
in Jackson or guava with a different version than spark already use and
package- I might break things...
In that case I would recommend you to use assembly / assemblyShadeRules
for sbt-assembly or maven-shade-plugin for maven an
all the details and explanations, fill the Jira
ticket, etc. But my main question is does GraphX accept patches or
because of deprecation it is not considered anymore?
Thanks in advance!
Best regards,
Sem
-
To unsubscribe e-mai
Apache Spark GraphX
Copyright 2014-2025 The Apache Software Foundation.
This product includes software developed at
The Apache Software Foundation (http://www.apache.org/).
"""
?
Best regards,
Sem
On Tue, 2025-09-09 at 22:04 -0700, Russell Jurney wrote:
> Yeah, GraphFrames ingesting