+1 for the concept.
Correct me if I'm wrong, but at a high level this is proposing adding
a new user API (which is language agnostic) and the proposal is to
start with something like the Logical Plan, with the addition of being
able to remotely call this API.

+0 on architecture/design as it's not clear from the doc how much
impact this truly has. But that is a problem with SPIPs which I have
voiced my concern about in the past.
I can see how this could be flushed out and keep the overall impact
minimal (vs blow up the world architecture change) since it's not just
a drop in replacement for all existing APIs.  For instance,
conceptually this is just a version of the Spark thriftserver which
uses grpc and passes the new API and internally we add a new API
runPlan(LogicalPlan).  You could potentially also not use the internal
version of the catalyst Logical Plan API but have some conversion
still to allow changes to catalyst internals, not sure if that is
needed but a possibility.
With any API addition it will have to be kept stable and require more
testing and likely more dev work, so weighing that vs usefulness is
the question for me.

Tom

On Mon, Jun 13, 2022 at 1:04 PM Herman van Hovell
<her...@databricks.com.invalid> wrote:
>
> Hi all,
>
> I’d like to start a vote for SPIP: "Spark Connect"
>
> The goal of the SPIP is to introduce a Dataframe based client/server API for 
> Spark
>
> Please also refer to:
>
> - Previous discussion in dev mailing list: [DISCUSS] SPIP: Spark Connect - A 
> client and server interface for Apache Spark.
> - Design doc: Spark Connect - A client and server interface for Apache Spark.
> - JIRA: SPARK-39375
>
> Please vote on the SPIP for the next 72 hours:
>
> [ ] +1: Accept the proposal as an official SPIP
> [ ] +0
> [ ] -1: I don’t think this is a good idea because …
>
> Kind Regards,
> Herman

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Reply via email to