[ 
https://issues.apache.org/jira/browse/IGNITE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16304299#comment-16304299
 ] 

Nikolay Izhikov commented on IGNITE-3084:
-----------------------------------------

{quote}IgniteSQLRelation#calcPartitions - got it, but what will happen if 
topology changes? Will partitions be recalculated?{quote}

If the Ignite topology changes *after* {{calcPartitions}} has executed but 
*before* the query has fully executed, then the Ignite SQL query for the 
corresponding Spark partition will read data from several Ignite server nodes.

The next query will execute {{calcPartitions}} with the current topology info 
and will work as expected.

Note that a topology change doesn't affect the correctness of the results, 
because we query only one table from Ignite and all aggregates are currently 
calculated by Spark.
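
To illustrate the behaviour described above, here is a minimal, self-contained 
Java sketch. It does not use the Ignite or Spark APIs: the class name, the 
partition-to-owner map, and this simplified {{calcPartitions}} are hypothetical 
stand-ins. It assumes that each Spark partition keeps the list of Ignite 
partitions assigned to it at planning time, so after a topology change the same 
partitions are read, only from more than one node.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Toy model only: not the real IgniteSQLRelation code.
final class TopologyChangeSketch {
    // Hypothetical stand-in for calcPartitions: one Spark partition per node,
    // holding the Ignite partition ids that node owns at planning time.
    static Map<String, List<Integer>> calcPartitions(Map<Integer, String> owners) {
        Map<String, List<Integer>> byNode = new TreeMap<>();
        for (Map.Entry<Integer, String> e : owners.entrySet()) {
            byNode.computeIfAbsent(e.getValue(), k -> new ArrayList<>()).add(e.getKey());
        }
        return byNode;
    }

    // Nodes that must be contacted to read the given Ignite partitions
    // under the ownership map that is current at execution time.
    static Set<String> nodesToRead(List<Integer> igniteParts, Map<Integer, String> owners) {
        Set<String> nodes = new TreeSet<>();
        for (int p : igniteParts) {
            nodes.add(owners.get(p));
        }
        return nodes;
    }

    public static void main(String[] args) {
        // Assumed initial topology: 8 Ignite partitions split across two nodes.
        Map<Integer, String> owners = new TreeMap<>();
        for (int p = 0; p < 8; p++) {
            owners.put(p, p < 4 ? "nodeA" : "nodeB");
        }

        // Planning step: Spark partitions are fixed here.
        Map<String, List<Integer>> sparkParts = calcPartitions(owners);

        // Topology change before execution: nodeC joins and takes two partitions.
        owners.put(3, "nodeC");
        owners.put(7, "nodeC");

        // Each Spark partition still covers the same Ignite partitions,
        // but may now have to read them from several nodes.
        for (Map.Entry<String, List<Integer>> e : sparkParts.entrySet()) {
            System.out.println("planned for " + e.getKey()
                + ", Ignite partitions " + e.getValue()
                + ", now read from " + nodesToRead(e.getValue(), owners));
        }
    }
}
```

In this model the set of Ignite partitions covered by all Spark partitions is 
the same before and after the change, which is why only data locality is 
affected and not the result.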

> Spark Data Frames Support in Apache Ignite
> ------------------------------------------
>
>                 Key: IGNITE-3084
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3084
>             Project: Ignite
>          Issue Type: Task
>          Components: spark
>    Affects Versions: 1.5.0.final
>            Reporter: Vladimir Ozerov
>            Assignee: Nikolay Izhikov
>            Priority: Critical
>              Labels: bigdata, important
>             Fix For: 2.4
>
>
> Apache Spark already benefits from integration with Apache Ignite. The latter 
> provides shared RDDs, an implementation of Spark RDD, that help Spark to 
> share a state between Spark workers and execute SQL queries much faster. The 
> next logical step is to enable support for modern Spark Data Frames API in a 
> similar way.
> As a contributor, you will be fully in charge of the integration of Spark 
> Data Frame API and Apache Ignite.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
