[ https://issues.apache.org/jira/browse/IGNITE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16304299#comment-16304299 ]
Nikolay Izhikov commented on IGNITE-3084:
-----------------------------------------

{quote}IgniteSQLRelation#calcPartitions - got it, but what will happen if topology changes? Will partitions be recalculated?{quote}

If the Ignite topology changes *after* {{calcPartitions}} has executed and *before* the query has fully executed, the Ignite SQL query for the corresponding Spark partition will read data from several Ignite server nodes.
The next query will execute {{calcPartitions}} with the current topology info and will work as expected.

Note that a topology change doesn't affect the correctness of the results, because we query only one table from Ignite and all aggregates are currently calculated by Spark.

> Spark Data Frames Support in Apache Ignite
> ------------------------------------------
>
>                 Key: IGNITE-3084
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3084
>             Project: Ignite
>          Issue Type: Task
>          Components: spark
>    Affects Versions: 1.5.0.final
>            Reporter: Vladimir Ozerov
>            Assignee: Nikolay Izhikov
>            Priority: Critical
>              Labels: bigdata, important
>             Fix For: 2.4
>
>
> Apache Spark already benefits from integration with Apache Ignite. The latter
> provides shared RDDs, an implementation of Spark RDD, that help Spark
> share state between Spark workers and execute SQL queries much faster. The
> next logical step is to enable support for the modern Spark Data Frames API in a
> similar way.
> As a contributor, you will be fully in charge of the integration of the Spark
> Data Frame API and Apache Ignite.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)