[ 
https://issues.apache.org/jira/browse/IGNITE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Denis Magda updated IGNITE-3084:
--------------------------------
    Description: 
Apache Spark already benefits from integration with Apache Ignite. The latter 
provides shared RDDs, an implementation of Spark RDD, that help Spark to share 
a state between Spark workers and execute SQL queries much faster. The next 
logical step is to enable support for modern Spark Data Frames API in a similar 
way.

As a contributor, you will be fully in charge of the integration with Spark 
Data Frame API and Apache Ignite.


  was:
We see increasing demand on nice DataFrame support for our Spark integration. 
Need to investigate how could we do that.

Looks like we can investigate how MemSQL do that and take it as a starting 
point.


> Investigate how Ignite can support Spark DataFrame
> --------------------------------------------------
>
>                 Key: IGNITE-3084
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3084
>             Project: Ignite
>          Issue Type: Task
>          Components: Ignite RDD
>    Affects Versions: 1.5.0.final
>            Reporter: Vladimir Ozerov
>            Assignee: Valentin Kulichenko
>              Labels: bigdata, gsoc2017
>             Fix For: 2.0
>
>
> Apache Spark already benefits from integration with Apache Ignite. The latter 
> provides shared RDDs, an implementation of Spark RDD, that help Spark to 
> share a state between Spark workers and execute SQL queries much faster. The 
> next logical step is to enable support for modern Spark Data Frames API in a 
> similar way.
> As a contributor, you will be fully in charge of the integration with Spark 
> Data Frame API and Apache Ignite.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to