[jira] [Commented] (IGNITE-10314) Spark dataframe will get wrong schema if user executes add/drop column DDL

ASF GitHub Bot (JIRA) Thu, 06 Dec 2018 20:09:42 -0800


    [ 
https://issues.apache.org/jira/browse/IGNITE-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16712313#comment-16712313
 ]


ASF GitHub Bot commented on IGNITE-10314:
-----------------------------------------

GitHub user ldzhjn opened a pull request:

    https://github.com/apache/ignite/pull/5598

    IGNITE-10314 Spark dataframe will get wrong schema if user executes 
add/drop column DDL

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ldzhjn/ignite IGNITE-10314

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/5598.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5598
    
----
commit bcdb69747f9a90fa0f2f9204708c576b72f1de94
Author: rayliu <rayliu@...>
Date:   2018-11-22T06:44:01Z

    Fix bug

commit 8dcc21800a8eb7d1e731c362b4a53860274e85bd
Author: rayliu <rayliu@...>
Date:   2018-11-26T06:39:27Z

    Add tests

commit 1d58fdf068b14ac9847a825dca3464148f9b7596
Author: rayliu <rayliu@...>
Date:   2018-11-26T06:53:16Z

    Optimize imports

commit 336a291dd5a330b5e1e824e807187ef1960bb843
Author: rayliu <rayliu@...>
Date:   2018-11-26T07:01:47Z

    Merge remote-tracking branch 'upstream/master' into IGNITE-10314

commit 304695781ac356ec3f81325c5ce3b244b9f01c8b
Author: rayliu <rayliu@...>
Date:   2018-12-06T09:46:28Z

    Merge remote-tracking branch 'upstream/master' into IGNITE-10314

----


> Spark dataframe will get wrong schema if user executes add/drop column DDL
> --------------------------------------------------------------------------
>
>                 Key: IGNITE-10314
>                 URL: https://issues.apache.org/jira/browse/IGNITE-10314
>             Project: Ignite
>          Issue Type: Bug
>          Components: spark
>    Affects Versions: 2.3, 2.4, 2.5, 2.6, 2.7
>            Reporter: Ray
>            Assignee: Ray
>            Priority: Critical
>             Fix For: 2.8
>
>
> When user performs add/remove column in DDL,  Spark will get the old/wrong 
> schema.
>  
> Analyse 
> Currently Spark data frame API relies on QueryEntity to construct schema, but 
> QueryEntity in QuerySchema is a local copy of the original QueryEntity, so 
> the original QueryEntity is not updated when modification happens.
>  
> Solution
> Get the latest schema using JDBC thin driver's column metadata call, then 
> update fields in QueryEntity.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (IGNITE-10314) Spark dataframe will get wrong schema if user executes add/drop column DDL

Reply via email to