[jira] [Commented] (CARBONDATA-2544) [MV] Wrong data displayed with Filter

xubo245 (JIRA) Tue, 26 Jun 2018 18:45:57 -0700


    [ 
https://issues.apache.org/jira/browse/CARBONDATA-2544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16524445#comment-16524445
 ]


xubo245 commented on CARBONDATA-2544:
-------------------------------------

It's work fine in cluster：


{code:java}
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 group by 
country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (1.226 seconds)
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 where 
country='USA' group by country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (1.655 seconds)
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 where 
country='USA' group by country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (0.92 seconds)
0: jdbc:hive2://hadoop1:10000> explain select country,sum(salary) from test20 
where country='USA' group by country;
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--+
|                                                                               
                                                                                
                                                                                
                                                                                
                                                                                
             plan                                                               
                                                                                
                                                                                
                                                                                
                                                                                
                             |
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--+
| == CarbonData Profiler ==
Table Scan on datamv20_table
 - total blocklets: 1
 - filter: (test20_country <> null and test20_country = USA)
 - pruned by Main DataMap
    - skipped blocklets: 0
                                                                                
                                                                                
                                                                                
                                                                                
                                                                                
                                                                                
                                                                                
                                                                            |
| == Physical Plan ==
*HashAggregate(keys=[country#223], functions=[sum(sum(salary)#224L)])
+- Exchange hashpartitioning(country#223, 200)
   +- *HashAggregate(keys=[country#223], 
functions=[partial_sum(sum(salary)#224L)])
      +- *HashAggregate(keys=[test20_country#103], 
functions=[sum(sum_salary#104L)])
         +- Exchange hashpartitioning(test20_country#103, 200)
            +- *HashAggregate(keys=[test20_country#103], 
functions=[partial_sum(sum_salary#104L)])
               +- *BatchedScan CarbonDatasourceHadoopRelation [ Database name 
:default, Table name :datamv20_table, Schema 
:Some(StructType(StructField(test20_country,StringType,true), 
StructField(sum_salary,LongType,true))) ] 
default.datamv20_table[test20_country#103,sum_salary#104L] PushedFilters: 
[IsNotNull(test20_country), EqualTo(test20_country,USA)]  |
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--+
2 rows selected (0.318 seconds)

{code}



> [MV] Wrong data displayed with Filter 
> --------------------------------------
>
>                 Key: CARBONDATA-2544
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2544
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: Babulal
>            Assignee: xubo245
>            Priority: Major
>
> spark.sql("drop table if exists test1")
>  spark.sql("create table test1( name string,country string,age int,salary 
> int) stored by 'carbondata' ")
> spark.sql("insert into test1 select 'name1','USA',12,23")
> spark.sql("create datamap datamv2 using 'mv' as select country,sum(salary) 
> from test1 group by country").show()
>  spark.sql("rebuild datamap datamv2")
>  spark.sql("select country,sum(salary) from test1 group by 
> country").show(200,false)
> +--------+----------+
> |country|sum(salary)|
> +--------+----------+
> |USA|23|
> +--------+----------+
>  
> spark.sql("select country,sum(salary) from test1 where country='USA' group by 
> country").show(200,false)
> +--------+----------+
> |country|sum(salary)|
> +--------+----------+
>  +--------+----------+
>  
> This is because, select query formation is wrong , filter value is changed to 
> lowercase 
> 2018-05-27 00:20:16 INFO CarbonSparkSqlParser:54 - Parsing command: select 
> preAGG() as preAgg, gen_subsumer_0.`country`, gen_subsumer_0.`sum(salary)` as 
> `sum(salary)` 
>  from
>  (select test1.`country`, sum(cast(test1.`salary` as bigint)) as 
> `sum(salary)` 
>  from
>  test1
>  group by test1.`country`) gen_subsumer_0 
>  where
>  (gen_subsumer_0.`country` = 'usa')
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (CARBONDATA-2544) [MV] Wrong data displayed with Filter

Reply via email to