[jira] [Commented] (DRILL-2425) Wrong results when identifier change cases within the same data file

2015-05-08 Thread Hanifi Gunes (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14535746#comment-14535746
 ] 

Hanifi Gunes commented on DRILL-2425:
-

[~sphillips] this seems a duplicate of DRILL-2036.

 Wrong results when identifier change cases within the same data file
 

 Key: DRILL-2425
 URL: https://issues.apache.org/jira/browse/DRILL-2425
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Steven Phillips
Priority: Critical
 Fix For: 1.0.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 I have the following JSON file that one of the identifier change cases:
 {code}
 [root@qa-node120 md-83]# hadoop fs -cat 
 /drill/testdata/complex_type/json/schema/a.json
 {SOURCE: ebm,msAddressIpv6Array: null}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.0, 
 msAddressIpv6_2:88.222.333.0}}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.1, 
 msAddressIpv6_2:88.222.333.1}}
 {SOURCE: ebm,msAddressIpv6Array: {msaddressipv6_1:99.111.222.2, 
 msAddressIpv6_2:88.222.333.2}}
 {code}
 Query this file through drill gives wrong results:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +-+
 | msAddressIpv6_1 |
 +-+
 | null|
 | null|
 | null|
 | 99.111.222.2|
 +-+
 {code}
 plan:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ explain plan for select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +++
 |text|json|
 +++
 | 00-00Screen
 00-01  Project(msAddressIpv6_1=[ITEM($0, 'msAddressIpv6_1')])
 00-02Scan(groupscan=[EasyGroupScan 
 [selectionRoot=/drill/testdata/complex_type/json/schema/a.json, numFiles=1, 
 columns=[`msAddressIpv6Array`.`msAddressIpv6_1`], 
 files=[maprfs:/drill/testdata/complex_type/json/schema/a.json]]])
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (DRILL-2425) Wrong results when identifier change cases within the same data file

2015-03-16 Thread Chun Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363493#comment-14363493
 ] 

Chun Chang commented on DRILL-2425:
---

Within the same file, this identifier 'msAddressIpv6Array.msAddressIpv6_1' 
changed case to 'msAddressIpv6Array.msaddressipv6_1'. Notice the 
'msAddressIpv6_1' became all lower case.

Yes, it feels like we treated this as schema change.

 Wrong results when identifier change cases within the same data file
 

 Key: DRILL-2425
 URL: https://issues.apache.org/jira/browse/DRILL-2425
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Daniel Barclay (Drill)
Priority: Critical

 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 I have the following JSON file that one of the identifier change cases:
 {code}
 [root@qa-node120 md-83]# hadoop fs -cat 
 /drill/testdata/complex_type/json/schema/a.json
 {SOURCE: ebm,msAddressIpv6Array: null}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.0, 
 msAddressIpv6_2:88.222.333.0}}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.1, 
 msAddressIpv6_2:88.222.333.1}}
 {SOURCE: ebm,msAddressIpv6Array: {msaddressipv6_1:99.111.222.2, 
 msAddressIpv6_2:88.222.333.2}}
 {code}
 Query this file through drill gives wrong results:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +-+
 | msAddressIpv6_1 |
 +-+
 | null|
 | null|
 | null|
 | 99.111.222.2|
 +-+
 {code}
 plan:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ explain plan for select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +++
 |text|json|
 +++
 | 00-00Screen
 00-01  Project(msAddressIpv6_1=[ITEM($0, 'msAddressIpv6_1')])
 00-02Scan(groupscan=[EasyGroupScan 
 [selectionRoot=/drill/testdata/complex_type/json/schema/a.json, numFiles=1, 
 columns=[`msAddressIpv6Array`.`msAddressIpv6_1`], 
 files=[maprfs:/drill/testdata/complex_type/json/schema/a.json]]])
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (DRILL-2425) Wrong results when identifier change cases within the same data file

2015-03-15 Thread Jacques Nadeau (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14362533#comment-14362533
 ] 

Jacques Nadeau commented on DRILL-2425:
---

Which identifier changes case?  This looks like an issue that the data is not 
be returned when the schema is changing, no?

 Wrong results when identifier change cases within the same data file
 

 Key: DRILL-2425
 URL: https://issues.apache.org/jira/browse/DRILL-2425
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Daniel Barclay (Drill)
Priority: Critical

 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 I have the following JSON file that one of the identifier change cases:
 {code}
 [root@qa-node120 md-83]# hadoop fs -cat 
 /drill/testdata/complex_type/json/schema/a.json
 {SOURCE: ebm,msAddressIpv6Array: null}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.0, 
 msAddressIpv6_2:88.222.333.0}}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.1, 
 msAddressIpv6_2:88.222.333.1}}
 {SOURCE: ebm,msAddressIpv6Array: {msaddressipv6_1:99.111.222.2, 
 msAddressIpv6_2:88.222.333.2}}
 {code}
 Query this file through drill gives wrong results:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +-+
 | msAddressIpv6_1 |
 +-+
 | null|
 | null|
 | null|
 | 99.111.222.2|
 +-+
 {code}
 plan:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ explain plan for select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +++
 |text|json|
 +++
 | 00-00Screen
 00-01  Project(msAddressIpv6_1=[ITEM($0, 'msAddressIpv6_1')])
 00-02Scan(groupscan=[EasyGroupScan 
 [selectionRoot=/drill/testdata/complex_type/json/schema/a.json, numFiles=1, 
 columns=[`msAddressIpv6Array`.`msAddressIpv6_1`], 
 files=[maprfs:/drill/testdata/complex_type/json/schema/a.json]]])
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)