[ 
https://issues.apache.org/jira/browse/DRILL-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13755887#comment-13755887
 ] 

Ted Dunning edited comment on DRILL-198 at 9/2/13 3:30 AM:
-----------------------------------------------------------

The Mondrian data will eventually be useful, but it apparently exceeds what 
Drill can do right now.

The json-cp storage engine seems to work to get access to data, but I can only 
do a tiny bit with it.

The problem seems to be that there are many data values that Drill can't 
interpret or manipulate well.  For instance, in the store.json file, there are 
25 records that look roughly like this:

{code}
{"store_id":6,"store_type":"Gourmet Supermarket","region_id":47,"store_name":"Store 6","store_number":6,"store_street_address":"5495 Mitchell Canyon Road","store_city":"Beverly Hills","store_state":"CA","store_postal_code":"55555","store_country":"USA","store_manager":"Maris","store_phone":"958-555-5002","store_fax":"958-555-5001","first_opened_date":"1981-01-03 00:00:00.0","last_remodel_date":"1991-03-13 00:00:00.0","store_sqft":23688,"grocery_sqft":15337,"frozen_sqft":5011,"meat_sqft":3340,"coffee_bar":true,"video_store":true,"salad_bar":true,"prepared_food":true,"florist":true}
{code}

This query works:

{code}
select _MAP['store_id'] from "store.json";
{code}

as does this:
{code}
select _MAP['store_id'] from "store.json" group by _MAP['store_id'];
{code}

and this:
{code}
select _MAP['coffee_bar'] from "store.json";
{code}

But this does not:

{code}
select _MAP['coffee_bar'] from "store.json" group by _MAP['coffee_bar'];
{code}
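
For comparison, here is a rough sketch of the grouping that the failing query asks for, done outside Drill in plain Python. It assumes store.json holds one JSON object per line. The three sample records and the group_counts helper are made up for illustration and are not taken from the real file or from Drill.

```python
import json
from collections import Counter

# Hypothetical records in the shape of store.json (one JSON object per line).
# Only the two fields used in the queries above are included.
SAMPLE = """{"store_id": 6, "coffee_bar": true}
{"store_id": 7, "coffee_bar": false}
{"store_id": 8, "coffee_bar": true}"""


def group_counts(text, key):
    """Count records per distinct value of `key`, like GROUP BY with COUNT(*)."""
    counts = Counter()
    for line in text.splitlines():
        if line.strip():  # skip blank lines
            counts[json.loads(line).get(key)] += 1
    return dict(counts)


print(group_counts(SAMPLE, "coffee_bar"))
```

Grouping on a boolean field should yield at most two groups, so this is roughly the result I would expect the last query to return once it works.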
                
> Need sample data for tutorials
> ------------------------------
>
>                 Key: DRILL-198
>                 URL: https://issues.apache.org/jira/browse/DRILL-198
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Ted Dunning
>
> Need to check in some small sample data files for use in tutorials.  The 
> region and nation files will suffice for now.
