arina-ielchiieva commented on issue #1711: DRILL-7011: Support schema in scan 
framework
URL: https://github.com/apache/drill/pull/1711#issuecomment-477160154
 
 
   @paul-rogers command syntax is the following:
   ```
   CREATE [OR REPLACE] SCHEMA
   [LOAD 'file:///path/to/file']
   [(column_name data_type nullability,...)]
   [FOR TABLE `table_name`]
   [PATH 'file:///schema_file_path/schema_file_name'] 
   [PROPERTIES ('key1'='value1', 'key2'='value2', ...)]
   ```
   `PROPERTIES` should be provided in parenthesis, in a form of key / value 
pairs where value follows after the key and equal sign, each enclosed the 
single quotes. Key / value groups should be separated by commas.
   ```
   create schema
   (col1 int, col2 int)
   for table t
   properties (
   'drill.strict' = 'true',
   'some_other_prop' = 'val')
   ```
   In `.drill.schema` JSON file this would look the following way:
   ```
   {
     "table" : "dfs.tmp.`t`",
     "schema" : {
       "columns" : [
         {
           "name" : "col1",
           "type" : "INT",
           "mode" : "OPTIONAL"
         },
         {
           "name" : "col2",
           "type" : "INT",
           "mode" : "OPTIONAL"
         }
       ],
       "properties" : {
         "drill.strict" : "true",
         "some_other_prop" : "val"
       }
     },
     "version" : 1
   }
   ```
   During deserialization they will be stored in `TupleMetadata` class (use 
`property(key)`, `properties` methods to extract them).
   
   If you want to add column properties, similar syntax will be used, except 
instead of parenthesis you need to use curly braces:
   ```
   create schema
   (col1 int, col2 int properties {'drill.strict' = 'true'})
   for table t
   properties (
   'drill.strict' = 'true',
   'some_other_prop' = 'val')
   ```
   JSON output:
   ```
   {
     "table" : "dfs.tmp.`t`",
     "schema" : {
       "columns" : [
         {
           "name" : "col1",
           "type" : "INT",
           "mode" : "OPTIONAL",
           "properties" : {
             "drill.strict" : "true"
           }
         },
         {
           "name" : "col2",
           "type" : "INT",
           "mode" : "OPTIONAL"
         }
       ],
       "properties" : {
         "drill.is_strict_schema" : "true",
         "some_other_prop" : "val"
       }
     },
     "version" : 1
   }
   ```
   Please let me know if there are any other syntax related questions.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to