[ 
https://issues.apache.org/jira/browse/DRILL-13?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13560126#comment-13560126
 ] 

Jacques Nadeau commented on DRILL-13:
-------------------------------------

New re-thoughts...

The primary interface is the Storage Engine Capabilities API.  It should 
describe everything that the particular storage engine supports.  This includes 
whether the storage engine supports serialization, deserialization, what types 
of logical operator capabilities it supports internally.  It also needs to 
include a description of statistics capabilities (e.g. supports approximate row 
keys, average row size, total data size, data distribution statistics, etc) and 
metadata capabilities

Statistics API: Provide the actual statistics information that is utilized 
during query planning.
Metadata API: Provide information about the available sub data sources (tables, 
keyspaces, etc) along with locality information, schema information, type 
information, primary and secondary indices types, partitioning information,  
etc.  Portions of this information are used in query parsing.  Others in query 
planning.  Others portions in Execution planning.
DeserializationAPI - Convert a particular data source into one of our two 
canonical in-memory formats.  (row-based or column-based).  Additionally 
support particular types of logical operation pushdown.
Serialization - Serialize the in-memory format back into the persistent storage 
format.

                
> Storage Engine: Define Java Interface
> -------------------------------------
>
>                 Key: DRILL-13
>                 URL: https://issues.apache.org/jira/browse/DRILL-13
>             Project: Apache Drill
>          Issue Type: Task
>            Reporter: Jacques Nadeau
>
> We're going to need to define a storage engine API.  At a minimum, we'll need 
> to generate a Java one.  We will probably need to also create a CPP one.  
> This task is for the former.  Things that are likely to be included in a the 
> Java interface are: reader (scanner), writer, capabilities interface, schema 
> interface, statistics interface, data layout and ordering

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to