I like where Ted is going with this.  I do also like Paul’s idea of giving 
Drill hints as to the schema.  
— C

> On Aug 15, 2018, at 20:34, Ted Dunning <ted.dunn...@gmail.com> wrote:
> 
> This is a bold statement.
> 
> And there are variants of it that could give users nearly the same
> experience that we have now. For instance, if we cache discovered schemas
> for old files and discover the schema for any new file that we see (and
> cache it) before actually running a query. That gives us pretty much the
> flexibility of schema on read without as much of the burden.
> 
> 
> 
> On Wed, Aug 15, 2018 at 5:02 PM weijie tong <tongweijie...@gmail.com> wrote:
> 
>> Hi all:
>>  Hope the statement not seems too dash to you.
>>  Drill claims be a schema-free distributed SQL engine. It pays lots of
>> work to make the execution engine to support it to support JSON file like
>> storage format. It is easier to make bugs and let the code logic ugly. I
>> wonder do we still insist on this ,since we are designing the metadata
>> system with DRILL-6552.
>>   Traditionally, people is used to design its table schema firstly before
>> firing a SQL query. I don't think this saves people too much time. Other
>> system like Spark is popular not due to lack the schema claiming. I think
>> we should be brave enough to take the right decision whether to still
>> insist on this feature which seems not so important but a burden.
>>    Thanks.
>> 

Reply via email to