How is vectorization implemented in Drill?

2016-02-02 Thread jingguo yao
https://drill.apache.org/faq/ says: > Vectorization: Drill takes advantage of the latest SIMD instructions > available in modern processors. I have done a quick browse of the source code. The following locations seem to relate to vectorization. - Project in exec/vector folde

REGEX search Operator

2016-02-02 Thread Nicolas Paris
Hello, I can't find any reference in the documentation to a regex operator. I would like to be able to query this way: SELECT * FROM xxx WHERE text_field regexOperator 'regex_pattern'; Thanks for helping,

RE: REGEX search Operator

2016-02-02 Thread masahide.miura
Hi, ANSI SQL doesn't define a regex operator. Neither does Drill. Isn't the 'LIKE' operator enough? Or the REGEXP_REPLACE/SUBSTR functions may help you. https://drill.apache.org/docs/string-manipulation/ -- Miura, Masahide -Original Message- From: Nicolas Paris [mailto:nipari...@gmail.com] Sen

Re: REGEX search Operator

2016-02-02 Thread Nicolas Paris
> ANSI SQL doesn't define regex operator. > Drill neither. > Drill has SQL function extensions like "REPEATED_CONTAINS" that appear to handle regex. Could a regex operator be provided as a new SQL extension? I guess I could create my own functions in Java, right? Maybe push it into github then

Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Stefán Baxter
Hi, I'm getting this error on master/head using the Avro Reader: "what ever the mind of man can conceive and believe, drill can query" 0: jdbc:drill:zk=local> select * from dfs.asa.`/`; Exception in thread "drill-executor-2" java.lang.NoSuchMethodError: org.apache.drill.exec.store.avro.AvroRecord

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Stefán Baxter
Hi, I can confirm that this same query and the same Avro files work in 1.4, so this is probably a regression. Regards, -Stefan On Tue, Feb 2, 2016 at 1:59 PM, Stefán Baxter wrote: > Hi, > > I'm getting this error on master/head using the Avro Reader: > > "what ever the mind of man can conceive and believe,

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Abdel Hakim Deneche
Hi Stefán, Can you open a JIRA for this, please? Thanks On Tue, Feb 2, 2016 at 6:21 AM, Stefán Baxter wrote: > Hi, > > I can confirm that this same query+avro-files work in 1.4 so this is > probably a regression > > Regards, > -Stefan > > On Tue, Feb 2, 2016 at 1:59 PM, Stefán Baxter > wrot

Re: REGEX search Operator

2016-02-02 Thread John Omernik
I would like to see something like this as well, even if it's an included UDF like REGEX(field, pattern) using Java's library for regex like Hive does. That would be EXTREMELY helpful. On Tue, Feb 2, 2016 at 6:55 AM, Nicolas Paris wrote: > > ANSI SQL doesn't define regex operator. > > Drill
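As a rough illustration of the REGEX(field, pattern) idea in this thread, the matching logic could be sketched in plain Java with java.util.regex. This is only a sketch under assumptions: the class name RegexMatchSketch and the method regexMatch are hypothetical, the code does not use Drill's UDF interface, and returning false for null inputs is a simplification (a SQL function would more likely return NULL).

```java
import java.util.regex.Pattern;

// Hypothetical helper showing the core of a REGEX(field, pattern) function.
// Not part of Drill; a real Drill UDF would wrap logic like this in
// Drill's own function interface.
final class RegexMatchSketch {

    // Returns true if the pattern matches anywhere in the field.
    // Null inputs yield false here (an assumption made for simplicity).
    static boolean regexMatch(String field, String pattern) {
        if (field == null || pattern == null) {
            return false;
        }
        // find() searches for a match anywhere in the input;
        // matches() would require the whole input to match.
        return Pattern.compile(pattern).matcher(field).find();
    }

    public static void main(String[] args) {
        System.out.println(regexMatch("drill-1.4.0", "\\d+\\.\\d+")); // prints true
        System.out.println(regexMatch("drill", "^\\d+$"));            // prints false
    }
}
```

Compiling the pattern on every call keeps the sketch short; a function evaluated once per row would normally compile a constant pattern once and reuse it.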

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Stefán Baxter
https://issues.apache.org/jira/browse/DRILL-4339 On Tue, Feb 2, 2016 at 4:46 PM, Abdel Hakim Deneche wrote: > Hi Stefán, > > Can you open a JIRA for this, please ? > > Thanks > > On Tue, Feb 2, 2016 at 6:21 AM, Stefán Baxter > wrote: > > > Hi, > > > > I can confirm that this same query+avro-fil

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Abdel Hakim Deneche
Thanks On Tue, Feb 2, 2016 at 9:03 AM, Stefán Baxter wrote: > https://issues.apache.org/jira/browse/DRILL-4339 > > On Tue, Feb 2, 2016 at 4:46 PM, Abdel Hakim Deneche > > wrote: > > > Hi Stefán, > > > > Can you open a JIRA for this, please ? > > > > Thanks > > > > On Tue, Feb 2, 2016 at 6:21 AM

Query fails on corrupted parquet column

2016-02-02 Thread François Méthot
Hi, Using drill-embedded 1.4, I encountered this error while doing a query on folders containing thousands of parquet files: Error: SYSTEM ERROR: IOException: FAILED_TO_UNCOMPRESSED(5) Fragment 1:9 After re-running the same query with the log level set to DEBUG, I tracked the files that

Hangout Starting

2016-02-02 Thread Jacques Nadeau
https://plus.google.com/hangouts/_/dremio.com/drillhangout?authuser=0 -- Jacques Nadeau CTO and Co-Founder, Dremio

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Jinfeng Ni
Hi Stefán, Can you post the commit id for your build by running: select * from sys.version; The unit test has a similar query (select * from avro_table) [1], and did not hit this problem. I just want to make sure that we are using the same commit id. Thanks [1] https://github.com/apache/drill/blo

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Stefán Baxter
Hi, As a part of answering this request I cloned the project again and built it from scratch. I did this because I had local, Lucene related, work that I thought could not possibly affect Avro (strictly contained to the Lucene reader). Long story short: this ticket can be closed as invalid and I

Re: Avro reader - Possible regression in 1.5-SNAPSHOT

2016-02-02 Thread Jason Altekruse
I made a comment on the JIRA about a possible explanation for this; it seems like a configuration/classpath issue. I would recommend moving the discussion there because JIRA updates are not always forwarded to the list. On Tue, Feb 2, 2016 at 10:46 AM, Jinfeng Ni wrote: > Hi Stefán, > > Can you

Query Optimizer Question

2016-02-02 Thread Neda Abolhassani
Dear Sir/Madam, I am a new Drill user and I am doing research on data virtualization. I have read the documentation and installed a single-node Drill on a Windows machine. I am wondering what costs Drill's query planning takes into account (e.g. transition, local processing, ...) and how Drill