Re: [ANNOUNCE] New Drill Committer Maksym Rymar

2022-10-08 Thread Abhishek Girish
Congratulations, Maksym! On Fri, Oct 7, 2022 at 10:22 AM James Turton wrote: > The Project Management Committee (PMC) for Apache Drill is pleased to > announce that we have invited Maksym Rymar to join us as a committer of > the Drill project and he has accepted. Please join me in congratulating

Re: [ANNOUNCE] Apache Drill 1.19.0 Released

2021-06-15 Thread Abhishek Girish
Congratulations all! Thanks to everyone who contributed. Laurent, thanks for successfully managing the release. On Mon, Jun 14, 2021 at 5:58 PM Ted Dunning wrote: > Congratulations to Laurent as a first time release manager! > > Well done. > > > > On Mon, Jun 14, 2021 at 5:56 PM Laurent Goujon

Re: Videos on the front page unavailable

2021-02-04 Thread Abhishek Girish
I have good news on this front. We've located the videos and are working on bringing them back to the site. Thanks for your patience. On Thu, Jan 28, 2021 at 2:29 AM luoc wrote: > Hello, > Due to M&A of MapR, these video may have been privatized, and we are > contacting MapR for good news. > We

Re: Release Notes for 1.18.0

2020-09-22 Thread Abhishek Girish
Hey Vineeth! AD 1.18.0 release is done except for the RN. I plan to get that done shortly. Sorry for those who've been waiting.. Regards, Abhishek On Fri, Sep 18, 2020 at 10:28 PM Vineeth Narayanan wrote: > Hello Drill Team, > > Has version 1.18.0 been "officially" released yet? I see that RC0

[RESULT] [VOTE] Release Apache Drill 1.18.0 - RC0

2020-09-03 Thread Abhishek Girish
Hey all, The vote passes! The very first release candidate for 1.18.0, RC0, passed the voting criteria and is ready to be released. Thanks again, to everyone who voted. Total Votes: 5 4x +1 (binding): Vova, Charles, Paul, Boaz 1x +1 (non-binding): Abhishek No 0s or -1s. I'll start with the next

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-09-02 Thread Abhishek Girish
Thanks to everyone who voted. The vote closes. I'll share results in another email. On Sun, Aug 30, 2020 at 10:14 AM Abhishek Girish wrote: > Hi all, > > I'd like to propose the first release candidate (RC0) of Apache Drill, > version 1.18.0. > > The release can

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-09-01 Thread Abhishek Girish
Thanks Charles! Hope you are recovering well. Please take care. On Tue, Sep 1, 2020 at 5:28 PM Charles Givre wrote: > Hey Abhishek, > I’ll take a look tomorrow. > Best, > — C > > > On Sep 1, 2020, at 8:27 PM, Abhishek Girish wrote: > > > > Thanks Vova! > &g

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-09-01 Thread Abhishek Girish
gt; < > https://issues.apache.org/jira/browse/DRILL-7785?focusedCommentId=17187629&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17187629 > > > > on this ticket. > > > > Kind regards, > > Volodymyr Vysotskyi > > > > > > On Mon, Aug 31, 2020 at 4:26

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-08-30 Thread Abhishek Girish
I don't think it's a blocker, > but if we find anything with RC0... let's make sure we get it in. > > -- C > > > > > On Aug 30, 2020, at 9:14 PM, Abhishek Girish wrote: > > > > > > Hey Charles, > > > > > > I would have liked

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-08-30 Thread Abhishek Girish
; > > > > > On Aug 30, 2020, at 8:40 PM, Abhishek Girish wrote: > > > > > > Advanced tests from [5] are also complete. All 7500+ tests passed, except > > > for a few relating to known resource issues (drillbit connectivity / OOM > > > /...). Plus a few

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-08-30 Thread Abhishek Girish
Advanced tests from [5] are also complete. All 7500+ tests passed, except for a few relating to known resource issues (drillbit connectivity / OOM /...). Plus a few with the same symptoms as DRILL-7785. On Sun, Aug 30, 2020 at 2:17 PM Abhishek Girish wrote: > Wanted to share an update on s

Re: [VOTE] Release Apache Drill 1.18.0 - RC0

2020-08-30 Thread Abhishek Girish
lease. Of course, all unit tests (part of the AD repo) - for both default and 'mapr' profiles are also successful. [5] https://github.com/mapr/drill-test-framework On Sun, Aug 30, 2020 at 10:14 AM Abhishek Girish wrote: > Hi all, > > I'd like to propose the first relea

[VOTE] Release Apache Drill 1.18.0 - RC0

2020-08-30 Thread Abhishek Girish
Hi all, I'd like to propose the first release candidate (RC0) of Apache Drill, version 1.18.0. The release candidate covers a total of 164 resolved JIRAs [1]. Thanks to everyone who contributed to this release. The tarball artifacts are hosted at [2] and the maven artifacts are hosted at [3]. T

Re: scaling drill in an openshift (K8s) cluster

2020-03-24 Thread Abhishek Girish
I've added support for auto-scaling and I've tested that it works well. Please see: https://github.com/Agirish/drill-helm-charts#autoscaling-drill-clusters And I have a script to test this: https://github.com/Agirish/drill-helm-charts/blob/master/scripts/runCPULoadTest.sh In case of cloud deploym

Re: Problem running Drill in a Docker container in OpenShift

2020-02-06 Thread Abhishek Girish
ill needs access to > data, which requires some kind of distributed storage. This has long been a > K8s weakness. But, it is, of course, a MapR strength. > > Please let us know if you need help with the open source efforts. > > Thanks, > - Paul > > > > On M

Re: Problem running Drill in a Docker container in OpenShift

2020-02-06 Thread Abhishek Girish
t us know if you need help with the open source efforts. > > > > Thanks, > > - Paul > > > > > > > >On Monday, February 3, 2020, 3:13:28 AM PST, Abhishek Girish < > agir...@apache.org> wrote: > > > > Hey Ron, > > > &g

Re: Problem running Drill in a Docker container in OpenShift

2020-02-03 Thread Abhishek Girish
Hey Ron, As a part of MapR (now HPE), I've created a native operator for Apache Drill and this works on multiple variants of Kubernetes including OpenShift. With this, we introduce a new Kind called "DrillCluster" via a Custom Resource Definition (CRD) and a Custom Controller (logic to manage this

Re: Official Apache Drill Docker Images

2020-01-09 Thread Abhishek Girish
This is great. Thanks Vova! On Thu, Jan 9, 2020 at 9:40 AM Charles Givre wrote: > Great work! > > > On Jan 9, 2020, at 12:39 PM, Arina Yelchiyeva < > arina.yelchiy...@gmail.com> wrote: > > > > Nice ;) > > > >> On Jan 9, 2020, at 7:33 PM, Volodymyr Vysotskyi > wrote: > >> > >> Hi all, > >> > >>

Re: Question about foreman restart

2020-01-07 Thread Abhishek Girish
Thanks Nitin. As mentioned on Slack, Drill would not resubmit the queries. If any drillbit being used in query execution goes down, the query in question is cancelled. On Tue, Jan 7, 2020 at 10:51 AM Nitin Pawar wrote: > I have created DRILL-7517 < > https://issues.apache.org/jira/browse/DRILL-

Re: [ANNOUNCE] Apache Drill 1.17.0 Released

2019-12-26 Thread Abhishek Girish
Congratulations, everyone! On Thu, Dec 26, 2019 at 10:20 AM Volodymyr Vysotskyi wrote: > On behalf of the Apache Drill community, I am happy to announce the release > of Apache Drill 1.17.0. > > Drill is an Apache open-source SQL query engine for Big Data exploration. > Drill is designed from th

Re: Slack Channel invitation Link

2019-12-02 Thread Abhishek Girish
Hey Rameshwar, Can you please try with the below link: https://join.slack.com/t/apache-drill/shared_invite/enQtNTQ4MjM1MDA3MzQ2LTJlYmUxMTRkMmUwYmQ2NTllYmFmMjU4MDk0NjYwZjBmYjg0MDZmOTE2ZDg0ZjBlYmI3Yjc4Y2I2NTQyNGVlZTc I just tried and it looks active. Maybe the README needs to be updated. I'll take

Re: [ANNOUNCE] New PMC Chair of Apache Drill

2019-08-22 Thread Abhishek Girish
Congratulations, Charles!! Looking forward to what's next. Thanks a lot Arina, for your leadership in the last year. I think we may have added more committers and PMC members in your tenure, than ever before. The community is growing well, and I'm so glad to be a part of it. On Thu, Aug 22, 2019

Re: Documentation feedback

2019-08-13 Thread Abhishek Girish
Hey Anastasiia, Thanks a lot for the feedback and for including the solution! Regards, Abhishek On Tue, Aug 13, 2019 at 3:23 AM Arina Yelchiyeva wrote: > Feel free to submit PRs to update site documentation ( > https://github.com/apache/drill/tree/gh-pages < > https://github.com/apache/drill/t

Apache Drill 1.16.0 Docker image now available

2019-05-06 Thread Abhishek Girish
Hey folks, Updated docker image for AD 1.16.0 is now available. To pull image, use the following command: docker pull drill/apache-drill:latest or docker pull drill/apache-drill:1.16.0 For instructions on how to bring up the Drill container, please refer to [1]. [1] http://drill.apache.org/

Re: [ANNOUNCE] New PMC member: Sorabh Hamirwasia

2019-04-05 Thread Abhishek Girish
Congratulations, Sorabh!! On Fri, Apr 5, 2019 at 9:07 AM Timothy Farkas wrote: > Congrats! > > Tim > > On Fri, Apr 5, 2019 at 9:06 AM Arina Ielchiieva wrote: > > > I am pleased to announce that Drill PMC invited Sorabh Hamirwasia to > > the PMC and > > he has accepted the invitation. > > > > Co

Re: Bonjour,

2019-03-21 Thread Abhishek Girish
Hello Justin, The main goal of this project is to query existing datasets - which may be in any of the supported formats [1]. There is currently no support for INSERT INTO. However, Drill does support CREATE TABLE AS [2]. Please give the docs a read and let us know if that helps. Regards, Abhishe

Re: Query Compilation error with 80+ CASE statements

2019-02-27 Thread Abhishek Girish
Rahul, Can you please share plans for both queries (one with fewer which succeeds and one which fails). Also the verbose error. On Tue, Feb 26, 2019 at 11:33 PM Rahul Raj wrote: > Some more update to the mail above: > > The query above has a UDF 'checkNull' used . The UDF code is placed inside

Re: Anouncing Powered-By-Drill page

2019-02-27 Thread Abhishek Girish
This is great! Thanks for making it happen. On Wed, Feb 27, 2019 at 4:12 PM Kunal Khatua wrote: > Hi everyone > > It gives me great pleasure in announcing the launch of the "Powered By > Drill" page on the official Apache Drill website : > https://drill.apache.org/poweredBy > > As a start, the p

Re: HDFS storage prefix returning Error: VALIDATION ERROR: null

2019-02-12 Thread Abhishek Girish
2019-01-25/part-4-ae91cbe2-5410-4bec-ad68-10a053fb2b68.json` > it returns, > > Error: VALIDATION ERROR: Schema [[hdfs]] is not valid with respect to > either root schema or current default schema. > > > On Tuesday, February 12, 2019, 5:10:57 PM PST, Abhishek Girish <

Re: HDFS storage prefix returning Error: VALIDATION ERROR: null

2019-02-12 Thread Abhishek Girish
(s.spark_event,11), > '[^0-9A-Za-z]"', ''),'(".*)','') = 'SparkListenerEnvironmentUpdate' or > REGEXP_REPLACE(REGEXP_REPLACE(substr(s.spark_event,11), '[^0-9A-Za-z]"', > ''),'(".*)',''

Re: HDFS storage prefix returning Error: VALIDATION ERROR: null

2019-02-12 Thread Abhishek Girish
Hey Krishnanand, As mentioned by other folks in earlier threads, can you make sure to include ALL RELEVANT details in your emails? That includes the query, storage plugin configuration, data format, sample data / description of the data, the full log for the query failure? It's necessary if one ne

Re: Slack workspace for Drill discussions

2019-02-11 Thread Abhishek Girish
it to the Drill website in the Community page. On Mon, Feb 11, 2019 at 3:43 PM Charles Givre wrote: > > > Sent from my iPhone > > > On Feb 11, 2019, at 18:30, Abhishek Girish wrote: > > > > Hey folks, > > > > There have been some questions o

Slack workspace for Drill discussions

2019-02-11 Thread Abhishek Girish
Hey folks, There have been some questions on a Slack workspace for Drill - it's popular for multiple open source projects and I think we should encourage active participation as well. The previous Slack workspace for Apache Drill has been idle for ~3 years and we've been unable to invite new users

Re: January Apache Drill board report

2019-01-31 Thread Abhishek Girish
+1. Looks good! On Thu, Jan 31, 2019 at 9:15 AM Vitalii Diravka wrote: > +1 > > Kind regards > Vitalii > > > On Thu, Jan 31, 2019 at 6:18 PM Aman Sinha wrote: > > > Thanks for putting this together, Arina. > > The Drill Developer Day and Meetup were separate events, so you can split > > them up

Re: Drill on YARN Questions

2019-01-11 Thread Abhishek Girish
Hello Teddy, I don't recollect a restart option for the drill-on-yarn.sh script. I've always used a combination of stop and start, like Paul mentions. Could you please try if that works and get back to us? We could certainly have a minor enhancement to support restart - until then i'll request Bri

Apache Drill 1.15.0 Docker image now available

2019-01-07 Thread Abhishek Girish
Hey folks, Updated docker image for AD 1.15.0 is now available. To pull image, use the following command: docker pull drill/apache-drill:latest or docker pull drill/apache-drill:1.15.0 For instructions on how to bring up the Drill container, please refer to [1]. [1] http://drill.apache.org/

Re: [ANNOUNCE] Apache Drill 1.15.0 released

2018-12-31 Thread Abhishek Girish
Congratulations everyone, on yet another great release! And Happy New Year 🎊 On Mon, Dec 31, 2018 at 5:47 AM Vitalii Diravka wrote: > On behalf of the Apache Drill community, I am happy to announce the release > of Apache Drill 1.15.0. > > Drill is an Apache open-source SQL query engine for Big

Re: [ANNOUNCE] New Committer: Salim Achouche

2018-12-18 Thread Abhishek Girish
Congratulations, Salim! On Mon, Dec 17, 2018 at 2:40 AM Arina Ielchiieva wrote: > The Project Management Committee (PMC) for Apache Drill has invited Salim > Achouche to become a committer, and we are pleased to announce that he has > accepted. > > Salim Achouche [1] started contributing to the

Re: [ANNOUNCE] New Committer: Karthikeyan Manivannan

2018-12-07 Thread Abhishek Girish
Congratulations Karthik! On Fri, Dec 7, 2018 at 11:11 AM Arina Ielchiieva wrote: > The Project Management Committee (PMC) for Apache Drill has invited > Karthikeyan > Manivannan to become a committer, and we are pleased to announce that he > has accepted. > > Karthik started contributing to the

Re: [ANNOUNCE] New Committer: Hanumath Rao Maduri

2018-11-01 Thread Abhishek Girish
Congratulations, Hanu! On Thu, Nov 1, 2018 at 10:56 AM Khurram Faraaz wrote: > Congratulations Hanu! > > On Thu, Nov 1, 2018 at 10:14 AM Gautam Parai wrote: > > > Congratulations Hanumath! Well deserved :) > > > > Gautam > > > > On Thu, Nov 1, 2018 at 9:44 AM AnilKumar B > wrote: > > > > > Con

Re: unable to connect to mongodb

2018-10-10 Thread Abhishek Girish
Hey Bhavik, The error indicates a authentication failure - can you double check your login info? Also can you try specifying the IP address of the Mongo instance instead of localhost - in case you have multiple Drillbits? I haven't used Drill with Mongo - so hopefully someone who has can chime in

Re: KVGEN - cann't select the drill on hbase with json value

2018-10-10 Thread Abhishek Girish
Hey liuwenkai, Can you please share a sample row from the table? Also the output of Drill query with (1) select field, (2) select convert_from(field) ? On Fri, Sep 28, 2018 at 10:15 AM kvnew <272301...@qq.com> wrote: > Hi, > > > About drill on hbase, when i search the result of kvgen ,but cann'

Re: Kafka Plugin in Drill 1.14

2018-10-10 Thread Abhishek Girish
Hey Divya, Can you please share if Arina's suggestion worked? It will be helpful for others who encounter a similar problem. On Mon, Oct 8, 2018 at 12:37 AM Arina Yelchiyeva wrote: > Did you run older Drill versions on Windows before? Bootstap plugins are > loaded when there is no stored plugin

Re: Syntax Issue Between Drill Explorer and Spotfire

2018-10-10 Thread Abhishek Girish
Hey Kevin, It looks like the storageplugin.workspace.table_path are specified with single quotes (' ') instead of back-ticks (` `). Is that a email formatting issue, or the actual syntax with your query? If latter, can you try correcting it? Also a note in general: you do not need those back-ticks

Re: Failed to fetch parquet metadata after 15000ms

2018-10-10 Thread Abhishek Girish
Hey Karthik, This is a bug and there are a few JIRAs to track this and one of those is DRILL-5788 . It's likely because of a hard-coded default for the timeout which is sometimes not sufficient. Can you please update the JIRA with your findings, wh

Re: Failed to create schema tree when running Drill View Query

2018-10-10 Thread Abhishek Girish
Hey Divha, That's usually seen when the underlying FS cannot recognize the session user when creating the schema tree. For example if impersonation is enabled and no user is passed, Drill tries to use an "anonymous" user. I've seen that in case of MapR-FS, that it cannot use such a user to proceed

Re: [ANNOUNCE] New Committer: Chunhui Shi

2018-09-28 Thread Abhishek Girish
Congrats Chunhui! On Fri, Sep 28, 2018 at 7:39 AM Vova Vysotskyi wrote: > Congratulations! Well deserved! > > Kind regards, > Volodymyr Vysotskyi > > > On Fri, Sep 28, 2018 at 12:17 PM Arina Ielchiieva > wrote: > > > The Project Management Committee (PMC) for Apache Drill has invited > Chunhui >

Re: Apache Drill meetup session ideas

2018-09-23 Thread Abhishek Girish
Hey Divya, I'm curious to know where the meetup session is and any related details you could share on agenda, target audience and more. We are planning to have one too, so would be good to hear more about the one you are helping to organize. Regards, Abhishek On Sun, Sep 23, 2018 at 7:53 PM Divy

Re: [ANNOUNCE] New Committer: Weijie Tong

2018-08-31 Thread Abhishek Girish
Congrats and thanks, Weijie! On Fri, Aug 31, 2018 at 8:51 AM Arina Ielchiieva wrote: > The Project Management Committee (PMC) for Apache Drill has invited Weijie > Tong to become a committer, and we are pleased to announce that he has > accepted. > > Weijie Tong has become a very active contribut

Re: distributed drill on local file system

2018-08-16 Thread Abhishek Girish
I'd also like to add that with the DFS storage plugin configured with local file system, one will have to make sure all nodes with Drillbits have the same files (under the same directory structure). If a given query on a dataset spawns a distributed plan (multiple fragments), then it may fail if on

Re: Requesting ETA on drill/apache-drill-centos:1.14.0 docker image

2018-08-13 Thread Abhishek Girish
Hello Vedant, Thanks for trying out Drill on Docker! For the official Docker image for Apache Drill 1.14.0, please try out instructions from [1] (note that it refers to a different image: drill/apache-drill:1.14.0). However, it only supports embedded mode as of this release. Since you were using

Re: Ability to set/change default schema for all sessions.

2018-08-11 Thread Abhishek Girish
Not entirely sure of what you are asking for, but here is a suggestion - when you connect through sqlline, you have the ability to set the session schema as follows [1]: $DRILL_HOME/bin/sqlline -u "jdbc:drill:zk=localhost:2181;schema=dfs.tmp" -n user1 This was, each new sqlline session you start

Re: [ANNOUNCE] Apache Drill Release 1.14.0

2018-08-05 Thread Abhishek Girish
Congratulations, everyone! And Boaz, thanks so much for coordinating the release. Folks, please try out 1.14 - it's our best release yet! On Sat, Aug 4, 2018 at 11:35 PM Boaz Ben-Zvi wrote: > On behalf of the Apache Drill community, I am happy to announce the > release of Apache Drill 1.14.0. >

Re: Drill Configuration Requirements To Query Data in Tera Bytes

2018-07-30 Thread Abhishek Girish
Hey Tilak, We don't have any official sizing guidelines - for planning a Drill cluster. A lot of it depends on the type of queries being executed (simple look-ups vs complex joins), data format (columnar data such as Parquet shows best performance), and system load (running a single query on nodes

Re: Apache Drill on Kubernetes

2018-07-26 Thread Abhishek Girish
are approaching a proper multitenant capable > > Drill cluster. This is exciting work! > > > > On Thu, Jul 26, 2018, 11:40 AM Abhishek Girish > wrote: > > > > > Hey everyone, > > > > > > Like John and Saurabh mentioned, yes this

Re: Apache Drill on Kubernetes

2018-07-26 Thread Abhishek Girish
Hey everyone, Like John and Saurabh mentioned, yes this is possible. We've been using Drill on Kubernetes for a while now. I have a draft of my work (Dockerfiles + YAML definitions) available in [1]. Drill should come up successfully in distributed mode (multiple Drillbits) under K8S. Please give

Re: Best Practice to check Drillbit status(Cluster mode)

2018-07-16 Thread Abhishek Girish
are running for > months . > I know I can check the logs and kill that orphaned , which what I did in my > case, but I would like to have notification for down drillbit. > > > Thanks, > Divya > > On Fri, 13 Jul 2018 at 04:15, Abhishek Girish wrote: > > > Hey Div

Re: Best Practice to check Drillbit status(Cluster mode)

2018-07-12 Thread Abhishek Girish
Hey Divya, It would depend on the situation, afaik. The sys.drillbits table contains a list of all running drillibits. If one of the Drillbit has issues and cannot stay connected to the cluster, I would assume it would be unregistered and may not show up in the output of sys.drillbits. If it's an

Re: help drill down in production

2018-07-12 Thread Abhishek Girish
Hey Jose, Can you share more details oh your setup? For Drill usage in production, stand-alone / distributed modes are recommended. Embedded mode is only a good way to get started with Drill. Drillbits are started with you launch Drill in embedded mode and stops when you exit from Sqlline. -Abhis

Re: CTAS AccessControlException

2018-07-02 Thread Abhishek Girish
> > Thanks, > Divya > > > > On Mon, 2 Jul 2018 at 11:37, Abhishek Girish wrote: > > > Hey Divya, > > > > I have a suspicion: There is chance you have a distributed Drill > > environment and not all of the nodes have the same user (with same UID + &g

Re: CTAS AccessControlException

2018-07-01 Thread Abhishek Girish
Hey Divya, I have a suspicion: There is chance you have a distributed Drill environment and not all of the nodes have the same user (with same UID + GID). And your dataset isn't large like you mentioned, so not all Drillbits are always involved in the query execution. So you might intermittently s

Re: Drill with Docker?

2018-06-24 Thread Abhishek Girish
Hey Paul, I have Docker images for Drill published here: https://hub.docker.com/u/drill/. Instructions here: https://issues.apache.org/jira/browse/DRILL-6346?focusedCommentId=16448703&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16448703 Building with CentOS bas

Re: [DISCUSS] case insensitive storage plugin and workspaces names

2018-06-13 Thread Abhishek Girish
h appropriate error message > and they would have to rename those storage plugins. > > Kind regards, > Arina > > > On Tue, Jun 12, 2018 at 8:45 PM Abhishek Girish > wrote: > > > Paul, I think this proposal was specific to storage plugin and workspace > > *n

Re: [DISCUSS] case insensitive storage plugin and workspaces names

2018-06-12 Thread Abhishek Girish
Paul, I think this proposal was specific to storage plugin and workspace *names*. And not for the whole of Drill. I agree it makes sense to have these names case insensitive, to improve user experience. The only impact to current users I can think of is if someone created two storage plugins dfs a

Re: Which perform better JSON or convert JSON to parquet format ?

2018-06-10 Thread Abhishek Girish
I would suggest converting the JSON files to parquet for better performance. JSON supports a more free form data model, so that's a trade-off you need to consider, in my opinion. On Sun, Jun 10, 2018 at 8:08 PM Divya Gehlot wrote: > Hi, > I am looking for the advise regarding the performance for

Re: Drill and orc file support

2018-03-20 Thread Abhishek Girish
Drill can read ORC format files via the Hive plugin. If you have a Hive table with underlying data stored as ORC, try and configure the Hive storage plugin in Drill [1]. And then you can attempt to query the table in Hive from Drill [2]. [1] http://drill.apache.org/docs/hive-storage-plugin/ [2] h

Re: [Drill 1.13.0] : org.apache.thrift.TApplicationException: Invalid method name: 'get_table_req'

2018-03-20 Thread Abhishek Girish
specific to a certain table / > database in Hive? : Tried 2 tables but getting same error. I have not tried > creating anew one, will try that and let you know. > > > > > On Tue, Mar 20, 2018 3:19 PM, Abhishek Girish agir...@apache.org wrote: > Down in the stack trace it&

Re: How to get data from mongo database into saiku using apache drill

2018-03-20 Thread Abhishek Girish
issue. On Tue, Mar 20, 2018 at 3:27 PM, Abhishek Girish wrote: > Is the issue specific to Mongo datasource (can you access regular files > through the tool)? Do you see any errors in the drillbit.log when you > attempt to access the mongo table? > > On Mon, Mar 19, 2018 at 5:27 PM,

Re: How to get data from mongo database into saiku using apache drill

2018-03-20 Thread Abhishek Girish
Is the issue specific to Mongo datasource (can you access regular files through the tool)? Do you see any errors in the drillbit.log when you attempt to access the mongo table? On Mon, Mar 19, 2018 at 5:27 PM, Sonu Kumawat wrote: > Hi, > >I am trying to get data from mongo database using

Re: [Drill 1.13.0] : org.apache.thrift.TApplicationException: Invalid method name: 'get_table_req'

2018-03-20 Thread Abhishek Girish
Down in the stack trace it's complaining that the table name 'cad' was not found; Can you do a 'use hive;` followed by 'show tables;' and see if table 'cad' is listed? If you try via hive shell, do you see it? Also, can you check if this is impacting accessing all hive tables (may be create a new

Re: [ANNOUNCE] Apache Drill release 1.13.0

2018-03-18 Thread Abhishek Girish
Congratulations everyone, on yet another great release of Apache Drill! On Mon, Mar 19, 2018 at 6:57 AM Parth Chandra wrote: > On behalf of the Apache Drill community, I am happy to announce the > release of > Apache Drill 1.13.0. > > For information about Apache Drill, and to get involved, visit

Re: MapR Drill 1.12 Mismatch between Native and Library Versions

2018-02-09 Thread Abhishek Girish
gt; in my drill_env.sh. This allowed drill to start without that same > > error. > > > > > > Now, I am being a little different. Instead of "installing" drill via > > > RPMs, I download the RPMs (and I did this for both 1.10 and 1.12 from > > MapR

Re: MapR Drill 1.12 Mismatch between Native and Library Versions

2018-02-08 Thread Abhishek Girish
Can you also share the contents of (1) MapR build version on the cluster nodes (cat /opt/mapr/MapRBuildVersion) (2) Drill RPM version installed (rpm -qa |grep -i mapr-drill) And also verify if the maprfs and maprdb jars inside $DRILL_HOME/jars/3rdparty are links to the corresponding jars in /opt/m

Re: No FileSystem for scheme: maprfs

2018-02-01 Thread Abhishek Girish
Hey, Images haven't come through (usually attachments aren't supported in mailing lists). Can you please find another way of sharing them? Also, can you share how you deployed Drill - did you build from source or download packages from MapR? The message usually means your DFS storage plugin is un

Re: [ANNOUNCE] Apache Drill 1.11.0 Released

2017-07-31 Thread Abhishek Girish
Congratulations everyone! On Mon, Jul 31, 2017 at 5:16 AM, Arina Yelchiyeva < arina.yelchiy...@gmail.com> wrote: > On behalf of the Apache Drill community, I am happy to announce the release > of Apache Drill 1.11.0. > > For information about Apache Drill, and to get involved, visit the project >

Re: append data to already existing table saved in parquet format

2017-07-25 Thread Abhishek Girish
Drill doesn't have support for an insert into command. You could try using the CTAS command to write to a specific partition directory, may be? Also look at CTAS auto partitioning [1] [1] https://drill.apache.org/docs/partition-by-clause/ On Tue, Jul 25, 2017 at 10:52 PM, Divya Gehlot wrote: >

Re: CTAS and save as parquet last column values are shown as null

2017-07-24 Thread Abhishek Girish
Filed DRILL-5684 <https://issues.apache.org/jira/browse/DRILL-5684> to track the doc issue. On Mon, Jul 24, 2017 at 8:33 AM, Abhishek Girish wrote: > Glad to know that it worked! > > As you are using Drill on Windows, the new line delimiter in text files > can be different

Re: CTAS and save as parquet last column values are shown as null

2017-07-24 Thread Abhishek Girish
o cast or convert > to_date it throws me error > > > > > Thanks , > Divya > > On 24 July 2017 at 14:17, Abhishek Girish wrote: > >> Can you update your csv format plugin as shown below and retry your query? >> >> "csv": { >>

Re: CTAS and save as parquet last column values are shown as null

2017-07-23 Thread Abhishek Girish
t;15/1/2016","Karnataka\r"] > >> ["Friday","25/3/2016","Karnataka\r"] > >> ["Friday","8/4/2016","Karnataka\r"] > >> ["Monday","7/3/2016","Karnataka\r"] &

Re: CTAS and save as parquet last column values are shown as null

2017-07-23 Thread Abhishek Girish
t; CSV File > > Parquet File > > > > Appreciate the help ! > > Thanks, > Divya ​ > > On 24 July 2017 at 11:52, Abhishek Girish wrote: > >> Can you share a sample row from the CSV and the CTAS query? Also test if a >> select columns[n] query on the CSV

Re: CTAS and save as parquet last column values are shown as null

2017-07-23 Thread Abhishek Girish
Can you share a sample row from the CSV and the CTAS query? Also test if a select columns[n] query on the CSV file works as expected [1] ? It could be an issue with delimiters. [1] https://drill.apache.org/docs/querying-plain-text-files/#columns[n]-syntax On Sun, Jul 23, 2017 at 8:44 PM Divya Geh

Re: Add oracle as a storage fails: Please retry: error (unable to create/ update storage)

2017-07-21 Thread Abhishek Girish
Thanks for sharing. We should probably add a note on this to the jdbc storage plugin doc. On Fri, Jul 21, 2017 at 1:00 PM, Dan Holmes wrote: > Turns out after you put the jar in the directory you have to restart drill. > > Works now. > > Dan Holmes | Revenue Analytics, Inc. > Direct: 770.859.125

Re: Running drill on Windows

2017-07-19 Thread Abhishek Girish
JAVA_HOME without spaces is general requirement for installation on > Windows. > For example, the same is for Hadoop - > https://wiki.apache.org/hadoop/Hadoop2OnWindows > Regarding, memory configuration issues, this should be checked. > > Kind regards > Arina > > On We

Re: Running drill on Windows

2017-07-19 Thread Abhishek Girish
Hey Arina, This is pretty helpful. However, this can only constitute as a workaround and not native Windows support, correct? - as cygwin or similar utilities are a pre-requisite. And further, I'm not sure all configurations would take effect as expected - for instance, I vaguely remember that whi

Re: 1.11 Release date

2017-06-19 Thread Abhishek Girish
arkive.com/Ue0zF3kp/ > s3-storage-plugin-not-working-for-signature-v4-regions> > >) > > > > So, I think rather than using a forked Drill version for short term > > purposes, I'd like to present the product team with with a demo using an > > official Dri

Re: 1.11 Release date

2017-06-18 Thread Abhishek Girish
That's a good question - however, I don't think it hasn't been discussed yet. If not a 1.11.0 release right away, we should consider having a 1.10.1 minor release. Coming to your question on issue with Windows embedded JDBC, if you see that's now been resolved (is there a JIRA?), you could build

Re: Connecting to S3 bucket which does not seem to require a key

2017-06-12 Thread Abhishek Girish
goldsby wrote: > Well, these are for a specific user I created for this bucket. The user > only has read access to this bucket, which only contains this public > citibike data and has no permissions access. > So, I'm fine if anyone can connect (at least until I figure out the > prob

Re: Connecting to S3 bucket which does not seem to require a key

2017-06-12 Thread Abhishek Girish
Jack > > On Mon, Jun 12, 2017 at 10:43 AM, Andries Engelbrecht < > aengelbre...@mapr.com > > wrote: > > > You may be better of downloading the NYC bike data set locally and > convert > > to parquet. > > Converting from csv.zip to parquet will result in la

Re: Connecting to S3 bucket which does not seem to require a key

2017-06-11 Thread Abhishek Girish
Drill connects to to S3 buckets (AWS) via the S3a library. And the storage plugin configuration requires the access & secret keys [1]. I'm not sure if Drill can access S3 without the credentials. It might be possible via custom authenticators [2]. Hopefully others who have tried this will comment.

Re: Drill with Cassandra

2017-06-11 Thread Abhishek Girish
so how does this project > differentiate from Presto which seem to have Cassandra adapter? > > -- > > Thanks, > Sandeep > > > > On Mon, Jun 5, 2017 at 12:55 PM, Abhishek Girish > wrote: > > > Currently Drill does not support Cassandra as a datasource. Ther

Re: md5 function

2017-06-11 Thread Abhishek Girish
There is no in-built function for MD5 afaik, but you can create a UDF for it. Please refer to a related discussion on the user list [1] and Drill documentation on UDFs [2], [3]. Once you are done, it would be helpful if you can please share your UDF with the community. [1] http://mail-archives.a

Re: Drill with Cassandra

2017-06-05 Thread Abhishek Girish
Currently Drill does not support Cassandra as a datasource. There was some previous work on a cassandra plugin [1], but I do not think that's been completed. You are welcome to contribute towards a Cassandra plugin. [1] https://issues.apache.org/jira/browse/DRILL-92 On Mon, Jun 5, 2017 at 9:31 A

Re: UNORDERED_RECEIVER taking 70% of query time

2017-06-01 Thread Abhishek Girish
Attachment hasn't come through. Can you upload the query profile to some cloud storage and share a link to it? Also, please share details on how large your dataset is, number of Drillbits, memory and other configurations. On Thu, Jun 1, 2017 at 10:18 PM, wrote: > Hi, > > > > I am running a sim

Re: Parquet on S3 - timeouts

2017-06-01 Thread Abhishek Girish
fs.s3a.connection.maximum 100 does fix the problem. No more timeouts and very quick response. No need to 'prime' the query... On Thu, Jun 1, 2017 at 4:08 PM, Abhishek Girish wrote: > Can you take a look at [1] and let us know if that helps resolve your > issue?

Re: Parquet on S3 - timeouts

2017-06-01 Thread Abhishek Girish
Can you take a look at [1] and let us know if that helps resolve your issue? [1] https://drill.apache.org/docs/s3-storage-plugin/#quering-parquet-format-files-on-s3 On Thu, Jun 1, 2017 at 12:55 PM, Raz Baluchi wrote: > Now that I have Drill working with parquet files on dfs, the next step was >

Re: Apache Drill 1.0 Web Console - Profiles issue

2017-05-25 Thread Abhishek Girish
Hey Federico, Drill persists query profiles on the local disk of the foreman node. Assuming you checked the Web UI only on one node, can you see if query profiles are listed on the Web UI of other nodes? If yes, then it's expected. You can configure your profiles dir to be on DFS [1], to get a uni

Re: Can we Import the HDFS Query results to Any RDBMS using Apache Drill

2017-05-25 Thread Abhishek Girish
Hello Jagadeesh, Drill supports writing to disk via the CTAS command [1]. So you can read from RDBMS and write to HDFS. However, currently we don't support the other way round. Just curious, can you share what you are trying to achieve here? [1] https://drill.apache.org/docs/create-table-as-ctas

Re: S3 configuration for ceph or atmos

2017-05-24 Thread Abhishek Girish
---+ > > 1 row selected (221.293 seconds) > > > The S3 bucket configured in the plugin has approximately 20,000 objects. My > assumption is that there is some sort of metadata scan that occurs anytime > a command is executed? Any suggestions on how

Re: Writing to s3 using Drill

2017-05-24 Thread Abhishek Girish
"defaultInputFormat": null } OR create table s3.a_ctas as select * from s3.a "default": { "location": "/", "writable": true, "defaultInputFormat": null } On Wed, May 24, 2017 at 12:22 PM, Abhishek Girish

Re: Does s3 plugin support AWS S3 signature version 4 ?

2017-05-24 Thread Abhishek Girish
There hasn't been any updates to Drill's S3 support. Also, we are able to query recently created buckets - so I'm guessing the specific version of signature shouldn't matter, as we use Amazon's library (S3a) [1]. You could file a enhancement request on JIRA [2] to support custom clients and authen

  1   2   3   >