Re: Need help in querying HDFS from drill

2015-08-23 Thread Venki Korukanti
Hi, On Sun, Aug 23, 2015 at 10:26 PM, Malathi wrote: > Hi, > > The entire log of the sqlline is attached in the dropbox link: > https://www.dropbox.com/s/oijx9vjibk1md5x/sqlline.log?dl=0 > > Error id is : > > *aa3410a8-8b11-412e-bb89-dec3e8bb8bb4* > P.S : And one more thing to note is that it wo

Re: Need help in querying HDFS from drill

2015-08-23 Thread Malathi
Hi, The entire log of the sqlline is attached in the dropbox link: https://www.dropbox.com/s/oijx9vjibk1md5x/sqlline.log?dl=0 Error id is : *aa3410a8-8b11-412e-bb89-dec3e8bb8bb4* P.S : And one more thing to note is that it worked when both drill and apache HDFS run in centos in different machine

Re: Drill dir0 issue

2015-08-23 Thread Sungwook Yoon
Related weird behavior regarding Hive partitioned directories as dfs storage. I first created a view create view tmp_view as select cast(substr(`dir0`, 6,4) as int) as `year`, cast(aaa as varchar(100)) as aaa from dfs.root.`/user/hive/warehouse/table` o; select aaa from tmp_view where `year` betw

Re: Show Files Command

2015-08-23 Thread Jacques Nadeau
Thanks for putting together such a complete JIRA. That makes everyone's job much easier. I suggest that others who would also like this feature, vote for it. That way the Drill community can do a better job of prioritizing features. thanks again, Jacques -- Jacques Nadeau CTO and Co-Founder, D

Re: [jira] [Created] (DRILL-3697) REGEXP_REPLACE doc. says POSIX reg. expr.; which? not Java?

2015-08-23 Thread Jacques Nadeau
In the future, it would be helpful if you start by asking usage questions on the user list. I think that will get you faster answers and improve bug reports. Drill currently uses Java's regex. It would be great if you could provide a documentation pull request to correct the function documentatio

Re: Show Files Command

2015-08-23 Thread John Omernik
I've created DRILL-3698, could the team please evaluate to ensure I've created a JIRA using the proper labels/settings for what this is. (I choose future for all versioins as I was unsure what to put). Thanks all for the feedback. These "little" things have me very excited about Drill. https://

Re: Show Files Command

2015-08-23 Thread Edmon Begoli
I just tried the approach recommended approach by Kristine and it worked (almost) fine. However, there was a noise in the columns. I think, to make things consistent, there should be an embeddable sql command as Jacques suggests. On Sun, Aug 23, 2015 at 7:33 PM, Jacques Nadeau wrote: > We need

Fwd: [jira] [Created] (DRILL-3693) SQLLine/drill-localhost seems to demand 2.9GB just to start

2015-08-23 Thread Jacques Nadeau
Hey guys, if you have a question or want to have a discussion about something, please ask the question on the user list. The overhead of managing jiras when someone doesn't understand something is too high. The first question here is: what does drill-env.sh set the jvm memory settings to? This is

Re: Show Files Command

2015-08-23 Thread Jacques Nadeau
We need to expose the show files command as an embeddable sql command or expose a table analog. John, want to file an enhancement request? On Aug 23, 2015 4:31 PM, "John Omernik" wrote: > Kristine: We "can" do that, but it's not conducive to "exploration" it's a > lot of work just to sort somethi

Re: Show Files Command

2015-08-23 Thread John Omernik
Kristine: We "can" do that, but it's not conducive to "exploration" it's a lot of work just to sort something different, trying to find a way to do that sort of thing intuitively as part of walking through the data. Do folks think this worth opening a Jira on? I think it would be really useful, but

Re: Show Files Command

2015-08-23 Thread Kristine Hahn
Correction: save the output to a .tbl file Kristine Hahn Sr. Technical Writer 415-497-8107 @krishahn skype:krishahn On Sun, Aug 23, 2015 at 4:04 PM, Kristine Hahn wrote: > You can use the Drill Shell command !record to save the output to a > .tbl command and then query the PSV results using the

Re: Show Files Command

2015-08-23 Thread Kristine Hahn
You can use the Drill Shell command !record to save the output to a .tbl command and then query the PSV results using the COLUMNS[n] syntax, which can return the accessTime from COLUMNS[7]. Kristine Hahn Sr. Technical Writer 415-497-8107 @krishahn skype:krishahn On Sun, Aug 23, 2015 at 3:45 PM,

Re: Show Files Command

2015-08-23 Thread Edmon Begoli
Related to this - I saw someone on stackoverflow asking if modified_time returned from show files could be treated relationally. Can the show files result be queried or converted into query result from the sql command line, and if so how? On Sunday, August 23, 2015, Ted Dunning wrote: > The cle

Re: Drill dir0 issue

2015-08-23 Thread Sungwook Yoon
So, I filed the issue here, https://issues.apache.org/jira/browse/DRILL-3692 If more details are needed let me know. Sungwook On Sun, Aug 23, 2015 at 2:45 PM, Aman Sinha wrote: > Yes, I just realized that and was about to respond to my prior message. > I just tested with a directory structur

Re: Drill dir0 issue

2015-08-23 Thread Aman Sinha
Yes, I just realized that and was about to respond to my prior message. I just tested with a directory structure similar to Sungwook's (where directories are named with 'year=2012' format) and it works for me. But I am on the current master branch. In the original message 'Sometimes it picks up e.

Re: Drill dir0 issue

2015-08-23 Thread Jacques Nadeau
The way that Sungwook is describing the issue, it has nothing to do with Hive. The files were generated via Hive but he is querying directly through the DFS schema. -- Jacques Nadeau CTO and Co-Founder, Dremio On Sun, Aug 23, 2015 at 12:20 PM, Aman Sinha wrote: > Sungwook, do you have the late

Re: Drill dir0 issue

2015-08-23 Thread Aman Sinha
Sungwook, do you have the latest master build which has the fix for Hive partition pruning (DRILL-3121) ? On Sun, Aug 23, 2015 at 12:15 PM, Sungwook Yoon wrote: > Will do, > > Thanks, > > Sungwook > > > On Sun, Aug 23, 2015 at 2:14 PM, Jacques Nadeau > wrote: > > > It sounds like a bug. Can you

Re: Show Files Command

2015-08-23 Thread Ted Dunning
The cleanest fix would be to make the INFORMATION schema return information about file system objects. Then you could do clean selects with whatever you needed to do. https://drill.apache.org/docs/querying-the-information-schema/ On Sun, Aug 23, 2015 at 8:31 AM, USC wrote: > Hi John, > It is

Re: Drill dir0 issue

2015-08-23 Thread Sungwook Yoon
Will do, Thanks, Sungwook On Sun, Aug 23, 2015 at 2:14 PM, Jacques Nadeau wrote: > It sounds like a bug. Can you file a jira? > > -- > Jacques Nadeau > CTO and Co-Founder, Dremio > > On Sun, Aug 23, 2015 at 12:13 PM, Sungwook Yoon > wrote: > > > Hi Jacques, > > > > This works well, no proble

Re: Drill dir0 issue

2015-08-23 Thread Jacques Nadeau
It sounds like a bug. Can you file a jira? -- Jacques Nadeau CTO and Co-Founder, Dremio On Sun, Aug 23, 2015 at 12:13 PM, Sungwook Yoon wrote: > Hi Jacques, > > This works well, no problem of accessing the partitioned dirs. > (and actually pretty faster than accessing from one level above) > >

Re: Drill dir0 issue

2015-08-23 Thread Sungwook Yoon
Hi Jacques, This works well, no problem of accessing the partitioned dirs. (and actually pretty faster than accessing from one level above) Just the issues I asked about, when I access from the /user/hive/warehouse/table, it somehow does not recover every dir0. Sungwook On Sun, Aug 23, 2015 at

Re: Drill dir0 issue

2015-08-23 Thread Jacques Nadeau
I think Hsuan misunderstood your question. Can you let us know what you get if you query: select * from dfs.root.`/user/hive/warehouse/table/year=2012` -- Jacques Nadeau CTO and Co-Founder, Dremio On Sun, Aug 23, 2015 at 7:07 AM, Sungwook Yoon wrote: > Hi, > > I am trying to use Hive pa

Re: Drill dir0 issue

2015-08-23 Thread Andries Engelbrecht
Hi Sungwook, Can you create a different directory with a few files in each sub directory, but use 2012, 2013, 2014instead of year=2012, etc. Might be a good test to see if the directory naming structure of year= is tripping up Drill on directory pruning. —Andries > On Aug 23, 2015, at 9:4

Re: Drill dir0 issue

2015-08-23 Thread Kristine Hahn
If you set up your data in directories like the log data in the Querying Directories example on http://drill.apache.org/docs/querying-directories, which uses WHERE dir0='2013' LIMIT 10 in the query, and you are having intermittent Table Not Found results, look for hidden files in the directory you

Re: Drill dir0 issue

2015-08-23 Thread USC
Hi Sungwook, In your where clause, you only need to say year=2012. The directory column (e.g., dir0) is used when users query a directory. > On Aug 23, 2015, at 7:07 AM, Sungwook Yoon wrote: > > Hi, > > I am trying to use Hive parquet stored files partitioned by some column. > So, the directo

Re: Show Files Command

2015-08-23 Thread USC
Hi John, It is definitely a great idea to have SQL on show files. What we can do is open a JIRA issue. Usually, a interested person in community would pick up and work on :) > On Aug 23, 2015, at 4:07 AM, John Omernik wrote: > > Hey all, > > So while I've dabbled in drill, this past week I'v

Drill dir0 issue

2015-08-23 Thread Sungwook Yoon
Hi, I am trying to use Hive parquet stored files partitioned by some column. So, the directory structure is partitioned with the column. The column is actually year. Let's say there are 5 years, so dir0 are like year=2010, year=2011,year=2012,year=2013,year=2014 We did like following select * fr

Show Files Command

2015-08-23 Thread John Omernik
Hey all, So while I've dabbled in drill, this past week I've really dug in, and honestly, I think this project is a game changer, I was able to do some amazing things with Drill kudos to all the hard work that has been done with Drill. I had one question, and potential feature request: When usin