The cleanest fix would be to make the INFORMATION schema return information
about file system objects.  Then you could do clean selects with whatever
you needed to do.

https://drill.apache.org/docs/querying-the-information-schema/


On Sun, Aug 23, 2015 at 8:31 AM, USC <hsua...@usc.edu> wrote:

> Hi John,
> It is definitely a great idea to have SQL on show files.
>
> What we can do is open a JIRA issue. Usually, a interested person in
> community would pick up and work on :)
>
> > On Aug 23, 2015, at 4:07 AM, John Omernik <j...@omernik.com> wrote:
> >
> > Hey all,
> >
> > So while I've dabbled in drill, this past week I've really dug in, and
> > honestly, I think this project is a game changer, I was able to do some
> > amazing things with Drill kudos to all the hard work that has been done
> > with Drill.
> >
> > I had one question, and potential feature request:
> >
> > When using drill this weekend, I had a workspace setup, and I found
> myself
> > using the show files command often to find my directories etc. The thing
> > is, the return of show files is not ordered.  And when looking at file
> > system data there are many possible ways to order the results for
> > efficiency as a user.
> >
> > Consider the ls command in unix.  The ability to specify different
> sorting
> > is built in there.  I checked out
> > http://drill.apache.org/docs/show-files-command/ as well as tried the
> > "obvious" show files order by name and that didn't work nor did I see
> how I
> > could in the documentation.
> >
> > So, is there a way to order output? If there isn't now, could that be
> > added? I think just adding ORDER BY SQL methodology would be perfect
> here,
> > you have 8 fields (seen below) and ordering by any one of them, or group
> of
> > them, with ASC/DESC just like standard order by would be a huge win.
> >
> > I suppose one could potentially ask for WHERE clause too, and maybe a
> > select (which fields) however I am more concerned with the order, but if
> I
> > had to implement all there I could see:
> >
> > (All Three, select, where, and order) (I.e. after "Files" if the token
> > isn't WHERE  or ORDER then check for the fields, if it's not a valid
> field
> > list error)
> >
> > SHOW FILES name, accessTime where name like '%.csv' order by name;
> >
> > (Where clause and order, note the token after FILES is WHERE)
> > SHOW FILES WHERE name like '%.csv' order by length ASC, name DESC;
> >
> > (Only Order, ORDER Is the first token after FILES)
> > SHOW FILES ORDER BY length ASC, name DESC
> >
> > I don't think we have to grant full SQL functionality here, just the
> > ability to display various fields, filter on criteria, and ordering.. No
> > aggregates, etc. If you wanted to get fancy, I suppose you could take the
> > table and make it a full on table, i.e. take the results make it a quick
> > inmemory table and then utilize the whole drill stack (minus aggregates)
> of
> > functions on it.  Lots of options.  I just wanted to get this down in an
> > email as it was something I found myself wishing I had over and over
> during
> > data exploration.
> >
> >
> > |name| isDirectory  | isFile  | length | owner
> group|permissions|accessTime
> > | modificationTime  |
> >
> >
> >
> > John
>

Reply via email to