[ 
https://issues.apache.org/jira/browse/DRILL-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942892#comment-16942892
 ] 

Arina Ielchiieva commented on DRILL-7004:
-----------------------------------------

Drill can only parallelize file reading per folder, since Drill uses 
org.apache.hadoop.fs.FileSystem. If a folder contains many files, it will 
take a long time to list all of them. Unfortunately, there is no other way 
to improve performance.
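
To illustrate the constraint, here is a minimal sketch of per-folder parallel 
listing. It is not Drill's code: java.nio.file stands in for 
org.apache.hadoop.fs.FileSystem, and the names PerFolderListing, ListFolder 
and listAll are hypothetical. Each folder becomes one task, but the listing 
of a single folder is one sequential call, so a folder with very many files 
does not benefit from additional threads.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveTask;

// Hypothetical illustration only, not Drill code: java.nio.file is used as a
// stand-in for org.apache.hadoop.fs.FileSystem.
class PerFolderListing {

    // One task per folder; sub-folders are forked as separate tasks.
    static final class ListFolder extends RecursiveTask<List<Path>> {
        private final Path folder;

        ListFolder(Path folder) {
            this.folder = folder;
        }

        @Override
        protected List<Path> compute() {
            List<Path> files = new ArrayList<>();
            List<ListFolder> subTasks = new ArrayList<>();
            // Sequential part: the entries of one folder come from a single
            // listing call, which cannot be split across threads.
            try (DirectoryStream<Path> entries = Files.newDirectoryStream(folder)) {
                for (Path entry : entries) {
                    if (Files.isDirectory(entry)) {
                        ListFolder task = new ListFolder(entry);
                        task.fork(); // parallel part: one task per sub-folder
                        subTasks.add(task);
                    } else {
                        files.add(entry);
                    }
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            // Collect the results of the sub-folder tasks.
            for (ListFolder task : subTasks) {
                files.addAll(task.join());
            }
            return files;
        }
    }

    // Lists all regular files under root using at most `parallelism` threads.
    static List<Path> listAll(Path root, int parallelism) {
        ForkJoinPool pool = new ForkJoinPool(parallelism);
        try {
            return pool.invoke(new ListFolder(root));
        } finally {
            pool.shutdown();
        }
    }
}
```

With a deep tree of small folders, PerFolderListing.listAll(root, 8) can keep 
several threads busy. With one flat folder holding all the files, the whole 
job is a single task regardless of the thread count.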


> improve show files functionnality
> ---------------------------------
>
>                 Key: DRILL-7004
>                 URL: https://issues.apache.org/jira/browse/DRILL-7004
>             Project: Apache Drill
>          Issue Type: Wish
>          Components: Storage - Other
>    Affects Versions: 1.15.0
>            Reporter: benj
>            Priority: Major
>
> Currently, it's possible to show files/directories in a particular 
> directory with the command
> {code:java}
> SHOW files FROM tmp.`mypath`;
> {code}
> It would certainly be very useful to extend this functionality with:
>  * possibility to list recursively
>  * possibility to use at least wildcards
> {code:java}
> SHOW files FROM tmp.`mypath/*/test/*/*a*`;
> {code}
>  * possibility to use the result like a table
> {code:java}
> SELECT p.* FROM (SHOW files FROM tmp.`mypath`) AS p WHERE ...
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
