[GitHub] [flink] echauchot commented on pull request #15436: [FLINK-6417] Glob support for FileInputFormat

2021-05-06 Thread GitBox


echauchot commented on pull request #15436:
URL: https://github.com/apache/flink/pull/15436#issuecomment-833411086


   And besides, I think that the new FileSource API with FileEnumerator will 
support globs out of the box


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [flink] echauchot commented on pull request #15436: [FLINK-6417] Glob support for FileInputFormat

2021-04-26 Thread GitBox


echauchot commented on pull request #15436:
URL: https://github.com/apache/flink/pull/15436#issuecomment-826626541


   @zentol @AHeise I'm closing this PR because I discovered that it does not 
work on S3 filesystem.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [flink] echauchot commented on pull request #15436: [FLINK-6417] Glob support for FileInputFormat

2021-04-21 Thread GitBox


echauchot commented on pull request #15436:
URL: https://github.com/apache/flink/pull/15436#issuecomment-823911672


   @zentol thanks for your comments and sorry for the delay of my answer (I was 
very busy on other things and then off). 
   I agree with your comments on performance, possible permissions problems and 
filesystem support. But I don't quite agree on the fact that it does not bring 
new functionality: in some cases input files are split because of their size 
into unpredictable names. In that cases, to be able to read them you need to 
use a glob (e.g. myfile_part*) which does not work in current flink but which 
is addressed by this PR. 
   
   Maybe it should be addressed with different code (that limit above problems) 
but still it is a feature that is needed IMHO.
   
   WDYT ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [flink] echauchot commented on pull request #15436: [FLINK-6417] Glob support for FileInputFormat

2021-03-30 Thread GitBox


echauchot commented on pull request #15436:
URL: https://github.com/apache/flink/pull/15436#issuecomment-810354513


   @AHeise  can you review this please ?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org