kamszPolidea commented on a change in pull request #4169: [BEAM-3060] Added
support for multiple filesystems in TextIO
URL: https://github.com/apache/beam/pull/4169#discussion_r153293304
##########
File path:
sdks/java/io/common/src/test/java/org/apache/beam/sdk/io/common/IOTestPipelineOptions.java
##########
@@ -100,4 +100,14 @@
String getFilenamePrefix();
void setFilenamePrefix(String prefix);
+
+ @Description("Google cloud storage - bucket_name/path")
+ String getGcsLocation();
Review comment:
We can use `--filenamePrefix`, but then we need to provide full
communication scheme there for GCS or HDFS, for instance
`gs://bucket/path/file` or `hdfs://hadoop-master:port/dfs-path/file`. If we
assume that user running tests will know it then those two gcsLocation and
hdfsLocation could be ommited. This is basically implementation of our proposal
https://docs.google.com/document/d/1dA-5s6OHiP_cz-NRAbwapoKF5MEC1wKps4A5tFbIPKE/edit#heading=h.29mfbxd6kc64
. Do you think would be better to remove those two pipeline options and just
depend on pipelinePrefix ? Should I also remove protocol resolving part then ?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
With regards,
Apache Git Services