Re: TextIO binary file

2017-02-05 Thread Aviem Zur
I agree that these files will serve no use outside of Beam pipelines. The rationale was that you might want to have one pipeline write output to files and then have a different pipeline that uses those files as inputs. Say one team in your organization creates a pipeline and a different team util

Re: TextIO binary file

2017-02-05 Thread Eugene Kirpichov
OK, I see what you mean; however I still think this can be solved without introducing a new "Beam object file" (or whatever) file format, and without thereby introducing additional use cases and compatibility constraints on coders. I asked before in the thread why not just use AvroIO (it can seria

Re: TextIO binary file

2017-02-05 Thread Aviem Zur
AvroIO is great for POJOs. But for use cases with more complex, serializable objects, or objects which are compatible with some coder, it falls short. Also, expecting less savvy users to know they need to use AvroIO might be a stretch. Some simpler API along the lines of ObjectFile might be mor

Re: TextIO binary file

2017-02-05 Thread Eugene Kirpichov
Hmm, do you have a concrete use case in mind, where all these circumstances come together?
- There is a need to write data from one Beam pipeline, and read it from another Beam pipeline
- These pipelines have to be kept separate, rather than merged into one pipeline
- It is okay that the data can n

Re: Jenkins build became unstable: beam_PostCommit_Java_RunnableOnService_Apex #411

2017-02-05 Thread Kenneth Knowles
This one is back to healthy. Filed https://issues.apache.org/jira/browse/BEAM-1400 for the flake. Might be Apex runner, might be runner/core-java. I'm betting on a combination of the two.
On Sun, Feb 5, 2017 at 4:37 PM, Apache Jenkins Server <jenk...@builds.apache.org> wrote:
> See

Re: [VOTE] Apache Beam, version 0.5.0, release candidate #2

2017-02-05 Thread Dan Halperin
+1
* I ran my own usual sanity check pipelines, which passed.
* New to this Beam release, I also ran some additional Google-internal tests.
* Verified module list: new modules are io-elasticsearch and io-mqtt; and verified licenses for the direct, non-Apache dependencies in pom.xml
* mvn apache-r