[jira] [Commented] (SAMZA-484) Define the serialization/deserialization format for stream tuple

Martin Kleppmann (JIRA) Thu, 27 Nov 2014 06:30:13 -0800

    [ 
https://issues.apache.org/jira/browse/SAMZA-484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14227700#comment-14227700
 ]


Martin Kleppmann commented on SAMZA-484:
----------------------------------------

See also SAMZA-429 for a discussion of this topic. It includes a proposal for 
allowing high-level tools to access fields within a message, while keeping the 
pluggable serialization interface that we already have.

It might be simplest to assume JSON for a first implementation, because it 
avoids the burden of schema management. But the APIs should leave the door open 
for replacing it with something more efficient, like Avro, in future.

> Define the serialization/deserialization format for stream tuple
> ----------------------------------------------------------------
>
>                 Key: SAMZA-484
>                 URL: https://issues.apache.org/jira/browse/SAMZA-484
>             Project: Samza
>          Issue Type: Sub-task
>            Reporter: Yi Pan (Data Infrastructure)
>            Priority: Minor
>              Labels: project
>
> It came out in the discussion for streaming SQL that we will need to define 
> the serialization/deserialization format for stream tuple.
> The ideal serialization/deserialization format should allow both forward and 
> backward compatibility on additional/missing fields in the data.
> Several choices to be considered:
> 1) Avro
> 2) Protobuf
> 3) Flatbuffer
> It might also be interesting to consider a pluggable serialization interface 
> that allows different serialization methods for different Samza jobs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (SAMZA-484) Define the serialization/deserialization format for stream tuple

Reply via email to