[
https://issues.apache.org/jira/browse/SAMZA-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14234893#comment-14234893
]
Yi Pan (Data Infrastructure) commented on SAMZA-390:
----------------------------------------------------
In addition to [~criccomini]'s comments, here are a few more addition notes for
tigon.io from my side:
# concept of “temporal field”
# has a FILTER_JOIN to allow joining two streams within a time duration, which
mimic a window in JOIN
# GROUP BY on temporal field means group is closed and can be computed
incrementally
# has an output_spec.cfg to allow configuration of partition fields and
n_partitions. Don't quite like this. I would like to see it can be specified
via grammar
So far, my only take-away from tigon is:
# if a query result has a temporal field as one of its GROUP By or JOIN keys,
and the incoming messages can be expected to flow in-order-of the timestamps,
we can perform incremental computation of query results.
> High-Level Language for Samza
> -----------------------------
>
> Key: SAMZA-390
> URL: https://issues.apache.org/jira/browse/SAMZA-390
> Project: Samza
> Issue Type: New Feature
> Reporter: Raul Castro Fernandez
> Priority: Minor
> Labels: project
>
> Discussion about high-level languages to define Samza queries. Queries are
> defined in this language and transformed to a dataflow graph where the nodes
> are Samza jobs.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)