[ 
https://issues.apache.org/jira/browse/PIG-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13764307#comment-13764307
 ] 

Jacob Perkins commented on PIG-3453:
------------------------------------

I like the idea of a PoC. I hadn't really thought about the case that you'd 
want to run the same aggregations in real time as you would against your 
warehouse. It sure makes a lot of sense. Mostly I was thinking of language 
reuse; complex stream aggregations using the same familiar data flow syntax of 
Pig.

More examples of the translation from Pig scripts to trident topologies would 
be helpful. It'd be motivation enough to see the amount of boilerplate, etc, 
required to write a trident topology versus a pig script side by side. I've 
been working from a word count since it's sort of the canonical example.

I'm currently focusing on mapping Pig's LogicalPlan to a TridentTopology. Other 
really helpful things (that I'm less likely to make fast progress on) include 
defining a new operator for spouts, called 'tap' and some planning/discussion 
for a new operator for persisting state. 

[~pradeepg26] - Anything else? It'd be nice to get a sense of who's 
working/willing to work on this and where we're at now.
                
> Implement a Storm backend to Pig
> --------------------------------
>
>                 Key: PIG-3453
>                 URL: https://issues.apache.org/jira/browse/PIG-3453
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Pradeep Gollakota
>              Labels: storm
>
> There is a lot of interest around implementing a Storm backend to Pig for 
> streaming processing. The proposal and initial discussions can be found at 
> https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to