[
https://issues.apache.org/jira/browse/PIG-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13764812#comment-13764812
]
Pradeep Gollakota commented on PIG-3453:
----------------------------------------
I personally don't have a concrete use case for this yet. In terms of using a
system that can work both in warehousing and in real-time, I have been looking
at Summingbird (recently opensourced). I think the word count example is a good
place to start as it's the canonical example. However, I'd like to have a more
complicated example as well, so I'm writing a TF-IDF implementation in Pig and
in Trident. Perhaps, this can be step 2 PoC after word count. I'd also like to
cut out some of the more complex operations like nested foreach statements etc
in the initial PoC. I'm not sure yet how we'd solve them.
[~thedatachef] I started a new job last week and I'm not sure how this task
would fit into the road map of my new company yet. I'd love to work on this, if
I have time. You're more than welcome to work on this as well. Thanks for all
your great comments, input and enthusiasm.
> Implement a Storm backend to Pig
> --------------------------------
>
> Key: PIG-3453
> URL: https://issues.apache.org/jira/browse/PIG-3453
> Project: Pig
> Issue Type: New Feature
> Reporter: Pradeep Gollakota
> Labels: storm
>
> There is a lot of interest around implementing a Storm backend to Pig for
> streaming processing. The proposal and initial discussions can be found at
> https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira