[
https://issues.apache.org/jira/browse/PIG-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13763911#comment-13763911
]
Brian ONeill commented on PIG-3453:
-----------------------------------
I'm definitely +1 on this and would be glad to help out with a proof of concept
(PoC). Do we have a concrete use case? Right now, we have a large historical
dataset in HDFS/S3 against which we run Pig scripts. We also ingest real-time
events that have the same data and structure (using a Kafka queue + Storm).
The real-time system needs to perform the same analysis captured in the Pig
scripts. To accommodate both use cases, we are looking at maintaining two sets
of infrastructure/code-bases. I'd love to be able to deploy the Pig scripts to
Storm. I could take a stab at translating one of our simple Pig scripts (a
FILTER + GROUP) into a Trident topology.
(If we get a solid PoC on this, I could incorporate it as a chapter in the
Storm book we are writing.)
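
As a rough illustration of what translating a FILTER + GROUP script to a
streaming backend has to preserve, here is a minimal sketch in plain Python
(no Pig, Storm, or Trident dependencies). Everything in it is an assumption
made for illustration: the event schema, the filter predicate, the Pig script
in the comment, and the COUNT aggregate added after the GROUP. It only shows
that one shared predicate and grouping key can drive both a batch result and
per-event running updates; it is not how a Trident topology would actually be
written.

```python
from collections import Counter
from typing import Iterable, Iterator, NamedTuple, Tuple

# Hypothetical Pig script being mirrored (not taken from the original comment):
#   events  = LOAD 'events' AS (user:chararray, status:int);
#   errors  = FILTER events BY status >= 500;
#   by_user = GROUP errors BY user;
#   counts  = FOREACH by_user GENERATE group, COUNT(errors);


class Event(NamedTuple):
    # Assumed schema; the real data layout is not described in the thread.
    user: str
    status: int


def is_error(event: Event) -> bool:
    """The FILTER predicate (assumed: keep server errors only)."""
    return event.status >= 500


def batch_counts(events: Iterable[Event]) -> Counter:
    """Batch semantics: FILTER, GROUP BY user, then COUNT each group."""
    return Counter(e.user for e in events if is_error(e))


def streaming_counts(events: Iterable[Event]) -> Iterator[Tuple[str, int]]:
    """Streaming semantics: same predicate and key, but the per-key state is
    updated one event at a time and the running count is emitted each time."""
    state: Counter = Counter()
    for e in events:
        if is_error(e):
            state[e.user] += 1
            yield e.user, state[e.user]


# Tiny made-up input used for both modes.
HISTORY = [Event("a", 200), Event("a", 500), Event("b", 503), Event("a", 502)]

if __name__ == "__main__":
    print(dict(batch_counts(HISTORY)))
    for user, running in streaming_counts(HISTORY):
        print(user, running)
```

Once the whole input has been consumed, the last running count emitted for
each key matches the batch count for that key, which is the property a shared
Pig script would need to hold across the two backends.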
> Implement a Storm backend to Pig
> --------------------------------
>
> Key: PIG-3453
> URL: https://issues.apache.org/jira/browse/PIG-3453
> Project: Pig
> Issue Type: New Feature
> Reporter: Pradeep Gollakota
> Labels: storm
>
> There is a lot of interest around implementing a Storm backend to Pig for
> streaming processing. The proposal and initial discussions can be found at
> https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira