[
https://issues.apache.org/jira/browse/PIG-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13777355#comment-13777355
]
Brian ONeill commented on PIG-3453:
-----------------------------------
Also, we could perform DISTINCT using a backend storage mechanism (like
Cassandra), where we first check storage to see if the tuple exists, if it does
not, we emit. If we first route all the same tuples to a single bolt, then
check from there that may work (eliminating the potential for two bolts to
check for existence at the same time). Using backend storage would allow
someone to perform a true DISTINCT operation.
> Implement a Storm backend to Pig
> --------------------------------
>
> Key: PIG-3453
> URL: https://issues.apache.org/jira/browse/PIG-3453
> Project: Pig
> Issue Type: New Feature
> Reporter: Pradeep Gollakota
> Labels: storm
>
> There is a lot of interest around implementing a Storm backend to Pig for
> streaming processing. The proposal and initial discussions can be found at
> https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira