[ 
https://issues.apache.org/jira/browse/PIG-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13777355#comment-13777355
 ] 

Brian ONeill commented on PIG-3453:
-----------------------------------

Also, we could perform DISTINCT using a backend storage mechanism (like 
Cassandra), where we first check storage to see if the tuple exists, if it does 
not, we emit.  If we first route all the same tuples to a single bolt, then 
check from there that may work (eliminating the potential for two bolts to 
check for existence at the same time).  Using backend storage would allow 
someone to perform a true DISTINCT operation.
                
> Implement a Storm backend to Pig
> --------------------------------
>
>                 Key: PIG-3453
>                 URL: https://issues.apache.org/jira/browse/PIG-3453
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Pradeep Gollakota
>              Labels: storm
>
> There is a lot of interest around implementing a Storm backend to Pig for 
> streaming processing. The proposal and initial discussions can be found at 
> https://cwiki.apache.org/confluence/display/PIG/Pig+on+Storm+Proposal

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to