[jira] [Updated] (PIG-4713) Document Bloom UDF

2015-10-30 Thread Gabor Liptak (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-4713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gabor Liptak updated PIG-4713:
--
Attachment: PIG-4713.1.patch

> Document Bloom UDF
> --
>
> Key: PIG-4713
> URL: https://issues.apache.org/jira/browse/PIG-4713
> Project: Pig
>  Issue Type: Task
>Reporter: Rohini Palaniswamy
>  Labels: newbie
> Attachments: PIG-4713.1.patch
>
>
> Release notes of https://issues.apache.org/jira/browse/PIG-2328 should go 
> into Builtin Functions (https://pig.apache.org/docs/r0.15.0/func.html) of 
> Apache Pig documentation.  
> Saw one user trying to use Bloom Filter to filter data on a different column 
> than the join column which should not be done as Bloom Filters give false 
> positives and can include records that actually don't match the filter 
> criteria. That should be documented as well and highlighted to avoid users 
> trying to use Bloom Filters for just regular filtering. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (PIG-4713) Document Bloom UDF

2015-10-30 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-4713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-4713:

   Resolution: Fixed
 Assignee: Gabor Liptak
 Hadoop Flags: Reviewed
Fix Version/s: 0.16.0
   Status: Resolved  (was: Patch Available)

+1. Patch committed to trunk. Thanks Gabor!

> Document Bloom UDF
> --
>
> Key: PIG-4713
> URL: https://issues.apache.org/jira/browse/PIG-4713
> Project: Pig
>  Issue Type: Task
>Reporter: Rohini Palaniswamy
>Assignee: Gabor Liptak
>  Labels: newbie
> Fix For: 0.16.0
>
> Attachments: PIG-4713.1.patch
>
>
> Release notes of https://issues.apache.org/jira/browse/PIG-2328 should go 
> into Builtin Functions (https://pig.apache.org/docs/r0.15.0/func.html) of 
> Apache Pig documentation.  
> Saw one user trying to use Bloom Filter to filter data on a different column 
> than the join column which should not be done as Bloom Filters give false 
> positives and can include records that actually don't match the filter 
> criteria. That should be documented as well and highlighted to avoid users 
> trying to use Bloom Filters for just regular filtering. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)