[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2014-04-22 Thread Alan Gates (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alan Gates updated PIG-3613:


Status: Open  (was: Patch Available)

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch, PIG-3613.1.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2014-01-01 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

Attachment: PIG-3613.1.patch

Attached patch.
RB: https://reviews.apache.org/r/16553/

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch, PIG-3613.1.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2014-01-01 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

Status: Patch Available  (was: In Progress)

Thanks Cheolsoo for your inputs!
I have removed the external dependency.Attached updated patch.

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch, PIG-3613.1.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2013-12-18 Thread Cheolsoo Park (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cheolsoo Park updated PIG-3613:
---

Status: Open  (was: Patch Available)

[~rekhajoshm], thank you for the patch. But unfortunately, it isn't committable.

Apparently, you're using a 3rd party library for your udf-
{code}
+import com.wcohen.ss.JaroWinkler;
{code}
But you're not adding the dependency in ivy.xml.

Looking at the changes in build.xml, looks like you downloaded 
secondarystring.jar on your local machine and compiled against it. Well, that 
will work for nobody but you.

Canceling the patch.


> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2013-12-11 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

Attachment: PIG-3613.0.patch

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2013-12-11 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

Attachment: (was: PIG-3613.0.patch)

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2013-12-06 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

Attachment: PIG-3613.0.patch

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
> Attachments: PIG-3613.0.patch
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores

2013-12-06 Thread Rekha Joshi (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rekha Joshi updated PIG-3613:
-

 Tags: udf
Fix Version/s: 0.10.1
   Labels: piggybank  (was: )
 Release Note: UDF for SimilarityMatching for string and returns the 
matching score.
   Status: Patch Available  (was: In Progress)

Attached patch.

> UDF for SimilarityMatching between strings with matching scores
> ---
>
> Key: PIG-3613
> URL: https://issues.apache.org/jira/browse/PIG-3613
> Project: Pig
>  Issue Type: Task
>  Components: piggybank
>Affects Versions: 0.10.1
>Reporter: Rekha Joshi
>Assignee: Rekha Joshi
>  Labels: piggybank
> Fix For: 0.10.1
>
>
> It would be great if we can do similarity matching between strings on big 
> data using pig udf.
> Proposed udf works on tuple of strings and gives a matching score.



--
This message was sent by Atlassian JIRA
(v6.1#6144)