[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-3613: Status: Open (was: Patch Available) > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch, PIG-3613.1.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Attachment: PIG-3613.1.patch Attached patch. RB: https://reviews.apache.org/r/16553/ > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch, PIG-3613.1.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Status: Patch Available (was: In Progress) Thanks Cheolsoo for your inputs! I have removed the external dependency.Attached updated patch. > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch, PIG-3613.1.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3613: --- Status: Open (was: Patch Available) [~rekhajoshm], thank you for the patch. But unfortunately, it isn't committable. Apparently, you're using a 3rd party library for your udf- {code} +import com.wcohen.ss.JaroWinkler; {code} But you're not adding the dependency in ivy.xml. Looking at the changes in build.xml, looks like you downloaded secondarystring.jar on your local machine and compiled against it. Well, that will work for nobody but you. Canceling the patch. > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Attachment: PIG-3613.0.patch > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Attachment: (was: PIG-3613.0.patch) > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Attachment: PIG-3613.0.patch > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > Attachments: PIG-3613.0.patch > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (PIG-3613) UDF for SimilarityMatching between strings with matching scores
[ https://issues.apache.org/jira/browse/PIG-3613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated PIG-3613: - Tags: udf Fix Version/s: 0.10.1 Labels: piggybank (was: ) Release Note: UDF for SimilarityMatching for string and returns the matching score. Status: Patch Available (was: In Progress) Attached patch. > UDF for SimilarityMatching between strings with matching scores > --- > > Key: PIG-3613 > URL: https://issues.apache.org/jira/browse/PIG-3613 > Project: Pig > Issue Type: Task > Components: piggybank >Affects Versions: 0.10.1 >Reporter: Rekha Joshi >Assignee: Rekha Joshi > Labels: piggybank > Fix For: 0.10.1 > > > It would be great if we can do similarity matching between strings on big > data using pig udf. > Proposed udf works on tuple of strings and gives a matching score. -- This message was sent by Atlassian JIRA (v6.1#6144)