[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-11 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736264#comment-13736264
 ] 

Hudson commented on HIVE-4913:
--

ABORTED: Integrated in Hive-trunk-hadoop2 #350 (See 
[https://builds.apache.org/job/Hive-trunk-hadoop2/350/])
HIVE-4913 : Put deterministic ordering in the top-K ngrams output of UDF 
context_ngrams() (Xuefu Zhang via Ashutosh Chauhan) (hashutosh: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1512590)
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NGramEstimator.java
* /hive/trunk/ql/src/test/results/clientpositive/udaf_context_ngrams.q.out


> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-10 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736165#comment-13736165
 ] 

Hudson commented on HIVE-4913:
--

FAILURE: Integrated in Hive-trunk-h0.21 #2259 (See 
[https://builds.apache.org/job/Hive-trunk-h0.21/2259/])
HIVE-4913 : Put deterministic ordering in the top-K ngrams output of UDF 
context_ngrams() (Xuefu Zhang via Ashutosh Chauhan) (hashutosh: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1512590)
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NGramEstimator.java
* /hive/trunk/ql/src/test/results/clientpositive/udaf_context_ngrams.q.out


> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-10 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736124#comment-13736124
 ] 

Hudson commented on HIVE-4913:
--

SUCCESS: Integrated in Hive-trunk-hadoop1-ptest #122 (See 
[https://builds.apache.org/job/Hive-trunk-hadoop1-ptest/122/])
HIVE-4913 : Put deterministic ordering in the top-K ngrams output of UDF 
context_ngrams() (Xuefu Zhang via Ashutosh Chauhan) (hashutosh: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1512590)
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NGramEstimator.java
* /hive/trunk/ql/src/test/results/clientpositive/udaf_context_ngrams.q.out


> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-10 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13736073#comment-13736073
 ] 

Hudson commented on HIVE-4913:
--

FAILURE: Integrated in Hive-trunk-hadoop2-ptest #51 (See 
[https://builds.apache.org/job/Hive-trunk-hadoop2-ptest/51/])
HIVE-4913 : Put deterministic ordering in the top-K ngrams output of UDF 
context_ngrams() (Xuefu Zhang via Ashutosh Chauhan) (hashutosh: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1512590)
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NGramEstimator.java
* /hive/trunk/ql/src/test/results/clientpositive/udaf_context_ngrams.q.out


> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-08 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13734370#comment-13734370
 ] 

Hive QA commented on HIVE-4913:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12596911/HIVE-4913.patch

{color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 2772 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucket_num_reducers
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_vc
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/349/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/349/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests failed with: TestsFailedException: 2 tests failed
{noformat}

This message is automatically generated.

> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-08 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733916#comment-13733916
 ] 

Ashutosh Chauhan commented on HIVE-4913:


+1

> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch, HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4913) Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()

2013-08-07 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13732982#comment-13732982
 ] 

Ashutosh Chauhan commented on HIVE-4913:


Patch contains some changes from other diff. Can you clean that up? Patch looks 
good.

> Put deterministic ordering in the top-K ngrams output of UDF context_ngrams()
> -
>
> Key: HIVE-4913
> URL: https://issues.apache.org/jira/browse/HIVE-4913
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 0.10.0, 0.11.0
>Reporter: Xuefu Zhang
>Assignee: Xuefu Zhang
>Priority: Minor
> Fix For: 0.12.0
>
> Attachments: HIVE-4913.patch
>
>
> Currently UDF context_ngrams() output top-K ngrams in the order of descending 
> frequency. When there are ties, i.e., phrases with same ngram value, the 
> order is indeterministic. Tie breaker is needed to break the ties, so that 
> the output is deterministic.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira