[jira] [Commented] (HIVE-4590) HCatalog documentation example is wrong

Eugene Koifman (JIRA) Sun, 20 Jul 2014 22:38:28 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-4590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068228#comment-14068228
 ]


Eugene Koifman commented on HIVE-4590:
--------------------------------------

[~leftylev]
1. The MR program does "value.get(1)" in reduce() which means it's "col1" is 
the 2nd column.  Presumably the 1st (0th) column could have been "UserName".
2. you are correct on both

> HCatalog documentation example is wrong
> ---------------------------------------
>
>                 Key: HIVE-4590
>                 URL: https://issues.apache.org/jira/browse/HIVE-4590
>             Project: Hive
>          Issue Type: Bug
>          Components: Documentation, HCatalog
>    Affects Versions: 0.10.0
>            Reporter: Eugene Koifman
>            Assignee: Lefty Leverenz
>            Priority: Minor
>
> http://hive.apache.org/docs/hcat_r0.5.0/inputoutput.html#Read+Example
> reads
> The following very simple MapReduce program reads data from one table which 
> it assumes to have an integer in the second column, and counts how many 
> different values it sees. That is, it does the equivalent of "select col1, 
> count(*) from $table group by col1;".
> The description of the query is wrong.  It actually counts how many instances 
> of each distinct value it find.  For example, if values of col1 are 
> {1,1,1,3,3,3,5) it will produce
> 1, 3
> 3, 2,
> 5, 1
>  



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HIVE-4590) HCatalog documentation example is wrong

Reply via email to