[ https://issues.apache.org/jira/browse/HIVE-4590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068228#comment-14068228 ]
Eugene Koifman commented on HIVE-4590:
--------------------------------------

[~leftylev]
1. The MR program does "value.get(1)" in reduce(), which means its "col1" is the 2nd column. Presumably the 1st (0th) column could have been "UserName".
2. You are correct on both.

> HCatalog documentation example is wrong
> ---------------------------------------
>
> Key: HIVE-4590
> URL: https://issues.apache.org/jira/browse/HIVE-4590
> Project: Hive
> Issue Type: Bug
> Components: Documentation, HCatalog
> Affects Versions: 0.10.0
> Reporter: Eugene Koifman
> Assignee: Lefty Leverenz
> Priority: Minor
>
> http://hive.apache.org/docs/hcat_r0.5.0/inputoutput.html#Read+Example
> reads
> The following very simple MapReduce program reads data from one table which
> it assumes to have an integer in the second column, and counts how many
> different values it sees. That is, it does the equivalent of "select col1,
> count(*) from $table group by col1;".
> The description of the query is wrong. It actually counts how many instances
> of each distinct value it finds. For example, if values of col1 are
> {1,1,1,3,3,3,5} it will produce
> 1, 3
> 3, 3
> 5, 1

--
This message was sent by Atlassian JIRA
(v6.2#6252)
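
For illustration only, here is a minimal plain-Java sketch of the distinction discussed in this issue: counting the instances of each distinct value (what "select col1, count(*) from $table group by col1;" does) versus counting how many different values there are. It does not use Hadoop or HCatalog; the class name GroupByCountSketch and both method names are hypothetical and are not taken from the documented example.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration class; not part of HCatalog or its documentation.
class GroupByCountSketch {

    // Counts how many times each distinct value occurs, i.e. the equivalent of
    // "select col1, count(*) from $table group by col1;".
    static Map<Integer, Long> countPerValue(List<Integer> col1) {
        Map<Integer, Long> counts = new TreeMap<>(); // sorted keys for stable output
        for (Integer v : col1) {
            counts.merge(v, 1L, Long::sum);
        }
        return counts;
    }

    // Counts how many different values occur, which is a different question.
    static int countDistinct(List<Integer> col1) {
        return countPerValue(col1).size();
    }

    public static void main(String[] args) {
        List<Integer> col1 = Arrays.asList(1, 1, 1, 3, 3, 3, 5);
        countPerValue(col1).forEach(
            (value, count) -> System.out.println(value + ", " + count));
        System.out.println("distinct values: " + countDistinct(col1));
    }
}
```

For the values 1, 1, 1, 3, 3, 3, 5 this prints one "value, count" line per distinct value (1, 3 then 3, 3 then 5, 1), followed by "distinct values: 3".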