[
https://issues.apache.org/jira/browse/HIVE-4590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068228#comment-14068228
]
Eugene Koifman commented on HIVE-4590:
--------------------------------------
[~leftylev]
1. The MR program does "value.get(1)" in reduce(), which means its "col1" is
the 2nd column. Presumably the 1st (0th) column could have been "UserName".
2. You are correct on both.
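
For reference, here is a minimal, Hadoop-free sketch of the counting semantics
discussed in this issue. It assumes each record is a plain list whose index 1
holds the integer column, and that index 0 holds something like a user name.
The class name, method names, and sample rows are hypothetical and are not
taken from the HCatalog example itself.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class GroupByCountSketch {

    // Counts how many times each distinct value appears at column index 1,
    // roughly "select col1, count(*) from $table group by col1".
    static Map<Integer, Integer> countByCol1(List<List<Object>> records) {
        Map<Integer, Integer> counts = new TreeMap<>();
        for (List<Object> record : records) {
            Integer col1 = (Integer) record.get(1); // index 1 is the 2nd column
            counts.merge(col1, 1, Integer::sum);
        }
        return counts;
    }

    // Hypothetical sample rows: (user name, integer value).
    static List<List<Object>> sampleRecords() {
        return Arrays.asList(
            Arrays.<Object>asList("alice", 1),
            Arrays.<Object>asList("bob", 1),
            Arrays.<Object>asList("carol", 1),
            Arrays.<Object>asList("dave", 3),
            Arrays.<Object>asList("erin", 3),
            Arrays.<Object>asList("frank", 5));
    }

    public static void main(String[] args) {
        Map<Integer, Integer> counts = countByCol1(sampleRecords());
        // One output line per distinct value: "value, count".
        for (Map.Entry<Integer, Integer> e : counts.entrySet()) {
            System.out.println(e.getKey() + ", " + e.getValue());
        }
        // By contrast, "how many different values" would be a single number.
        System.out.println("distinct values: " + counts.size());
    }
}
```

With these sample rows the per-value counts are 1 -> 3, 3 -> 2, 5 -> 1, while
the number of different values is just 3, which is the distinction the doc
text blurs.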
> HCatalog documentation example is wrong
> ---------------------------------------
>
> Key: HIVE-4590
> URL: https://issues.apache.org/jira/browse/HIVE-4590
> Project: Hive
> Issue Type: Bug
> Components: Documentation, HCatalog
> Affects Versions: 0.10.0
> Reporter: Eugene Koifman
> Assignee: Lefty Leverenz
> Priority: Minor
>
> http://hive.apache.org/docs/hcat_r0.5.0/inputoutput.html#Read+Example
> reads
> The following very simple MapReduce program reads data from one table which
> it assumes to have an integer in the second column, and counts how many
> different values it sees. That is, it does the equivalent of "select col1,
> count(*) from $table group by col1;".
> The description of the query is wrong. It actually counts how many instances
> of each distinct value it finds. For example, if values of col1 are
> {1,1,1,3,3,5} it will produce
> 1, 3
> 3, 2
> 5, 1
>