[ https://issues.apache.org/jira/browse/HIVE-4590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068880#comment-14068880 ]
Lefty Leverenz commented on HIVE-4590: -------------------------------------- Done: {quote} The following very simple MapReduce program reads data from one table which it assumes to have an integer in the second column ("column 1"), and counts how many instances of each distinct value it finds. That is, it does the equivalent of "select col1, count\(*\) from $table group by col1;". For example, if the values in the second column are \{1,1,1,3,3,5\} the program will produce this output of values and counts: 1, 3 3, 2 5, 1 {quote} * [HCatalog Input Output -- Read Example | https://cwiki.apache.org/confluence/display/Hive/HCatalog+InputOutput#HCatalogInputOutput-ReadExample] > HCatalog documentation example is wrong > --------------------------------------- > > Key: HIVE-4590 > URL: https://issues.apache.org/jira/browse/HIVE-4590 > Project: Hive > Issue Type: Bug > Components: Documentation, HCatalog > Affects Versions: 0.10.0 > Reporter: Eugene Koifman > Assignee: Lefty Leverenz > Priority: Minor > > http://hive.apache.org/docs/hcat_r0.5.0/inputoutput.html#Read+Example > reads > The following very simple MapReduce program reads data from one table which > it assumes to have an integer in the second column, and counts how many > different values it sees. That is, it does the equivalent of "select col1, > count(*) from $table group by col1;". > The description of the query is wrong. It actually counts how many instances > of each distinct value it find. For example, if values of col1 are > {1,1,1,3,3,3,5) it will produce > 1, 3 > 3, 2, > 5, 1 > -- This message was sent by Atlassian JIRA (v6.2#6252)