[jira] [Commented] (PHOENIX-1609) MR job to populate index tables

Thomas D'Silva (JIRA) Fri, 31 Jul 2015 16:52:49 -0700

    [ 
https://issues.apache.org/jira/browse/PHOENIX-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14650027#comment-14650027
 ]


Thomas D'Silva commented on PHOENIX-1609:
-----------------------------------------

[[email protected]] 

I was testing out the MR index build on a table with 1 billion rows. The map 
reduce job completed but I got the following error at the end when it tries to 
load the HFiles. Any idea what might be happening?

2015-07-31 23:39:39,746 ERROR [main] index.IndexTool -  An exception occured 
while performing the indexing job : java.io.IOException: Trying to load more 
than 32 hfiles to one family of one region
        at 
org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles.doBulkLoad(LoadIncrementalHFiles.java:371)
        at org.apache.phoenix.mapreduce.index.IndexTool.run(IndexTool.java:226)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
        at org.apache.phoenix.mapreduce.index.IndexTool.main(IndexTool.java:299)

> MR job to populate index tables 
> --------------------------------
>
>                 Key: PHOENIX-1609
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1609
>             Project: Phoenix
>          Issue Type: New Feature
>            Reporter: maghamravikiran
>            Assignee: maghamravikiran
>             Fix For: 5.0.0, 4.4.0
>
>         Attachments: 0001-PHOENIX-1609-4.0.patch, 
> 0001-PHOENIX-1609-4.0.patch, 0001-PHOENIX-1609-wip.patch, 
> 0001-PHOENIX_1609.patch, PHOENIX-1609-master.patch
>
>
> Often, we need to create new indexes on master tables way after the data 
> exists on the master tables.  It would be good to have a simple MR job given 
> by the phoenix code that users can call to have indexes in sync with the 
> master table. 
> Users can invoke the MR job using the following command 
> hadoop jar org.apache.phoenix.mapreduce.Index -st MASTER_TABLE -tt 
> INDEX_TABLE -columns a,b,c
> Is this ideal? 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (PHOENIX-1609) MR job to populate index tables

Reply via email to