[ 
https://issues.apache.org/jira/browse/HIVE-2289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13067889#comment-13067889
 ] 

John Sichi commented on HIVE-2289:
----------------------------------

@Siddharth: the only doc available is here:

https://cwiki.apache.org/confluence/display/Hive/IndexDev

It hasn't been updated since the initial work, so it doesn't include the topics 
you're after.  For that, you're best off digging into the code plus related 
JIRA issues.


> NumberFormatException with respect to _offsets when running a query with index
> -------------------------------------------------------------------------------
>
>                 Key: HIVE-2289
>                 URL: https://issues.apache.org/jira/browse/HIVE-2289
>             Project: Hive
>          Issue Type: Bug
>          Components: Indexing
>    Affects Versions: 0.7.0
>         Environment: RedHat 5
>            Reporter: siddharth ramanan
>
> I have a table named foo with columns origin, destination and information.
> Steps I followed to create an index named foosample on foo:
> 1) create index foosample on table foo(origin) as 'compact' with deferred rebuild;
> 2) alter index foosample on foo rebuild;
> 3) insert overwrite directory "/tmp/index_result" select '_bucketname','_offsets' from default__foo_foosample__ where origin='WAW';
> 4) set hive.index.compact.file=/tmp/index_result;
> 5) set hive.input.format=org.apache.hadoop.hive.ql.index.compact.HiveCompactIndexInputFormat;
> 6) select * from foo where origin='WAW';
> Total MapReduce jobs = 1
> Launching Job 1 out of 1
> Number of reduce tasks is set to 0 since there's no reduce operator
> java.lang.NumberFormatException: For input string: "_offsets"
>     at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
>     at java.lang.Long.parseLong(Long.java:410)
>     at java.lang.Long.parseLong(Long.java:468)
>     at org.apache.hadoop.hive.ql.index.compact.HiveCompactIndexResult.add(HiveCompactIndexResult.java:158)
>     at org.apache.hadoop.hive.ql.index.compact.HiveCompactIndexResult.<init>(HiveCompactIndexResult.java:107)
>     at org.apache.hadoop.hive.ql.index.compact.HiveCompactIndexInputFormat.getSplits(HiveCompactIndexInputFormat.java:89)
>     at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
>     at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>     at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>     at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:657)
>     at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:123)
>     at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:130)
>     at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
>     at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1063)
>     at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:900)
>     at org.apache.hadoop.hive.ql.Driver.run(Driver.java:748)
>     at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:164)
>     at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:241)
>     at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:456)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>     at java.lang.reflect.Method.invoke(Method.java:597)
>     at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
> Job Submission failed with exception 'java.lang.NumberFormatException(For input string: "_offsets")'
> FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask
> Steps 2 and 3 each ran a successful MapReduce job, and the index table 
> default__foo_foosample__ has data in three columns: origin, _bucketname 
> and _offsets.
> Thanks,
> Siddharth
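
The trace in the quoted report shows Long.parseLong being handed the literal text "_offsets". One possible reading, offered as an assumption and not as a confirmed diagnosis of this issue, is that step 3 wrote the column names themselves into /tmp/index_result, because HiveQL treats single-quoted tokens as string literals. The Java sketch below only reproduces the parse failure in isolation. The class and method names are hypothetical and this is not the code in HiveCompactIndexResult.

```java
// Hypothetical sketch: reproduces the parse failure seen in the trace.
// It is NOT Hive's HiveCompactIndexResult implementation.
public class OffsetParseSketch {

    // Parses a single offset token with Long.parseLong, the call that
    // appears at the top of the reported stack trace.
    public static long parseOffset(String token) {
        return Long.parseLong(token);
    }

    public static void main(String[] args) {
        // A numeric token parses normally.
        System.out.println("parsed: " + parseOffset("2048"));

        try {
            // The literal column name is not a number, so parsing throws.
            parseOffset("_offsets");
        } catch (NumberFormatException e) {
            System.out.println("failed: " + e.getMessage());
        }
    }
}
```

If that reading is right, selecting the columns with backtick-quoted identifiers in step 3 (`_bucketname`, `_offsets`) would write the stored values in place of the literal strings. That has not been verified against this report.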

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
