Re: Error reading data from Cassandra

Jeremy Hanna Tue, 05 Apr 2011 07:20:06 -0700

Fabio,

Could you post the full stack trace from the pig_<long number>.log file 
in the directory where you ran Pig?
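
In case it helps to locate that file, here is a minimal sketch in Python. It assumes the log follows the pig_<long number>.log naming mentioned above and sits in the directory the script is run from. The helper name latest_pig_log is made up for illustration and is not part of Pig.

```python
import glob
import os


def latest_pig_log(directory="."):
    """Return the most recently modified pig_*.log in `directory`, or None.

    Hypothetical helper: it only matches on the file name pattern and
    picks the newest file by modification time.
    """
    candidates = glob.glob(os.path.join(directory, "pig_*.log"))
    if not candidates:
        return None
    return max(candidates, key=os.path.getmtime)


if __name__ == "__main__":
    path = latest_pig_log()
    if path is None:
        print("No pig_*.log file found in the current directory")
    else:
        # Print the file name, then the whole log so the full trace can be copied.
        print(path)
        with open(path, errors="replace") as handle:
            print(handle.read())
```

If several Pig sessions were run from the same directory, the newest file may not be the one from the failing run, so it is worth checking the timestamps inside the log against the console output.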


Thanks,

Jeremy

On Apr 5, 2011, at 8:42 AM, Fabio Souto wrote:

> Hello,
> 
> I have installed Pig 0.8.0 and Cassandra 0.7.4 and I'm not able to read data 
> from Cassandra. I wrote a simple query just to test:
> 
> grunt> A = LOAD 'cassandra://msg_keyspace/messages' USING org.apache.cassandra.hadoop.pig.CassandraStorage();
> 
> grunt> dump A;
> 
> 
> And I'm getting the following error:
> ==========================================================================
> 2011-04-05 15:33:57,669 [main] INFO  
> org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: 
> UNKNOWN
> 2011-04-05 15:33:57,669 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - 
> pig.usenewlogicalplan is set to true. New logical plan will be used.
> 2011-04-05 15:33:57,819 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - (Name: A: 
> Store(hdfs://localhost/tmp/temp2037710644/tmp-29784200:org.apache.pig.impl.io.InterStorage)
>  - scope-1 Operator Key: scope-1)
> 2011-04-05 15:33:57,850 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - 
> File concatenation threshold: 100 optimistic? false
> 2011-04-05 15:33:57,877 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
>  - MR plan size before optimization: 1
> 2011-04-05 15:33:57,877 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
>  - MR plan size after optimization: 1
> 2011-04-05 15:33:57,969 [main] INFO  
> org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to 
> the job
> 2011-04-05 15:33:57,990 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
>  - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
> 2011-04-05 15:34:03,376 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
>  - Setting up single store job
> 2011-04-05 15:34:03,416 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - 1 map-reduce job(s) waiting for submission.
> 2011-04-05 15:34:03,929 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - 0% complete
> 2011-04-05 15:34:04,597 [Thread-5] INFO  
> org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input 
> paths (combined) to process : 1
> 2011-04-05 15:34:05,942 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - HadoopJobId: job_201104051459_0008
> 2011-04-05 15:34:05,943 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - More information at: 
> http://localhost:50030/jobdetails.jsp?jobid=job_201104051459_0008
> 2011-04-05 15:34:35,912 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - job job_201104051459_0008 has failed! Stop running all dependent jobs
> 2011-04-05 15:34:35,918 [main] INFO  
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - 100% complete
> 2011-04-05 15:34:35,931 [main] ERROR org.apache.pig.tools.pigstats.PigStats - 
> ERROR 2997: Unable to recreate exception from backed error: 
> java.lang.NumberFormatException: null
> 2011-04-05 15:34:35,931 [main] ERROR 
> org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!
> 2011-04-05 15:34:35,933 [main] INFO  org.apache.pig.tools.pigstats.PigStats - 
> Script Statistics: 
> 
> HadoopVersion  PigVersion      UserId  StartedAt            FinishedAt           Features
> 0.20.2-CDH3B4  0.8.0-SNAPSHOT  root    2011-04-05 15:33:57  2011-04-05 15:34:35  UNKNOWN
> 
> Failed!
> 
> Failed Jobs:
> JobId                  Alias  Feature   Message                          Outputs
> job_201104051459_0008  A      MAP_ONLY  Message: Job failed! Error - NA  hdfs://localhost/tmp/temp2037710644/tmp-29784200,
> 
> Input(s):
> Failed to read data from "cassandra://msg_keyspace/messages"
> 
> Output(s):
> Failed to produce result in "hdfs://localhost/tmp/temp2037710644/tmp-29784200"
> ==========================================================================
> 
> Any idea how to fix this?
> Cheers
