dear all,
---------
Key: PIG-2500
URL: https://issues.apache.org/jira/browse/PIG-2500
Project: Pig
Issue Type: Bug
Reporter: vaibhav
input file :
cat input13
(3,8,9) {(3,8,9)} [open#apache]
(1,4,7) {(1,4,7)} [apache#hadoop]
(2,5,8) {(2,5,8)} [open#apache]
A = LOAD '/data/input13' AS (T1:tuple(f1:int, f2:int),
B:bag{T2:tuple(t1:float,t2:float)}, M:map[] );
output :
dump A ;
(3,),,)
((1,),,)
((2,),,)
but it should be the same as input?
2)
cat input15
(3,8,9) (mary,19)
(1,4,7) (john,18)
(2,5,8) (joe,18)
o/p
((3,8,9),)
((1,4,7),)
((2,5,8),)
---------------------------------------
first logs
--------------------------------------------------------------------------------
grunt> A = LOAD '/data/input13' AS (T1:tuple(f1:int, f2:int),
B:bag{T2:tuple(t1:float,t2:float)}, M:map[] );
grunt> dump A ;
2012-02-01 20:22:14,025 [main] INFO
org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned for A
2012-02-01 20:22:14,025 [main] INFO
org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned
for A
2012-02-01 20:22:14,032 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:22:14,034 [main] INFO
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - (Name:
Store(hdfs://localhost:54310/tmp/temp537168513/tmp899939258:org.apache.pig.builtin.BinStorage)
- 1-246 Operator Key: 1-246)
2012-02-01 20:22:14,035 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size before optimization: 1
2012-02-01 20:22:14,035 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size after optimization: 1
2012-02-01 20:22:14,040 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:22:14,040 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:22:14,040 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
- mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2012-02-01 20:22:15,334 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
- Setting up single store job
2012-02-01 20:22:15,335 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:22:15,336 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 1 map-reduce job(s) waiting for submission.
2012-02-01 20:22:15,336 [Thread-149] WARN org.apache.hadoop.mapred.JobClient -
Use GenericOptionsParser for parsing the arguments. Applications should
implement Tool for the same.
2012-02-01 20:22:15,378 [Thread-149] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,382 [Thread-149] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,388 [Thread-149] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:22:15,388 [Thread-149] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
2012-02-01 20:22:15,425 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,427 [Thread-157] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:22:15,427 [Thread-157] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
2012-02-01 20:22:15,437 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,438 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,440 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,444 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,463 [Thread-157] INFO org.apache.hadoop.mapred.TaskRunner
- Task:attempt_local_0011_m_000000_0 is done. And is in the process of commiting
2012-02-01 20:22:15,464 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,465 [Thread-157] INFO
org.apache.hadoop.mapred.LocalJobRunner -
2012-02-01 20:22:15,465 [Thread-157] INFO org.apache.hadoop.mapred.TaskRunner
- Task attempt_local_0011_m_000000_0 is allowed to commit now
2012-02-01 20:22:15,466 [Thread-157] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:22:15,471 [Thread-157] INFO
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of
task 'attempt_local_0011_m_000000_0' to
hdfs://localhost:54310/tmp/temp537168513/tmp899939258
2012-02-01 20:22:15,471 [Thread-157] INFO
org.apache.hadoop.mapred.LocalJobRunner -
2012-02-01 20:22:15,471 [Thread-157] INFO org.apache.hadoop.mapred.TaskRunner
- Task 'attempt_local_0011_m_000000_0' done.
2012-02-01 20:22:15,837 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- HadoopJobId: job_local_0011
2012-02-01 20:22:15,837 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 0% complete
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 100% complete
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Successfully stored result in:
"hdfs://localhost:54310/tmp/temp537168513/tmp899939258"
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Records written : 0
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Bytes written : 0
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Spillable Memory Manager spill count : 0
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Proactive spill count : 0
2012-02-01 20:22:20,843 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Success!
2012-02-01 20:22:20,855 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:22:20,859 [main] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:22:20,859 [main] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
((3,),,)
((1,),,)
((2,),,)
----------------------------------------------------------------------
second logs
grunt> D = LOAD '/data/input15' AS
(F:tuple(f1:int,f2:int,f3:int),T:tuple(t1:chararray,t2:int));
grunt> dump D ;
2012-02-01 20:28:32,287 [main] INFO
org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned for D
2012-02-01 20:28:32,287 [main] INFO
org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned
for D
2012-02-01 20:28:32,330 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Initializing JVM Metrics with processName=JobTracker, sessionId=
2012-02-01 20:28:32,399 [main] INFO
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - (Name:
Store(hdfs://localhost:54310/tmp/temp2086651143/tmp-1826566586:org.apache.pig.builtin.BinStorage)
- 1-14 Operator Key: 1-14)
2012-02-01 20:28:32,428 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size before optimization: 1
2012-02-01 20:28:32,428 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size after optimization: 1
2012-02-01 20:28:32,443 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:28:32,448 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:28:32,448 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
- mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2012-02-01 20:28:33,845 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
- Setting up single store job
2012-02-01 20:28:33,869 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:28:33,870 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 1 map-reduce job(s) waiting for submission.
2012-02-01 20:28:33,873 [Thread-8] WARN org.apache.hadoop.mapred.JobClient -
Use GenericOptionsParser for parsing the arguments. Applications should
implement Tool for the same.
2012-02-01 20:28:33,968 [Thread-8] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:33,977 [Thread-8] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:33,985 [Thread-8] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:28:33,985 [Thread-8] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
2012-02-01 20:28:34,102 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,104 [Thread-17] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:28:34,105 [Thread-17] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
2012-02-01 20:28:34,136 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,145 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,149 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,153 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,179 [Thread-17] INFO org.apache.hadoop.mapred.TaskRunner -
Task:attempt_local_0001_m_000000_0 is done. And is in the process of commiting
2012-02-01 20:28:34,181 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,183 [Thread-17] INFO
org.apache.hadoop.mapred.LocalJobRunner -
2012-02-01 20:28:34,184 [Thread-17] INFO org.apache.hadoop.mapred.TaskRunner -
Task attempt_local_0001_m_000000_0 is allowed to commit now
2012-02-01 20:28:34,185 [Thread-17] INFO
org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
2012-02-01 20:28:34,192 [Thread-17] INFO
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of
task 'attempt_local_0001_m_000000_0' to
hdfs://localhost:54310/tmp/temp2086651143/tmp-1826566586
2012-02-01 20:28:34,193 [Thread-17] INFO
org.apache.hadoop.mapred.LocalJobRunner -
2012-02-01 20:28:34,193 [Thread-17] INFO org.apache.hadoop.mapred.TaskRunner -
Task 'attempt_local_0001_m_000000_0' done.
2012-02-01 20:28:34,371 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- HadoopJobId: job_local_0001
2012-02-01 20:28:34,372 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 0% complete
2012-02-01 20:28:39,379 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- 100% complete
2012-02-01 20:28:39,379 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Successfully stored result in:
"hdfs://localhost:54310/tmp/temp2086651143/tmp-1826566586"
2012-02-01 20:28:39,380 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Records written : 0
2012-02-01 20:28:39,380 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Bytes written : 0
2012-02-01 20:28:39,381 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Spillable Memory Manager spill count : 0
2012-02-01 20:28:39,381 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Proactive spill count : 0
2012-02-01 20:28:39,381 [main] INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
- Success!
2012-02-01 20:28:39,394 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics -
Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already
initialized
2012-02-01 20:28:39,400 [main] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 1
2012-02-01 20:28:39,400 [main] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
((3,8,9),)
((1,4,7),)
((2,5,8),)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira