[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-18 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

Unit test failure due to port conflict. Manual test successful. Patch committed.

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: PIG-1289-1.patch, PIG-1289-2.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-16 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Status: Open  (was: Patch Available)

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: PIG-1289-1.patch, PIG-1289-2.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-16 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Status: Patch Available  (was: Open)

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: PIG-1289-1.patch, PIG-1289-2.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-16 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Attachment: PIG-1289-2.patch

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: PIG-1289-1.patch, PIG-1289-2.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-16 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Fix Version/s: 0.7.0

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: PIG-1289-1.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-11 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Attachment: PIG-1289-1.patch

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Attachments: PIG-1289-1.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-03-11 Thread Daniel Dai (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Dai updated PIG-1289:


Status: Patch Available  (was: Open)

> PIG Join fails while doing a filter on joined data
> --
>
> Key: PIG-1289
> URL: https://issues.apache.org/jira/browse/PIG-1289
> Project: Pig
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Karim Saadah
>Assignee: Daniel Dai
>Priority: Minor
> Attachments: PIG-1289-1.patch
>
>
> PIG Join fails while doing a filter on joined data
> Here are the steps to reproduce it:
> -bash-3.1$ pig -latest -x local
> grunt> a = load 'first.dat' using PigStorage('\u0001') as (f1:int, 
> f2:chararray);
> grunt> DUMP a;
> (1,A)
> (2,B)
> (3,C)
> (4,D)
> grunt> b = load 'second.dat' using PigStorage() as (f3:chararray);
> grunt> DUMP b;
> (A)
> (D)
> (E)
> grunt> c = join a by f2 LEFT OUTER, b by f3;
> grunt> DUMP c;
> (1,A,A)
> (2,B,)
> (3,C,)
> (4,D,D)
> grunt> describe c;
> c: {a::f1: int,a::f2: chararray,b::f3: chararray}
> grunt> d = filter c by (f3 is null or f3 =='');
> grunt> dump d;
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for b
> 2010-03-03 15:00:37,129 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned 
> for a
> 2010-03-03 15:00:37,130 [main] INFO  
> org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned 
> for a
> 2010-03-03 15:00:37,130 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 
> 1002: Unable to store alias d
> This one is failing too:
> grunt> d = filter c by (b::f3 is null or b::f3 =='');
> or this one not returning results as expected:
> grunt> d = foreach c generate f1 as f1, f2 as f2, f3 as f3;
> grunt> e = filter d by (f3 is null or f3 =='');
> grunt> DUMP e;
> (1,A,)
> (2,B,)
> (3,C,)
> (4,D,)
> while the expected result is
> (2,B,)
> (3,C,)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.