Hi Thejas,

Thanks again for your help. When I omit the SrcFuid "qualifier" and use the 
form you suggest, I get this error (that was actually the reason I tried 
SrcFuid.<field> to start with.)

Pig Stack Trace
---------------
ERROR 1025: Found more than one match: SrcFuid::citingdocid, 
NewCitationRel::citingdocid

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1000: Error during 
parsing. Found more than one match: SrcFuid::citingdocid, 
NewCitationRel::citingdocid
        at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1607)
        at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1551)
        at org.apache.pig.PigServer.registerQuery(PigServer.java:523)
        at 
org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:868)
        at 
org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:388)
        at 
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:165)
        at 
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:141)
        at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:90)
        at org.apache.pig.Main.run(Main.java:510)
        at org.apache.pig.Main.main(Main.java:107)
Caused by: org.apache.pig.impl.logicalLayer.parser.ParseException: Found more 
than one match: SrcFuid::citingdocid, NewCitationRel::citingdocid
        at 
org.apache.pig.impl.logicalLayer.parser.QueryParser.AliasFieldOrSpec(QueryParser.java:7418)
        at 
org.apache.pig.impl.logicalLayer.parser.QueryParser.ColOrSpec(QueryParser.java:7226)
        at 
org.apache.pig.impl.logicalLayer.parser.QueryParser.BaseEvalSpec(QueryParser.java:5297)


But the good news is that I combined this suggestion with Shawn's and found 
that this works:

TCGroupedByFuid = group TCRaw by (SrcFuid::citingdocid, SrcFuid::col, 
SrcFuid::seq);

Thanks Thejas and Shawn!

Will


William F Dowling
Sr Technical Specialist, Software Engineering
Thomson Reuters
0 +1 215 823 3853


-----Original Message-----
From: Thejas M Nair [mailto:[email protected]] 
Sent: Wednesday, April 06, 2011 3:31 PM
To: [email protected]; Dowling, William (Hlthcr&Science)
Subject: Re: Internal error 2999 - misuse of CONCAT? misuse of GROUP?

In the relation TCRaw, there is no column called SrcFuid.
As a result, you end up using this feature -
http://pig.apache.org/docs/r0.8.0/piglatin_ref2.html#Casting+Relations+to+Sc
alars .


Change your statement to -
 TCGroupedByFuid = group TCRaw by (citingdocid,
                                          col,
                                         seq);

Thanks,
Thejas

On 4/6/11 11:09 AM, "[email protected]"
<[email protected]> wrote:

> 
> 
>> Do you need the group-key to be concatenated ? If not, you can just group on
>> all the three columns -
> 
>> TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
>                                          SrcFuid.col,
>                                         SrcFuid.seq);
> 
> Hi Thejas,
> 
> I had tried that originally before introducing CONCAT(), but I got this error
> message:
> 
> ERROR 0: Scalar has more than one row in the output.
>  1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)
> 
> I don't understand that, since TCRaw is
> 
> (14159274,BCI,6,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,BCI,6,14159274,14159163,WOS,11,1999,WOS.SCI)
> (14159274,WOS,16,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,WOS,16,14159274,14159163,WOS,11,1999,WOS.SCI)
> 
> and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a
> member of SrcFuid). So I think my understanding of GROUP is incorrect.
> 
> Thanks for your help!
> 
> Will
> 
> 


Reply via email to