Document  in Load statement syntax that Pig and underlying M/R does not handle 
concatenated bz2 and gz files correctly
----------------------------------------------------------------------------------------------------------------------

                 Key: PIG-1305
                 URL: https://issues.apache.org/jira/browse/PIG-1305
             Project: Pig
          Issue Type: Bug
          Components: documentation
            Reporter: Viraj Bhat
             Fix For: 0.7.0


The Pig Reference Manual needs to be updated:

Relational Operators

Syntax:

LOAD 'data' [USING function] [AS schema];

'data' 

Please note:
Pig reads in both bz2 and gz formats correctly as long as they are not 
concatenated gzip or bz2 generated in this manner. cat *.bz2  > 
text/concat.bz2. Your M/R jobs may succeed but the results will not be accurate.

Viraj

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to