Hi Will,

You're right that PigStorage doesn't handle \n. PigStorage is really a
dummy reference implementation of Load/StoreFunc, so I'd not recommend to
use it in production. In particular, when you have complex data structures
and special characters in data, advanced file formats work far better. Will
the built-in 
ParquetLoader<http://pig.apache.org/docs/r0.12.0/api/org/apache/pig/builtin/ParquetLoader.html>/Storer
or 
AvroStorage<http://pig.apache.org/docs/r0.12.0/api/org/apache/pig/builtin/AvroStorage.html>work
for you?

Thanks,
Cheolsoo


On Wed, Mar 26, 2014 at 10:51 AM, William Oberman
<ober...@civicscience.com>wrote:

> I was debugging some warnings in a script I had:
> FIELD_DISCARDED_TYPE_CONVERSION_FAILED
> ACCESSING_NON_EXISTENT_FIELD
>
> I got it down to basically these two lines:
> --foo was stored using PigStorage
> foo = LOAD '....' AS (key:chararray, value:map[chararray]);
> STORE foo INTO '...';
>
> The problem is some of the map values have line feeds (\n) in them, which I
> think breaks the PigStorage mind in the load path.
>
> Bug?  Or is it "user error" to allow map values with \n's in them.  I mean,
> I agree it's weird.  But, I didn't expect Pig to have such trouble with
> it...
>
> will
>

Reply via email to