[ 
https://issues.apache.org/jira/browse/PIG-497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pradeep Kamath updated PIG-497:
-------------------------------

    Assignee: Pradeep Kamath  (was: Santhosh Srinivasan)
      Status: Patch Available  (was: Open)

Patch attached.
There were three issues which were resolved:
- DataReaderWriter was using DataOutput.writeBytes(String) instead of 
DataOutput.writeUTF(String). Likewise it was using DataInput.readFully(bytes[]) 
instead of DataInput.readUTF(). The earlier calls get only lower 8bits out of 
each character in the string which would mess up multi byte UTF8 data
- illustrate and dump eventually use System.out.println to output results and 
System.out.println() writes bytes in platform default encoding which is 
typically UTF-16. This was changed to System.write(String.getBytes("UTF-8")

> dump does not deal with non-ascii data
> --------------------------------------
>
>                 Key: PIG-497
>                 URL: https://issues.apache.org/jira/browse/PIG-497
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: types_branch
>            Reporter: Olga Natkovich
>            Assignee: Pradeep Kamath
>             Fix For: types_branch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to