[ https://issues.apache.org/jira/browse/OOZIE-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15935549#comment-15935549 ]
Robert Kanter commented on OOZIE-2819: -------------------------------------- Thanks [~asasvari], I've pushed in the amendment patch. > Make Oozie REST API accept multibyte characters for script Actions > ------------------------------------------------------------------ > > Key: OOZIE-2819 > URL: https://issues.apache.org/jira/browse/OOZIE-2819 > Project: Oozie > Issue Type: Bug > Reporter: Attila Sasvari > Assignee: Attila Sasvari > Fix For: 5.0.0 > > Attachments: OOZIE-2819-00.patch, OOZIE-2819-01.patch, > OOZIE-2819-02.patch, OOZIE-2819-03-amendment.patch, OOZIE-2819-03.patch > > > Submitted Pig action with client side xml failed via proxy submission when it > contained multibyte characters. > {code} > curl -i -X POST -d @/tmp/pig.xml -H 'Content-Type: application/XML; > charset=UTF-8' > 'http://'localhost':11000/oozie/v1/jobs?jobtype=pig&action=start' > {code} > Where > {code} > $ hdfs dfs -cat /tmp/encoding/input.txt > 松 > 林檎 > 松 > {code} > {code} > $ cat /tmp/pig.xml > <configuration> > <property> > <name>fs.default.name</name> > <value>hdfs://localhost:8020/</value> > </property> > <property> > <name>mapred.job.tracker</name> > <value>localhost:8032</value> > </property> > <property> > <name>user.name</name> > <value>hdfs</value> > </property> > <property> > <name>oozie.pig.script</name> > <value><![CDATA[ > lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line; > test = FILTER lines BY line == '松'; > STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n'); > ]]></value> > </property> > <property> > <name>oozie.pig.script.params.size</name> > <value>0</value> > </property> > <property> > <name>oozie.pig.script.options.size</name> > <value>0</value> > </property> > <property> > <name>oozie.libpath</name> > <value>hdfs:///user/oozie/share/lib</value> > </property> > <property> > <name>oozie.use.system.libpath</name> > <value>true</value> > </property> > <property> > <name>oozie.proxysubmission</name> > <value>true</value> > </property> > </configuration> > {code} > In the Oozie launcher log, I could see > {code} > lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS > line;test = FILTER lines BY line == '~';STORE test INTO > 'hdfs:///tmp/encoding/output' USING PigStorage('\n'); > {code} > was used instead of the intended 松 -- This message was sent by Atlassian JIRA (v6.3.15#6346)