I think I figured out this issue. For oozie to support unicode in path name, 
the JAVA_TOOL_OPTIONS in the JVM needs to use an encoding method that supports 
unicode. And the oozie DB needs to use an encoding method that's consistent 
with the JVM. By default, mysql, for example, uses "latin1" if running "create 
database oozie;". This won't work since "latin1" only supports Western 
characters.



On Friday, August 8, 2014 7:06 PM, bowen zhang <[email protected]> wrote:
 


here is an example:
cat job.properties:
nameNode=hdfs://localhost:9000
jobTracker=localhost:8032
queueName=default
examplesRoot=examples

oozie.wf.application.path=${nameNode}/user/${user.name}/${examplesRoot}/apps/map-reduceć–‡
outputDir=map-reduce



On Friday, August 8, 2014 6:55 PM, Alejandro Abdelnur <[email protected]> wrote:
 


hey bowen, it would help to know exactly where do you see issues, a simple wf 
showing the issue would be great

Alejandro
(phone typing)


> On Aug 8, 2014, at 18:12, bowen zhang <[email protected]> wrote:
> 
> Hi guys,
> I recently discovered a hole in oozie implementation where oozie breaks when 
> having hdfs app_path with unicode. If my app_path contains non-ascii 
> characters like Chinese words, the DB and server side will not retrieve it 
> correctly. Do we have strict utf-8 encoding throughout oozie? Any thoughts on 
> this?
> Bowen

Reply via email to