[ 
https://issues.apache.org/jira/browse/SPARK-26496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16730619#comment-16730619
 ] 

Hyukjin Kwon commented on SPARK-26496:
--------------------------------------

I think we should fix it to nextFloat.toString. Similar fix was made in 
SPARK-19613 before.

> Test "locality preferences of StateStoreAwareZippedRDD" frequently fails on 
> High Sierra
> ---------------------------------------------------------------------------------------
>
>                 Key: SPARK-26496
>                 URL: https://issues.apache.org/jira/browse/SPARK-26496
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.4.0
>         Environment: Mac OS X High Sierra
>            Reporter: Bruce Robbins
>            Priority: Minor
>
> This is a bit esoteric and minor, but makes it difficult to run SQL unit 
> tests successfully on High Sierra.
> StreamingInnerJoinSuite."locality preferences of StateStoreAwareZippedRDD" 
> generates a directory name using {{Random.nextString(10)}}, and frequently 
> that directory name is unacceptable to High Sierra.
> For example:
> {noformat}
> scala> val prefix = Random.nextString(10); val dir = new File("/tmp", "del_" 
> + prefix + "-" + UUID.randomUUID.toString); dir.mkdirs()
> prefix: String = 媈ᒢ탊渓뀟?녛ꃲ싢櫦
> dir: java.io.File = /tmp/del_媈ᒢ탊渓뀟?녛ꃲ싢櫦-aff57fc6-ca38-4825-b4f3-473140edd4f6
> res39: Boolean = true // this one was OK
> scala> val prefix = Random.nextString(10); val dir = new File("/tmp", "del_" 
> + prefix + "-" + UUID.randomUUID.toString); dir.mkdirs()
> prefix: String = 窽텘⒘駖ⵚ駢⡞Ρ닋੎
> dir: java.io.File = /tmp/del_窽텘⒘駖ⵚ駢⡞Ρ닋੎-a3f99855-c429-47a0-a108-47bca6905745
> res40: Boolean = false  // nope, didn't like this one
> scala> prefix.foreach(x => printf("%04x ", x.toInt))
> 7abd d158 2498 99d6 2d5a 99e2 285e 03a1 b2cb 0a4e 
> scala> prefix(9)
> res46: Char = ੎
> scala> val prefix = "\u7abd"
> prefix: String = 窽
> scala> val dir = new File("/tmp", "del_" + prefix + "-" + 
> UUID.randomUUID.toString); dir.mkdirs()
> dir: java.io.File = /tmp/del_窽-d1c3af34-d34d-43fe-afed-ccef9a800ff4
> res47: Boolean = true // it's OK with \u7abd
> scala> val prefix = "\u0a4e"
> prefix: String = ੎
> scala> val dir = new File("/tmp", "del_" + prefix + "-" + 
> UUID.randomUUID.toString); dir.mkdirs()
> dir: java.io.File = /tmp/del_੎-3654a34c-6f74-4591-85af-a0f28b675a6f
> res50: Boolean = false // doesn't like \u0a4e
> {noformat}
> I thought it might have something to do with my Java 8 version, but Python is 
> equally affected:
> {noformat}
> >>> f = open(u"/tmp/del_\u7abd_file", "wb")
> f = open(u"/tmp/del_\u7abd_file", "wb")
> >>> f.write("hello\n")
> f.write("hello\n")
> # it's OK with \u7abd
> >>> f2 = open(u"/tmp/del_\u0a4e_file", "wb")
> f2 = open(u"/tmp/del_\u0a4e_file", "wb")
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> IOError: [Errno 92] Illegal byte sequence: u'/tmp/del_\u0a4e_file'
> # doesn't like \u0a4e
> >>> f2 = open(u"/tmp/del_\ufa4e_file", "wb")
> f2 = open(u"/tmp/del_\ufa4e_file", "wb")
> # a little change and it's happy again
> >>> 
> {noformat}
> Mac OS X Sierra is perfectly happy with these characters. This seems to be a 
> limitation introduced by High Sierra.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to