[ 
https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16489546#comment-16489546
 ] 

Hyukjin Kwon edited comment on SPARK-24358 at 5/24/18 6:31 PM:
---------------------------------------------------------------

Yeah, I know the differences and the rationale here. We need strong evidence 
and reasons to accept the divergence. We also need to look through the 
PySpark code base and check for similar divergences.
FWIW, I was trying to look into and fix the differences among bytes, str and 
unicode, but I am currently stuck due to other work.




was (Author: hyukjin.kwon):
Yeah, I know the differences and the rationale here. We need strong evidence 
and reasons to accept the divergence. We also need to look through the 
PySpark code base and check for similar divergences.
FWIW, I was trying to look into and fix the differences among bytes, str and 
unicode, but I am currently stuck because I am swamped with other work.



> createDataFrame in Python 3 should be able to infer bytes type as Binary type
> -----------------------------------------------------------------------------
>
>                 Key: SPARK-24358
>                 URL: https://issues.apache.org/jira/browse/SPARK-24358
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.0
>            Reporter: Joel Croteau
>            Priority: Minor
>              Labels: Python3
>
> createDataFrame can infer Python 3's bytearray type as a Binary. Since bytes 
> is just the immutable, hashable version of this same structure, it makes 
> sense for the same thing to apply there.
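
The request in the description can be illustrated with a small, self-contained
sketch. It does not use PySpark and is not PySpark's actual inference code: the
table `_TYPE_NAMES` and the function `infer_type_name` are hypothetical names,
and the strings such as "BinaryType" only stand in for Spark SQL types. It
assumes Python 3, where bytes and str are distinct types.

```python
# Hypothetical sketch of type inference by Python type (Python 3 only).
# The values are plain strings standing in for Spark SQL type names;
# this is not PySpark's real implementation.
_TYPE_NAMES = {
    bool: "BooleanType",
    int: "LongType",
    float: "DoubleType",
    str: "StringType",
    bytearray: "BinaryType",
    bytes: "BinaryType",  # the extra mapping this issue asks for
}


def infer_type_name(value):
    """Return the stand-in type name for a Python 3 value."""
    try:
        return _TYPE_NAMES[type(value)]
    except KeyError:
        raise TypeError("not supported type: %s" % type(value))


# bytes and bytearray hold the same data; bytes is the immutable,
# hashable variant, so both are treated as binary here.
assert bytes(bytearray(b"abc")) == b"abc"
assert infer_type_name(bytearray(b"abc")) == infer_type_name(b"abc")
```

Whether bytes should map this way in Python 2, where bytes is an alias of str,
is the divergence discussed in the comment above; the sketch does not address
it.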



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
