subject:"\[jira\] \[Commented\] \(SPARK\-30242\) Support reading Parquet files from Stream Buffer"

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

2019-12-16 Thread Hyukjin Kwon (Jira)



[ 
https://issues.apache.org/jira/browse/SPARK-30242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16997779#comment-16997779
 ] 

Hyukjin Kwon commented on SPARK-30242:
--

Nope, I don't think it will be able as it requires to change too many APIs 
(e.g, ORC, CSV, Json, Text) but it can be easily worked around by writing out 
to the local directory and read it back.

> Support reading Parquet files from Stream Buffer
> 
>
> Key: SPARK-30242
> URL: https://issues.apache.org/jira/browse/SPARK-30242
> Project: Spark
>  Issue Type: Wish
>  Components: PySpark
>Affects Versions: 3.0.0
>Reporter: Jelther Oliveira Gonçalves
>Priority: Trivial
>
> Reading from a Python BufferIO a parquet is not possible using Pyspark.
> Using:
>  
> {code:java}
> from io import BytesIO
> parquetbytes : Bytes = b'PAR...'
> df = spark.read.format("parquet").load(BytesIO(parquetbytes))
> {code}
> Raises :
> {code:java}
> java.lang.ClassCastException: java.util.ArrayList cannot be cast to 
> java.lang.String{code}
>  
> Is there any chance this will be available in the future?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

2019-12-13 Thread Jira



[ 
https://issues.apache.org/jira/browse/SPARK-30242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16996046#comment-16996046
 ] 

Jelther Oliveira Gonçalves commented on SPARK-30242:


Hi [~dongjoon], thanks for the update.
I've seen you have changed already.

Thanks.

> Support reading Parquet files from Stream Buffer
> 
>
> Key: SPARK-30242
> URL: https://issues.apache.org/jira/browse/SPARK-30242
> Project: Spark
>  Issue Type: Wish
>  Components: PySpark
>Affects Versions: 3.0.0
>Reporter: Jelther Oliveira Gonçalves
>Priority: Trivial
>
> Reading from a Python BufferIO a parquet is not possible using Pyspark.
> Using:
>  
> {code:java}
> from io import BytesIO
> parquetbytes : Bytes = b'PAR...'
> df = spark.read.format("parquet").load(BytesIO(parquetbytes))
> {code}
> Raises :
> {code:java}
> java.lang.ClassCastException: java.util.ArrayList cannot be cast to 
> java.lang.String{code}
>  
> Is there any chance this will be available in the future?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

2019-12-13 Thread Dongjoon Hyun (Jira)



[ 
https://issues.apache.org/jira/browse/SPARK-30242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16996045#comment-16996045
 ] 

Dongjoon Hyun commented on SPARK-30242:
---

Hi, [~jetolgon]. Thank you for suggestion. For the new feature, you need to set 
the next version of master branch. As of today, it's 3.0.0 .

> Support reading Parquet files from Stream Buffer
> 
>
> Key: SPARK-30242
> URL: https://issues.apache.org/jira/browse/SPARK-30242
> Project: Spark
>  Issue Type: Wish
>  Components: Spark Core
>Affects Versions: 2.4.4
>Reporter: Jelther Oliveira Gonçalves
>Priority: Trivial
>
> Reading from a Python BufferIO a parquet is not possible using Pyspark.
> Using:
>  
> {code:java}
> from io import BytesIO
> parquetbytes : Bytes = b'PAR...'
> df = spark.read.format("parquet").load(BytesIO(parquetbytes))
> {code}
> Raises :
> {code:java}
> java.lang.ClassCastException: java.util.ArrayList cannot be cast to 
> java.lang.String{code}
>  
> Is there any chance this will be available in the future?
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

3 matches

Site Navigation

Mail list logo

Footer information