[jira] [Updated] (SPARK-3833) Allow Spark SQL SchemaRDDs to be merged

2015-09-16 Thread Sean Owen (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-3833:
-
Assignee: Michael Armbrust

> Allow Spark SQL SchemaRDDs to be merged
> ---
>
> Key: SPARK-3833
> URL: https://issues.apache.org/jira/browse/SPARK-3833
> Project: Spark
> Issue Type: Wish
> Components: SQL
> Reporter: Chris Wood
> Assignee: Michael Armbrust
> Fix For: 1.4.0
>
>
> We have JSON records flowing into Spark SQL.
> I can successfully store them as Parquet and read them with 
> sqlContext.jsonRDD, but the inferred schemas cannot be merged into a single 
> table for queries.
> I'd like a way to merge Parquet file schemas, whether they 
> match or not, since we know the schema should be the union of the schemas from 
> the files.
> This would let the data define the schema, so new columns 
> just appear.
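
The "union of the schemas" behaviour requested above can be illustrated with a minimal sketch in plain Python. This is not Spark's implementation and uses no Spark API; the helper names (`infer_schema`, `merge_schemas`, `conform`) and the sample records are hypothetical, and it assumes flat records in which a given field always has the same type.

```python
import json


def infer_schema(record):
    """Map each top-level field name to the Python type name of its value."""
    return {name: type(value).__name__ for name, value in record.items()}


def merge_schemas(schemas):
    """Union the field sets of all schemas; fail if a field has conflicting types."""
    merged = {}
    for schema in schemas:
        for name, type_name in schema.items():
            existing = merged.setdefault(name, type_name)
            if existing != type_name:
                raise ValueError(
                    f"conflicting types for {name!r}: {existing} vs {type_name}"
                )
    return merged


def conform(record, schema):
    """Fill fields missing from a record with None so every row has all columns."""
    return {name: record.get(name) for name in schema}


# Hypothetical input: two JSON records whose fields only partly overlap.
lines = ['{"id": 1, "name": "a"}', '{"id": 2, "city": "Oslo"}']
records = [json.loads(line) for line in lines]

merged = merge_schemas(infer_schema(r) for r in records)
rows = [conform(r, merged) for r in records]
```

Here a column that first appears in a later record ("city") is added to the merged schema, and earlier rows get None for it, which is the "new columns will just appear" behaviour described in the issue. A real implementation would also have to handle nested fields and type widening, which this sketch deliberately leaves out.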



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-3833) Allow Spark SQL SchemaRDDs to be merged

2014-10-07 Thread Chris Wood (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Wood updated SPARK-3833:
--
Summary: Allow Spark SQL SchemaRDDs to be merged  (was: Allow Spark SQL 
SchemaRDDs to me berged)

 Allow Spark SQL SchemaRDDs to be merged
 ---

 Key: SPARK-3833
 URL: https://issues.apache.org/jira/browse/SPARK-3833
 Project: Spark
 Issue Type: Wish
 Components: SQL
 Reporter: Chris Wood

 We have JSON records flowing into Spark SQL.
 I can successfully store them as Parquet and read them with 
 sqlContext.jsonRDD, but the inferred schemas cannot be merged into a single 
 table for queries.
 I'd like a way to merge Parquet file schemas, whether they 
 match or not, since we know the schema should be the union of the schemas from 
 the files.
 This would let the data define the schema, so new columns 
 just appear.


