Do your schema inference and then apply the JSON schema using withColumn 
overwriting the String representation

 

From: Nirav Patel <npa...@xactlycorp.com>
Date: Tuesday, October 2, 2018 at 5:00 PM
To: <brandonge...@gmail.com>
Cc: spark users <user@spark.apache.org>
Subject: Re: CSV parser - how to parse column containing json data

 

I need to inferSchema from CSV as well. As per your solution, I am creating 
SructType only for Json field. So how am I going to mix and match here? i.e. do 
type inference for all fields but json field and use custom json_schema for 
json field. 

 

 

 

 

 

On Thu, Aug 30, 2018 at 5:29 PM Brandon Geise <brandonge...@gmail.com> wrote:

If you know your json schema you can create a struct and then apply that using 
from_json:

 

val json_schema = StructType(Array(StructField(“x”, StringType, true), 
StructField(“y”, StringType, true), StructField(“z”, IntegerType, true)))

 

.withColumn("_c3", from_json(col("_c3_signals"),json_schema))

 

From: Nirav Patel <npa...@xactlycorp.com>
Date: Thursday, August 30, 2018 at 7:19 PM
To: spark users <user@spark.apache.org>
Subject: CSV parser - how to parse column containing json data

 

Is there a way to parse csv file with some column in middle containing json 
data structure?

 

"a",102,"c","{"x":"xx","y":false,"z":123}","d","e",102.2

 

 

Thanks,

Nirav






        






        

Reply via email to