I may be missing something super obvious here but can't you combine them into a single dataframe. Left join perhaps?
Try writing it in sql " select a from json1 and b from josn2"then run explain to give you a hint to how to do it in code Regards Sam On Tue, 14 Feb 2017 at 14:30, Aseem Bansal <asmbans...@gmail.com> wrote: > Say I have two files containing single rows > > json1.json > > {"a": 1} > > json2.json > > {"b": 2} > > I read in this json file using spark's API into a dataframe one at a time. > So I have > > Dataset json1DF > and > Dataset json2DF > > If I run "select a, b from __THIS__" in a SQLTransformer then I will get > an exception as for json1DF does not have "b" and json2DF does not have "a" > > How could I handle this situation with missing columns in JSON? >