Hi Michael thanks for the reply. I know that I can create DataFrame using
JavaBean or Struct Type I want to know how can I create DataFrame from
above code which is custom Hadoop format.

On Tue, Aug 11, 2015 at 12:04 AM, Michael Armbrust <mich...@databricks.com>
wrote:

> You can't create a DataFrame from an arbitrary object since we don't know
> how to figure out the schema.  You can either create a JavaBean
> <https://spark.apache.org/docs/latest/sql-programming-guide.html#programmatically-specifying-the-schema>
> or manually create a row + specify the schema
> <https://spark.apache.org/docs/latest/sql-programming-guide.html#programmatically-specifying-the-schema>
> .
>
>
>
> On Mon, Aug 10, 2015 at 11:22 AM, unk1102 <umesh.ka...@gmail.com> wrote:
>
>> Hi I have my own Hadoop custom InputFormat which I want to use in
>> DataFrame.
>> How do we do that? I know I can use sc.hadoopFile(..) but then how do I
>> convert it into DataFrame
>>
>> JavaPairRDD<Void,MyRecordWritable> myFormatAsPairRdd =
>>
>> jsc.hadoopFile("hdfs://tmp/data/myformat.xyz",MyInputFormat.class,Void.class,MyRecordWritable.class);
>> JavaRDD<MyRecordWritable> myformatRdd =  myFormatAsPairRdd.values();
>> DataFrame myFormatAsDataframe =
>> sqlContext.createDataFrame(myformatRdd,??);
>>
>> In above code what should I put in place of ?? I tried to put
>> MyRecordWritable.class but it does not work as it is not schema it is
>> Record
>> Writable. Please guide.
>>
>>
>>
>> --
>> View this message in context:
>> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-use-custom-Hadoop-InputFormat-in-DataFrame-tp24198.html
>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>> For additional commands, e-mail: user-h...@spark.apache.org
>>
>>
>

Reply via email to