Hi Sea,
To use Spark SQL you will need to create a DataFrame from the file and
then run your query against that DataFrame.
In your case you would do something like this:
// needs: org.apache.spark.api.java.JavaRDD, org.apache.spark.api.java.function.Function,
//        org.apache.spark.sql.Row, org.apache.spark.sql.RowFactory, org.apache.spark.sql.DataFrame
JavaRDD<String> lines = context.textFile("path");
JavaRDD<Row> rowRDD = lines.map(new Function<String, Row>() {
    public Row call(String record) throws Exception {
        // "\001" (Ctrl-A / SOH) is the field delimiter
        String[] fields = record.split("\001");
        return RowFactory.create((Object[]) fields);
    }
});
DataFrame resultDf = hiveContext.createDataFrame(rowRDD, schema);
resultDf.registerTempTable("test");
hiveContext.sql("select * from test");
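As a quick standalone check that Java's split handles the '\001' delimiter (plain Java, no Spark needed; the sample record is made up):

```java
public class DelimiterDemo {
    public static void main(String[] args) {
        // "\001" is the octal escape for the Ctrl-A (SOH) character,
        // which is Hive's default field delimiter for text tables.
        String record = "alice" + "\001" + "30" + "\001" + "nyc";
        String[] fields = record.split("\001");
        System.out.println(fields.length); // prints 3
        System.out.println(fields[1]);     // prints 30
    }
}
```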
You will also need to create a schema for the file first, just as you
did for the CSV file.
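For reference, a schema for a three-column file could be built like this (the column names and types here are hypothetical; adjust them to your file's actual layout):

```java
import java.util.ArrayList;
import java.util.List;

import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.StructField;
import org.apache.spark.sql.types.StructType;

public class SchemaDemo {
    public static void main(String[] args) {
        // Hypothetical columns; replace with your file's real layout.
        List<StructField> fields = new ArrayList<StructField>();
        fields.add(DataTypes.createStructField("name", DataTypes.StringType, true));
        fields.add(DataTypes.createStructField("age", DataTypes.StringType, true));
        fields.add(DataTypes.createStructField("city", DataTypes.StringType, true));
        StructType schema = DataTypes.createStructType(fields);
        System.out.println(schema.fieldNames().length); // prints 3
    }
}
```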
On Friday 23 September 2016 12:26 PM, Sea wrote:
Hi, I want to run SQL directly on files. I found that Spark supports
SQL like select * from csv.`/path/to/file`, but the files may not be
split by ','. They may be split by '\001'; how can I specify the
delimiter?
Thank you!