Re: How to insert a dataframe as a static partition to a partitioned table

2017-07-20 Thread Chaoyu Tang
Thanks, Vadim, but I am looking for an API in DataSet, DataFrame, DataFrameWriter, etc. The approach you suggested can be done with a query like spark.sql(""" ALTER TABLE `table` ADD PARTITION (partcol=1) LOCATION '/path/to/your/dataset' """), and before that I write it to a specified location

Re: How to insert a dataframe as a static partition to a partitioned table

2017-07-20 Thread Vadim Semenov
This should work:

```
ALTER TABLE `table` ADD PARTITION (partcol=1) LOCATION '/path/to/your/dataset'
```

On Wed, Jul 19, 2017 at 6:13 PM, ctang wrote:
> I wonder if there are any easy ways (or APIs) to insert a dataframe (or
> DataSet), which does not contain the partition

Re: How to insert a dataframe as a static partition to a partitioned table

2017-07-19 Thread Ryan
Not sure about the writer API, but you could always register a temp table for that dataframe and execute an insert in HQL. On Thu, Jul 20, 2017 at 6:13 AM, ctang wrote: > I wonder if there are any easy ways (or APIs) to insert a dataframe (or > DataSet), which does not contain the
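
A rough sketch of what this suggestion could look like, in PySpark style: register the dataframe as a temporary view and run an `INSERT ... PARTITION` statement with the partition value fixed in the statement. The helper names `build_static_insert_sql` and `insert_into_static_partition`, and the default view name `staging_view`, are made up for illustration; only `createOrReplaceTempView`, `df.columns` and `spark.sql` are actual Spark calls.

```python
def _sql_literal(value):
    """Render a partition value as a SQL literal (ints and strings only)."""
    if isinstance(value, int):
        return str(value)
    escaped = str(value).replace("\\", "\\\\").replace("'", "\\'")
    return "'" + escaped + "'"


def build_static_insert_sql(table, partition_spec, columns, view_name):
    """Build an INSERT into one static partition, selecting from a temp view.

    `table` and `view_name` are used verbatim; column names are backticked.
    """
    parts = ", ".join(
        f"{col}={_sql_literal(val)}" for col, val in partition_spec.items()
    )
    cols = ", ".join(f"`{c}`" for c in columns)
    return (
        f"INSERT INTO TABLE {table} PARTITION ({parts}) "
        f"SELECT {cols} FROM {view_name}"
    )


def insert_into_static_partition(spark, df, table, partition_spec,
                                 view_name="staging_view"):
    """Hypothetical helper: insert `df` (without partition columns) into `table`."""
    # Register the dataframe under a temporary name so SQL can refer to it.
    df.createOrReplaceTempView(view_name)
    spark.sql(build_static_insert_sql(table, partition_spec, df.columns, view_name))
```

For a dataframe with columns (col1, col2) and `partition_spec={"partcol": 1}`, the generated statement selects `col1` and `col2` from the view and writes them into the partition `partcol=1`.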

How to insert a dataframe as a static partition to a partitioned table

2017-07-19 Thread ctang
I wonder if there are any easy ways (or APIs) to insert a dataframe (or DataSet), which does not contain the partition columns, as a static partition to the table. For example, the DataSet with columns (col1, col2) will be inserted into a table (col1, col2) partitioned by column partcol as a