Re: How to use iceberg in spark?

Ryan Blue Wed, 15 May 2019 10:59:28 -0700

Hi Junjie,

If the classes won’t load, I’m not sure what’s going on. You should be
able to add the JAR on the command line with --jars and --driver-class-path.
Try that, and also make sure you’re using a recent JAR built from master,
because we renamed the packages from com.netflix to org.apache a little
while ago.
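
A quick way to check which package your JAR actually contains is to try
loading a class from each namespace in spark-shell. This is only a sketch:
the JAR path in the comment is a hypothetical placeholder, isLoadable is a
helper made up for illustration, and the com.netflix.iceberg.Table name
assumes the old package layout matched the new one.

```scala
// Launch spark-shell with the JAR on both classpaths first, for example
// (hypothetical path, substitute your own build output):
//   spark-shell --jars /path/to/iceberg.jar --driver-class-path /path/to/iceberg.jar

import scala.util.Try

// Hypothetical helper: true if the named class can be loaded from the
// current classpath, false if loading throws ClassNotFoundException.
def isLoadable(className: String): Boolean =
  Try(Class.forName(className)).isSuccess

// New package name after the rename.
println(s"org.apache.iceberg.Table loadable: ${isLoadable("org.apache.iceberg.Table")}")
// Old package name (assumed); true here suggests the JAR predates the rename.
println(s"com.netflix.iceberg.Table loadable: ${isLoadable("com.netflix.iceberg.Table")}")
```

If only the com.netflix line prints true, the JAR is an old build. If both
print false, the JAR is not on the driver classpath at all.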

Right now, master only supports reading from HDFS tables in Spark. That
looks like this:

val df = spark.read.format("iceberg").load("hdfs://...")
df.createOrReplaceTempView("table_name")

We’re working on getting better support into Spark with the DataSourceV2
effort. CTAS was just committed, and we will have a PR for reading v2
tables from SQL posted to Spark shortly.

I’m also planning on updating the Spark code so that it can use a Hive
metastore for tables now that we have committed the connection pool in
iceberg-hive.

rb

On Tue, May 14, 2019 at 8:58 PM 俊杰陈 <[email protected]> wrote:

> Sending this again since I can't see it on the dev mailing list.
>
> On Tue, May 14, 2019 at 1:24 PM 俊杰陈 <[email protected]> wrote:
>
>> Hi
>>
>> Do we have a wiki or docs for Spark usage? I tried putting the jar in the
>> Spark jars dir, but when I import org.apache.iceberg.xxxx in spark-shell,
>> it says iceberg is not a member of package org.apache.
>>
>> --
>> Thanks & Best Regards
>>
>
>
> --
> Thanks & Best Regards
>

-- 
Ryan Blue
Software Engineer
Netflix
