Re: Spark 2.0 -- spark warehouse relative path in absolute URI error

2016-07-28 Thread Hatim Diab
I’m not familiar with windows but for unix is the path is /data/zxy then it’ll be file:///data/zxy so I’d assume file://C:/ > On Jul 28, 2016, at 2:33 PM, Sean Owen wrote: > > Hm, file:///C:/... doesn't work? that should certainly be an absolute > URI with an absolute

Re: Tools for Balancing Partitions by Size

2016-07-12 Thread Hatim Diab
Hi, Since the final size depends on data types and compression. I've had to first get a rough estimate of data, written to disk, then compute the number of partitions. partitions = int(ceil(size_data * conversion_ratio / block_size)) In my case block size 256mb, source txt & dest is snappy