Re: Fwd: Question on ORC file stripe size.

2014-12-03 Thread Prasanth Jayachandran
Stripe size is too low. ORC maintains multiple buffers in memory. ORC’s memory manager flushes a stripe when the in-memory data size (which includes buffers in memory) is greater than specified stripe size. This check happens after every 5000 rows.  This is what is happening in this case There

Fwd: Question on ORC file stripe size.

2014-12-02 Thread Jim Green
Hi Team, I am creating this table: CREATE TABLE IF NOT EXISTS orctest2 ( id string, id2 string, id3 string, id4 string ) STORED AS ORC tblproperties ("orc.stripe.size"="1048576","orc.row.index.stride"="”); The stripe size is set to 1MB. After loading data, the table file is about 60MB: -rwxr-