unsubscribe

On Tue, Apr 14, 2015 at 4:28 PM, Gopal Vijayaraghavan <gop...@apache.org>
wrote:

>
> >0.14 . Acid tables have been a real pain for us. We don¹t believe they are
> >production ready. At least in our use cases, Tez crashes for assorted
> >reasons or only assigns 1 mapper to the partition. Having delta files and
> >no base files borks mapper assignments.
>
> Some of the chicken-egg problems for those were solved recently in
> HIVE-10114.
>
> Then TEZ-1993 is coming out in the next version of Tez, into which we¹re
> plugging in HIVE-7428 (no fix yet).
>
> Currently delta-only splits have 0 bytes as the ³file size², so it grouped
> together to make a 16Mb chunk (rather a huge single 0 sized split).
>
> Those patches are the effect of me shaving the yak from the ³1 mapper²
> issue.
>
> After which the writer has to follow up on HIVE-9933 to get the locality
> of files fixed.
>
> >name are left scattered about, borking queries. Latency is higher with
> >streaming than writing to an orc file in hdfs, forcing obscene quantities
> >of buckets and orc files smaller than any reasonable orc stripe / hdfs
> >block size. The compactor hangs seemingly at random for no reason we¹ve
> >been able to discern.
>
> I haven¹t seen these issues yet, but I am not dealing with a large volume
> insert rate, so haven¹t produced latency issues there.
>
> Since I work on Hive performance and I haven¹t seen too many bugs filed,
> so I haven¹t paid attention to the performance of ACID.
>
> Please file bugs when you find them, so that it appears on the radar for
> folks like me.
>
> I¹m poking about because I want a live stream into LLAP to work seamlessly
> & return sub-second query results when queried (pre-cache/stage & merge
> etc).
>
> >An orc file without a footer is junk data (or, at least, the last stripe
> >is junk data). I suppose my question should have been 'what will the hive
> >query do when it encounters this? Skip the stripe / file? Error out the
> >query? Something else?¹
>
> It should throw an exception, because that¹s a corrupt ORC file.
>
> The trucking demo uses Storm without ACID - this is likely to get better
> once we use Apache Falcon to move the data around.
>
> Cheers,
> Gopal
>
>
>


-- 
Chad J. Dotzenrod
(630)669-6095
cdotzen...@gmail.com

Reply via email to