Re: External vs. Managed Tables - Current state

2018-11-02 Thread Lars Francke
Hi again, I finally got around to creating a Wiki page: <https://cwiki.apache.org/confluence/display/Hive/Managed+vs.+External+Tables> @Sergey: What are "SH tables"? Any more additions to the content are welcome. I'd be happy to put them in the Wiki too. All I need are a few cues/keywords.

Re: External vs. Managed Tables - Current state

2018-08-22 Thread Lars Francke
Thanks for the input here and on the issue itself. @Sergey: That is a good summary. I would try to distill that into a Wiki page. I still think that it'd be good to list the actual technical differences. 1) Because this is not actually written down in a concise way so lots of our users don't

Re: External vs. Managed Tables - Current state

2018-08-22 Thread Sergey Shelukhin
I think it’s very simple conceptually (well, the way I’ve been thinking about it), and any limitations can be derived from this using logic. If someone has specific questions we can clarify; if some limitations are missing (e.g. when Hive was using stats for HBase tables) it is a bug. For managed

Re: External vs. Managed Tables - Current state

2018-08-21 Thread Mithun RK
+1. I'm out of the loop as well. I'd be keen on grokking what's not available with EXTERNAL tables, given that these are popular on our clusters as well. It would be good to have that documented in one place.
On Tue, Aug 21, 2018 at 8:19 AM Lars Francke wrote:
> As this came up with yet another

Re: External vs. Managed Tables - Current state

2018-08-21 Thread Lars Francke
As this came up with yet another customer and I've heard more things that I believe to be false: Please, does anyone have any kind of feedback for me here? Is my list correct, can you add to it, or do you have more details on certain points? Thank you!
On Wed, Aug 1, 2018 at 12:17 PM, Lars Francke

Re: External vs. Managed Tables - Current state

2018-08-01 Thread Lars Francke
I have opened https://issues.apache.org/jira/browse/HIVE-20287 and would love to get feedback.
On Wed, Aug 1, 2018 at 11:14 AM, Lars Francke wrote:
> Thanks Lefty! I haven't created a JIRA but it's on my list.
>
> Still interested in any input anyone might have?
>
> Some of you guys _must_ have

Re: External vs. Managed Tables - Current state

2018-08-01 Thread Lars Francke
Thanks Lefty! I haven't created a JIRA but it's on my list. Still interested in any input anyone might have? Some of you guys _must_ have some knowledge on this stuff. You're correct that the DROP stuff is documented. What I meant is that I couldn't find the relevant parts in the code.
On Sat,

Re: External vs. Managed Tables - Current state

2018-07-20 Thread Lefty Leverenz
Agreed, the Hive wiki is woefully incomplete. But it does mention DROP for external tables in two sections of the DDL doc:
- External Tables: "When dropping an EXTERNAL table, data

External vs. Managed Tables - Current state

2018-07-19 Thread Lars Francke
Hi, I've been wondering if anyone can tell me what the differences and limitations of managed vs. external tables are these days. I've seen the docs from Hortonworks[1] that list these features as not supported by External tables:
* Query cache
* Materialized views, except in a limited way
*