Hi, I am new to Hive, I am reading the doc which is available on Apache site and try to create a correlation between hadoop and Hive. so please help me to understand this: As per my understanding, all the files where unstructured data are stored in HDFS system across the hadoop cluster. Now when we have to analyze those data we use Hive. Now I have some question which I am not able to get :
1.When engineer/buisnessuser want to analyze the data, which is available on any of the file on HDFS cluster, so what is the steps to get the desired file and analyze the file using hive. 2.Is Hive stores all the data in their tables after the analysis permanently? 3.Is Hive itself a database? Thanks Prashant ::DISCLAIMER:: ---------------------------------------------------------------------------------------------------------------------------------------------------- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. ----------------------------------------------------------------------------------------------------------------------------------------------------