Re: Is it ok to build an entire ETL/ELT data flow using HIVE queries?

2016-02-15 Thread Mich Talebzadeh
A combination of both, normally. See below: https://www.linkedin.com/pulse/etl-elt-use-case-mich-talebzadeh-ph-d-?trk=pulse_spock-articles [1] HTH. Mich On 16/02/2016 06:19, Ramasubramanian wrote: > Hi, > > Is it ok to build an entire ETL/ELT data flow using HIVE queries? > > Data is

Re: The index for query in hive 1.2.1 does not work.

2016-02-15 Thread Mich Talebzadeh
Hi, "Traditional" indexes are not currently used in Hive. You can create them, but they are not used by the optimizer. You can create storage indexes in Hive using the ORC file format, which provides three levels of granularity: * ORC File itself * Multiple stripes within the ORC f

Re: Is it ok to build an entire ETL/ELT data flow using HIVE queries?

2016-02-15 Thread Heng Chen
My company does its ETL data flow using Hive + Pig, and it is OK so far. 2016-02-16 14:55 GMT+08:00 Jörn Franke: > Why should it not be ok if you do not miss any functionality? You can use > oozie + hive queries to have more sophisticated logging and scheduling. Do > not forget to do proper capacity/queu

Re: Is it ok to build an entire ETL/ELT data flow using HIVE queries?

2016-02-15 Thread Jörn Franke
Why should it not be ok if you do not miss any functionality? You can use oozie + hive queries to have more sophisticated logging and scheduling. Do not forget to do proper capacity/queue management. > On 16 Feb 2016, at 07:19, Ramasubramanian wrote: > > Hi, > > Is it ok to build an entire

Is it ok to build an entire ETL/ELT data flow using HIVE queries?

2016-02-15 Thread Ramasubramanian
Hi, Is it ok to build an entire ETL/ELT data flow using HIVE queries? Data is stored in HIVE. We have transactional and reference data. We need to build a small warehouse. Need suggestion on alternatives too. Regards, Rams

The index for query in hive 1.2.1 does not work.

2016-02-15 Thread 万修远
Hello, When I use an index in Hive 1.2.1, I find that the index does not work. The details are as follows: 1. After creating the index, the query speed does not improve. If I use the index manually, the query speed improves noticeably, but when I switch to automatic use of indexes, the speed makes no differ

Re: list bucketing join

2016-02-15 Thread Sergey Shelukhin
It’s probably a bug. Can you file a JIRA with the full callstack? As far as I know, list bucketing is not widely used, so the bug might have been introduced unwittingly, but it’s hard to tell without seeing the callstack. From: Shangzhong zhu <shanzh...@gmail.com> Reply-To: "user@hive.apa

ERROR: no more data to read

2016-02-15 Thread brajmohan saxena
Hi, While trying to connect to Hive server ( port 1 ) on the Hortonworks sandbox through a simple Thrift program ( transport->open(); ), I am getting this error: *ERROR: No more data to read.* Could you please let me know the reason for this, and from which Hive version onwards this error started.

Re: Need help :Does anybody has HDP cluster on EC2?

2016-02-15 Thread Chandeep Singh
You could also fire up a VNC session and access all internal pages from there. > On Feb 15, 2016, at 9:19 AM, Divya Gehlot wrote: > > Hi Sabarish, > Thanks a lot for your help. > I am able to view the logs now. > > Thank you very much. > > Cheers, > Divya > > > On 15 February 2016 at 16:51,

Re: Need help :Does anybody has HDP cluster on EC2?

2016-02-15 Thread Divya Gehlot
Hi Sabarish, Thanks a lot for your help. I am able to view the logs now. Thank you very much. Cheers, Divya On 15 February 2016 at 16:51, Sabarish Sasidharan <sabarish.sasidha...@manthan.com> wrote: > You can set up SSH tunneling. > > > http://docs.aws.amazon.com/ElasticMapReduce/latest/Develop

Re: Need help :Does anybody has HDP cluster on EC2?

2016-02-15 Thread Awhan Patnaik
Add the FQDN-to-IP mapping to the /etc/hosts file (if using Linux). On Mon, Feb 15, 2016 at 1:55 PM, Divya Gehlot wrote: > Hi, > I have a Hadoop cluster set up in EC2. > I am unable to view application logs in the Web UI, as it is using the internal IP, > like below: > http://ip-xxx-xx-xx-xxx.ap-sout

Need help :Does anybody has HDP cluster on EC2?

2016-02-15 Thread Divya Gehlot
Hi, I have a Hadoop cluster set up in EC2. I am unable to view application logs in the Web UI, as it is using the internal IP, like below: http://ip-xxx-xx-xx-xxx.ap-southeast-1.compute.internal:8042 How can I change this to external one or redir

hive on tez hadoop-common version problem.

2016-02-15 Thread no jihun
Hi all. I have a problem with Hive on Tez. The email thread below is forwarded; it was originally written to the Tez users list. Could someone please check this problem out? Any advice will be appreciated. Thanks. -- Forwarded message -- From: no jihun Date: 2016-02-15 17:04 GMT+09:00 Subject: Re