Re: An issue with Hive on hadoop cluster

2011-05-23 Thread MIS
I have the configuration consistent across both the client and server sides. I have checked the hadoop logs on both the nodes. On both the nodes, in the tasktracker logs, every task attempt is directed towards hdfs://localhost:54310/user/hive/warehouse and not towards hdfs://:54310/user/hive/wareh

Re: Hive web interface startup problem

2011-05-23 Thread Edward Capriolo
The error message does not correctly state where hive is looking for the war. You only need to supply the path from hive home, like this: /lib/hive-hwi-0.7.0-cdh3u0.war Not quite sure when it broke. Maybe around here: http://svn.apache.org/viewvc/hive/trunk/hwi/src/java/org/apache/hadoop/hive/hwi/

Re: Hive web interface startup problem

2011-05-23 Thread Carl Steinbach
Hi Ayon, We dropped support for HWI in CDH3u0 due to inconsistent upstream maintenance of this package. We recommend instead that people use Hue/Beeswax. More Information: https://ccp.cloudera.com/display/CDHDOC/Known+Issues+and+Work+Arounds+in+CDH3#KnownIssuesandWorkAroundsinCDH3-Hive https://c

Hive web interface startup problem

2011-05-23 Thread Ayon Sinha
Hi Everyone, I get a war not found error, when the war file exists. Any ideas? [ayon@d002 ant]$ ls -l /usr/lib/hive/lib/hive-hwi-0.7.0-cdh3u0.war -rwxr-xr-x 1 root root 29304 Mar 26 00:43 /usr/lib/hive/lib/hive-hwi-0.7.0-cdh3u0.war [ayon@d002 ant]$ hive --service hwi 11/05/24 02:07:27 INFO hwi.HW

Re: Hive question, summing second-level domain names

2011-05-23 Thread Adam Phelps
It does seem like that command is approximately what I need, but I can't seem to get its usage quite right for what I'm attempting to do. As an example, here's a try where I'm narrowing it down to only subdomains of facebook.com: SELECT regexp_extract(qname, '^((.*[.])|(.{0}))[^.]+\\.[^.]+\

Re: Logging MySQL queries

2011-05-23 Thread Dilip Joseph
If you just want to temporarily look at the queries while debugging some problem, one option I have found useful is to enable logging of all queries on your mysqld (assuming mysqld instance is used only for hive, and is not under heavy load). Dilip On Mon, May 23, 2011 at 2:45 PM, Steven Wong wr

RE: hive storing a byte array

2011-05-23 Thread Steven Wong
Hive does not support the blob data type. An option is to store your binary data encoded as string (such as using base64) and define them in Hive as string. -Original Message- From: Luke Forehand [mailto:luke.foreh...@networkedinsights.com] Sent: Monday, May 23, 2011 1:21 PM To: user@hi
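The base64 suggestion above can be sketched as follows. This is a minimal illustration, assuming the binary payload is already available as bytes; the function names `encode_for_string_column` and `decode_from_string_column` are hypothetical and not part of Hive.

```python
import base64


def encode_for_string_column(raw: bytes) -> str:
    """Return a base64 ASCII string that can be stored in a Hive STRING column."""
    # Standard base64 output contains only letters, digits, '+', '/' and '=',
    # so it cannot collide with tab or newline delimiters in a text file.
    return base64.b64encode(raw).decode("ascii")


def decode_from_string_column(text: str) -> bytes:
    """Recover the original bytes from the stored base64 string."""
    return base64.b64decode(text.encode("ascii"))


if __name__ == "__main__":
    payload = b"\x00\x01\xffserialized-object"  # stand-in for a serialized object
    stored = encode_for_string_column(payload)
    print(stored)
    print(decode_from_string_column(stored) == payload)
```

The trade-off is size: base64 text is roughly a third larger than the raw bytes, and the value has to be decoded by whatever reads the column.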

Hive query failing on MapReduce with no logs at all

2011-05-23 Thread Ayon Sinha
Hi, I created a table in HIVE prompt from files in the HDFS. I can see the table and can perform a select * query (which I believe does not need to do a MR). But for any query that needs MR, it submits the job, I can see it in Jobtracker and in 6 secs, all mappers & reducers are killed with no l

RE: Logging MySQL queries

2011-05-23 Thread Steven Wong
After posting my question, I did some digging and also found the log4j categories. Unfortunately, setting them to DEBUG in hive-log4j.properties has no effect. I will look into log4jdbc. Thanks for pointing it out. Steven From: Carl Steinbach [mailto:c...@cloudera.com] Sent: Monday, May 23, 2

Re: Logging MySQL queries

2011-05-23 Thread Carl Steinbach
Hi Steven, I think you can tell Datanucleus to log SQL using some of the configuration properties described here: http://www.datanucleus.org/products/accessplatform/logging.html It looks like the Datanucleus.Datastore.* categories are the ones you are interested in. Another option which may work

Re: Logging MySQL queries

2011-05-23 Thread Ashish Thusoo
You will have to write a pre-execute or post-execute hook to do this. The Hook API is at http://hive.apache.org/docs/r0.7.0/api/org/apache/hadoop/hive/ql/hooks/package-summary.html and then specify your

Logging MySQL queries

2011-05-23 Thread Steven Wong
My Hive metastore uses MySQL. I'd like to see Hive CLI log all SQL queries that are issued to MySQL. What config/property should I set to accomplish this? Thanks. Steven

hive storing a byte array

2011-05-23 Thread Luke Forehand
Hello, Can someone please provide an example in Hive, how I can store a serialized object in a field? A field type of byte array or binary or blob is really what I was looking for, but if something slightly less trivial is involved some instruction would be much appreciated. This object is used

Re: An issue with Hive on hadoop cluster

2011-05-23 Thread Ning Zhang
AFAIK, the fs.default.name should be set by both the client and server side .xml files, and they should be consistent (the URI scheme, the hostname and port number). The server side config (also called fs.default.name) should be read by the namenod
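One way to check the consistency described above is to read fs.default.name out of each side's configuration file and compare scheme, hostname and port. The sketch below is only an illustration: the helper names are hypothetical, the XML is passed in as text rather than read from any particular path, and the host `namenode.example.com` is a made-up placeholder.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit


def get_property(xml_text, name):
    """Return the value of a named property from Hadoop-style config XML, or None."""
    root = ET.fromstring(xml_text)
    for prop in root.findall("property"):
        if prop.findtext("name") == name:
            return prop.findtext("value")
    return None


def same_filesystem(uri_a, uri_b):
    """True when both URIs agree on scheme, hostname and port."""
    a, b = urlsplit(uri_a), urlsplit(uri_b)
    return (a.scheme, a.hostname, a.port) == (b.scheme, b.hostname, b.port)


# Hypothetical client and server configs; only the hostname differs.
CLIENT_XML = """<configuration>
  <property><name>fs.default.name</name><value>hdfs://localhost:54310</value></property>
</configuration>"""
SERVER_XML = """<configuration>
  <property><name>fs.default.name</name><value>hdfs://namenode.example.com:54310</value></property>
</configuration>"""

if __name__ == "__main__":
    client = get_property(CLIENT_XML, "fs.default.name")
    server = get_property(SERVER_XML, "fs.default.name")
    print(client, server, same_filesystem(client, server))
```

A mismatch like the one in this sample (localhost on one side, a real hostname on the other) is the kind of inconsistency the check is meant to surface.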

Re: Hive question, summing second-level domain names

2011-05-23 Thread Loren Siebert
Use regexp_extract() and group on that. http://wiki.apache.org/hadoop/Hive/LanguageManual/UDF#String_Functions For example, to get the class C IP block from an IP address in a column called 'host': regexp_extract(host, '(.*)\\.\\d{1,3}', 1) classc On May 23, 2011, at 1:03 PM, Adam
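To see what the pattern in that example captures, here is a small Python sketch that applies the same regular expression outside Hive and then counts rows per extracted value, which mirrors the "extract, then group" idea. The sample addresses and the helper name `class_c_block` are made up for illustration; this is not Hive code.

```python
import re
from collections import Counter

# Same pattern as in the message; the doubled backslashes there are
# string-literal escaping, so the regex itself is (.*)\.\d{1,3}
CLASS_C = re.compile(r"(.*)\.\d{1,3}")


def class_c_block(host):
    """Return everything before the last dotted octet, or None if no match."""
    match = CLASS_C.search(host)
    return match.group(1) if match else None


if __name__ == "__main__":
    hosts = ["10.0.0.1", "10.0.0.2", "192.168.1.77"]  # made-up sample rows
    counts = Counter(class_c_block(h) for h in hosts)
    for block, n in counts.items():
        print(block, n)
```

Because `(.*)` is greedy, it consumes as much as it can and leaves only the final `.NNN` for the rest of the pattern, which is why the first three octets end up in group 1.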

Hive question, summing second-level domain names

2011-05-23 Thread Adam Phelps
(As an FYI, I'm relatively new to Hive and have no previous SQL experience, so I have been struggling a bit with the Language Manual, which seems to assume previous SQL experience.) Suppose I have a table, within which there is a column which contains domain names (such as hadoop.apache.org). I

Re: Hive connecting to squirrel on windows

2011-05-23 Thread Patrick Hunt
The issue seems to be a classloader problem. You can work around the problem by changing squirrel's classpath to include the hadoop/hive jars (rather than adding them under the driver). I believe you can do this on windows by editing the squirrel batch file that you use to start the client. Patrick On S

Re: An issue with Hive on hadoop cluster

2011-05-23 Thread MIS
I have already tried your suggestion. I have mentioned the same in my mail. I have also given the required permissions for the directory (hive.metastore.warehouse.dir). If you look closely at the stack trace , the port number that I have specified in the config files for the namenode and jobtracke

Re: An issue with Hive on hadoop cluster

2011-05-23 Thread jinhang du
Set the following properties in hive-site.xml. fs.default.name = hdfs: mapred.job.tracker = hive.metastore.warehouse.dir = Make sure you have the authority to write into this directory (hive.metastore.warehouse.dir). Try it. 2011/5/23 MIS > I'm getting into an issue when trying to run hive over t

An issue with Hive on hadoop cluster

2011-05-23 Thread MIS
I'm running into an issue when trying to run hive over the hadoop cluster. The hadoop cluster itself is working fine on its own. I'm using hadoop 0.20.2 and hive 0.7.0. The problem is that hive is not honoring the fs.default.name property that I am setting in the core-site.