Hi Dileep, 

Have tried to se the following values in hive and run the query again.  More 
info why the query may fail in the following link : 

https://cwiki.apache.org/Hive/statsdev.html


set hive.stats.autogather=false;
As well as ; 

set hive.stats.dbclass=jdbc:derby;
set 
hive.stats.dbconnectionstring="jdbc:derby:;databaseName=TempStatsStore;create=true";
set hive.stats.jdbcdriver="org.apache.derby.jdbc.EmbeddedDriver";

Hope this helps. 


 

 Abdelrahman Shettia
ashet...@hortonworks.com


On Mar 8, 2013, at 2:31 PM, Dileep Kumar <dileepkumar...@gmail.com> wrote:

> Thanks for your attention !
> No only one hive process is running and thing that bother me is smaller query 
> runs till completion which I invoke the same way. It is using embeded db if 
> that is the problem I can change it to external DB but as my smaller query 
> runs fine I thought this should be OK.
> 
> 
> On Fri, Mar 8, 2013 at 2:16 PM, Dean Wampler 
> <dean.wamp...@thinkbiganalytics.com> wrote:
> Do you have more than one hive process running? It looks like you're using 
> Derby, which only supports one process at a time. Also, you have to start 
> Hive from the same directory every time, where the metastore "database" is 
> written, unless you edit the JDBC connection property in the Hive config file 
> to point to a particular path. Here's what I use:
> 
> <property>
>   <name>javax.jdo.option.ConnectionURL</name>
>   
> <value>jdbc:derby:;databaseName=/Users/somedufus/hive/metastore_db;create=true</value>
>   <description>JDBC connect string for a JDBC metastore</description>
> </property>
> 
> 
> On Fri, Mar 8, 2013 at 4:09 PM, Dileep Kumar <dileepkumar...@gmail.com> wrote:
> Hi All,
> 
> I am running a hive query which does insert into a table.
> What I noticed from the symptom it looks like it got to do with some settings 
> but  I am not able to figure out what settings.
> 
> When I submit the query it starts 2130 map tasks in the job and 150 of them 
> completes fine without any error and then next batch of 75 gets killed and 
> all of them after that gets killed.
> While I submit a similar query based on smaller table its starts around only 
> 135 map tasks and it runs till completion without any error and does the 
> insert into appropriate table.
> 
> I don't find any obvious error messages in any of the tasks log apart form 
> this:
> 
> 
> ./hadoop-0.20-mapreduce/userlogs/job_201303080834_0001/attempt_201303080834_0001_m_001636_0/syslog:2013-03-08
>  08:54:06,910 INFO orapache.hadoop.hive.ql.exec.MapOperator: 
> DESERIALIZE_ERRORS:0
> ./hadoop-0.20-mapreduce/userlogs/job_201303080834_0001/attempt_201303080834_0001_m_001646_0/syslog:2013-03-08
>  08:41:06,060 INFO orapache.hadoop.hive.ql.exec.MapOperator: 
> DESERIALIZE_ERRORS:0
> ./hadoop-0.20-mapreduce/userlogs/job_201303080834_0001/attempt_201303080834_0001_m_001646_0/syslog:2013-03-08
>  08:46:54,390 ERROR o.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher: 
> Error during instantiating JDBC driver org.apache.derby.jdbc.EmbeddedDriver.
> ./hadoop-0.20-mapreduce/userlogs/job_201303080834_0001/attempt_201303080834_0001_m_001646_0/syslog:2013-03-08
>  08:46:54,394 ERROR o.apache.hadoop.hive.ql.exec.FileSinkOperator: 
> StatsPublishing error: cannot connect to database
> 
> Please suggest if I need to set anything in Hive when I invoke this query. 
> The query that runs successfully has lot less rows compared to on that fails.
> 
> Thanks,
> DK
> 
> 
> 
> -- 
> Dean Wampler, Ph.D.
> thinkbiganalytics.com
> +1-312-339-1330
> 
> 

Reply via email to