Hello,

I am trying to connect to an Oracle database to import data into HDFS, but
after I submit the command, Sqoop does not start processing it for a very long
time. In the example below, it takes about 3 hours (13:05:54 to 16:19:04)
before the first log line appears. In parallel, we have about 16 connections
trying to connect at the same time. The cluster is an 18-node cluster. Is there
some way to check why the process hangs for that long? Note the timestamps.

2016-12-28 13:05:54 :  sqoop import -D 
mapred.child.java.opts=-Djava.security.egd=file:/dev/../dev/urandom 
--connection-param-file oraconnector.prop --connect jdbc:oracle:thin:@//**** 
--username *** -P --query "select * from <table> where \$CONDITIONS and id=*** 
and date='***' " --target-dir <HDFS_PATH> --null-string '' --null-non-string '' 
--as-avrodatafile -m 1 -verbose
Warning: 
/opt/cloudera/parcels/CDH-5.8.3-1.cdh5.8.3.p0.2/bin/../lib/sqoop/../accumulo 
does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
16/12/28 16:19:04 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6-cdh5.8.3
16/12/28 16:19:04 DEBUG tool.BaseSqoopTool: Enabled debug logging.
16/12/28 16:19:04 DEBUG tool.BaseSqoopTool: Loaded connection parameters: 
{oracle.jdbc.mapDateToTimestamp=false}
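
To make the delay concrete, here is a small sketch that computes the gap between
the time the command was submitted and the first Sqoop log line. The two
timestamps are copied from the output above; the variable names are only
illustrative.

```python
from datetime import datetime

# Time the command was submitted (first line of the output above).
submitted = datetime.strptime("2016-12-28 13:05:54", "%Y-%m-%d %H:%M:%S")

# Time of the first Sqoop log line (two-digit year, slash-separated).
first_log = datetime.strptime("16/12/28 16:19:04", "%y/%m/%d %H:%M:%S")

# Elapsed time before Sqoop printed anything.
delay = first_log - submitted
print(delay)  # 3:13:10
```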

With Regards,
Sweeta
