Hello,
I am trying to connect to an Oracle database to import data into HDFS, but after
I submit the command, processing does not start for a very long time. In the
example below, it takes about 3 hours to start. We have about 16 connections
trying to connect in parallel at the same time. The cluster is an 18-node
cluster. Is there some way to check why the process hangs for that long? Note
the timestamps: the first is when the command was submitted, and the second is
the first line logged by Sqoop.
2016-12-28 13:05:54 : sqoop import \
  -D mapred.child.java.opts=-Djava.security.egd=file:/dev/../dev/urandom \
  --connection-param-file oraconnector.prop \
  --connect jdbc:oracle:thin:@//**** \
  --username *** -P \
  --query "select * from <table> where \$CONDITIONS and id=*** and date='***' " \
  --target-dir <HDFS_PATH> \
  --null-string '' --null-non-string '' \
  --as-avrodatafile -m 1 -verbose
Warning: /opt/cloudera/parcels/CDH-5.8.3-1.cdh5.8.3.p0.2/bin/../lib/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
16/12/28 16:19:04 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6-cdh5.8.3
16/12/28 16:19:04 DEBUG tool.BaseSqoopTool: Enabled debug logging.
16/12/28 16:19:04 DEBUG tool.BaseSqoopTool: Loaded connection parameters: {oracle.jdbc.mapDateToTimestamp=false}
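
To make the delay concrete, here is a small sketch of how the gap between the
submission timestamp and the first Sqoop log line works out. It assumes GNU
date is available; the variable names are only for illustration, and the log
timestamp 16/12/28 is written out as 2016-12-28.

```shell
#!/bin/sh
# Timestamps copied from the output above (log date 16/12/28 rewritten as 2016-12-28).
submitted="2016-12-28 13:05:54"   # when the sqoop command was submitted
first_log="2016-12-28 16:19:04"   # first line logged by Sqoop

# Convert both to epoch seconds (assumes GNU date; -u avoids local DST effects).
start=$(date -u -d "$submitted" +%s)
end=$(date -u -d "$first_log" +%s)

# Difference in seconds, printed as hours, minutes and seconds.
gap=$((end - start))
printf 'Delay before Sqoop started: %dh %dm %ds\n' \
  $((gap / 3600)) $((gap % 3600 / 60)) $((gap % 60))
```

That gives a delay of 3h 13m 10s between submitting the command and the first
log line.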
With Regards,
Sweeta