Ok I sorted this one out.

In the file $SPARK_HOME/conf/spark-defaults.conf, set the parameter
*spark.driver.extraClassPath* to the additional jar files that you need:
ojdbc8.jar, oraclepki.jar, osdt_cert.jar and osdt_core.jar.

spark.driver.extraClassPath /home/hduser/jars/jconn4.jar:/home/hduser/jars/ojdbc8.jar:/home/hduser/jars/oraclepki.jar:/home/hduser/jars/osdt_cert.jar:/home/hduser/jars/osdt_core.jar


*I had this referring to the old ojdbc6.jar previously, which was the cause of the errors!*

To diagnose the problem in a Spark session, find out what the Java classpath
is set to. Just open a session in the Spark shell and run:

scala> *System.getProperty("java.class.path")*
res0: String = /home/hduser/jars/jconn4.jar:
*/home/hduser/jars/ojdbc8.jar:/home/hduser/jars/oraclepki.jar:/home/hduser/jars/osdt_cert.jar:/home/hduser/jars/osdt_core.jar*

OK, they are there, so the connection should work. If they are not there,
the JDBC connection to ADW is not going to work.
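
The same check can be scripted rather than read by eye. The following is a minimal sketch, assuming the four jar names listed for spark.driver.extraClassPath above. The names ClasspathCheck, missingJars and staleOracleJars are illustrative only; they are not part of Spark or the Oracle driver.

```scala
import java.io.File

object ClasspathCheck {
  // Jar names copied from the spark.driver.extraClassPath setting above.
  val requiredJars: Seq[String] =
    Seq("ojdbc8.jar", "oraclepki.jar", "osdt_cert.jar", "osdt_core.jar")

  // File names (without directories) of all entries on a classpath string.
  private def jarNames(classPath: String): Seq[String] =
    classPath.split(File.pathSeparator).toSeq
      .filter(_.nonEmpty)
      .map(entry => new File(entry).getName)

  // Required jars that do not appear on the given classpath.
  def missingJars(classPath: String): Seq[String] = {
    val present = jarNames(classPath).toSet
    requiredJars.filterNot(present.contains)
  }

  // Entries that look like a different Oracle driver jar, e.g. an old ojdbc6.jar.
  def staleOracleJars(classPath: String): Seq[String] =
    jarNames(classPath).filter { name =>
      name.startsWith("ojdbc") && name.endsWith(".jar") && name != "ojdbc8.jar"
    }
}
```

In the Spark shell this could be called as ClasspathCheck.missingJars(System.getProperty("java.class.path")). An empty result means all four jars are on the driver classpath, and a non-empty result from staleOracleJars points at an older driver that may be picked up instead.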

Let us test this

import java.sql.DriverManager
import java.sql.Connection
import java.sql.DatabaseMetaData
import java.sql.ResultSet
import java.sql.SQLException
import java.util.ArrayList
import org.apache.spark.sql.functions._
import java.sql.{Date, Timestamp}

val driverName = "oracle.jdbc.OracleDriver"
// var url = "jdbc:oracle:thin:@rhes564:1521:mydb12"  // Old example, works up to 12c
// Define URL for ADW. The DBAccess directory is the location of the UNZIPPED
// Wallet_<DB>.zip that you downloaded from the ADW connection page
var url = "jdbc:oracle:thin:@mydb_high?TNS_ADMIN=/home/hduser/dba/bin/ADW/DBAccess"
var _username = "scratchpad"
var _password = "xxxxxxxxx"
var _dbschema = "SCRATCHPAD"
var _dbtable = "LL_18201960"
var e: SQLException = null
var connection: Connection = null
var metadata: DatabaseMetaData = null

// Define prop
val prop = new java.util.Properties
prop.setProperty("user", _username)
prop.setProperty("password", _password)
//
// Check Oracle is accessible
try {
  connection = DriverManager.getConnection(url, _username, _password)
} catch {
  // connection is still null when getConnection fails, so there is nothing to close here
  case e: SQLException => e.printStackTrace()
}
// Only ask for metadata if the connection was actually established
if (connection != null) metadata = connection.getMetaData()

And this is the output


scala> try {
     |       connection = DriverManager.getConnection(url, _username, _password)
     | } catch {
     |   case e: SQLException => e.printStackTrace
     |   connection.close()
     | }
AArray = [B@61cb973d
AArray = [B@3b1261ed
AArray = [B@23fd9be1

scala> metadata = connection.getMetaData()
metadata: java.sql.DatabaseMetaData = oracle.jdbc.driver.OracleDatabaseMetaData@545ac5d5
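
The Oracle dev team's advice quoted further down this thread mentions an alternative: keep the URL as a plain TNS alias and pass the wallet directory as a connection property or system property. A minimal sketch of that variant follows. It assumes the property name oracle.net.tns_admin, which to my knowledge the thin driver reads as a system property and, from 18.3 onwards, as a connection property; please check the documentation for your driver version. The object AdwConnection and its methods are made up for illustration.

```scala
import java.util.Properties

object AdwConnection {
  // Plain TNS alias URL, with no ?TNS_ADMIN=... suffix.
  def aliasUrl(tnsAlias: String): String = s"jdbc:oracle:thin:@$tnsAlias"

  // Connection properties carrying the credentials and the wallet directory.
  // "oracle.net.tns_admin" is an assumption about what the driver in use honours.
  def properties(user: String, password: String, tnsAdmin: String): Properties = {
    val p = new Properties
    p.setProperty("user", user)
    p.setProperty("password", password)
    p.setProperty("oracle.net.tns_admin", tnsAdmin)
    p
  }
}
```

With the variables defined earlier, the connection attempt would then read DriverManager.getConnection(AdwConnection.aliasUrl("mydb_high"), AdwConnection.properties(_username, _password, "/home/hduser/dba/bin/ADW/DBAccess")).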

You can of course add these jars to HDFS for YARN mode, as described in the
excerpt below from my article "The Operational Advantages of Spark as a
Distributed Processing Framework"
<https://www.linkedin.com/pulse/operational-advantages-spark-distributed-processing-mich/>

Putting Spark Jar files on HDFS

In YARN mode, *it is important that Spark jar files are available
throughout the Spark cluster*. I have spent a fair bit of time on this and
I recommend that you follow this procedure to make sure that the
spark-submit job runs OK. Use the spark.yarn.archive configuration option
and set it to the location of an archive (which you create on HDFS) containing
all the JARs in the $SPARK_HOME/jars/ folder, at the root level of the
archive. For example:

1) Create the archive:
   jar cv0f spark-libs.jar -C $SPARK_HOME/jars/ .
2) Create a directory on HDFS for the jars accessible to the application:
   hdfs dfs -mkdir /jars
3) Upload to HDFS:
   hdfs dfs -put spark-libs.jar /jars
4) For a large cluster, increase the replication count of the Spark archive
   so that you reduce the number of times a NodeManager will do a remote copy
   (change the number of replicas in proportion to the total number of
   NodeManagers):
   hdfs dfs -setrep -w 10 hdfs:///jars/spark-libs.jar
5) In the $SPARK_HOME/conf/spark-defaults.conf file, set spark.yarn.archive
   to the HDFS location of the archive, similar to below:
   spark.yarn.archive=hdfs://rhes75:9000/jars/spark-libs.jar
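
One detail worth checking in the spark.yarn.archive value is the number of slashes. A location such as hdfs:///jars/spark-libs.jar has no host part (as far as I know, Hadoop then falls back to the default file system), whereas hdfs://rhes75:9000/jars/spark-libs.jar names the NameNode explicitly. Writing hdfs:///rhes75:9000/... by mistake turns the host and port into part of the path. A small sketch using java.net.URI shows the difference; the object name ArchiveUri is illustrative only.

```scala
import java.net.URI

object ArchiveUri {
  // Host, port and path as parsed by java.net.URI.
  // Host is None and port is -1 when the location carries no authority part.
  def parts(location: String): (Option[String], Int, String) = {
    val uri = new URI(location)
    (Option(uri.getHost), uri.getPort, uri.getPath)
  }
}
```

Calling ArchiveUri.parts on the two-slash form returns the host rhes75 and port 9000, while the three-slash form returns no host and a path that starts with /rhes75:9000.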

HTH

Mich


LinkedIn * 
https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*





*Disclaimer:* Use it at your own risk. Any and all responsibility for any
loss, damage or destruction of data or any other property which may arise
from relying on this email's technical content is explicitly disclaimed.
The author will in no case be liable for any monetary damages arising from
such loss, damage or destruction.




On Thu, 27 Aug 2020 at 17:34, <kuassi.men...@oracle.com> wrote:

> Mich,
>
> That's right, referring to you guys.
>
> Cheers, Kuassi
> On 8/27/20 9:27 AM, Mich Talebzadeh wrote:
>
> Thanks Kuassi,
>
> I presume you mean Spark DEV team by "they are using ... "
>
> cheers,
>
> Mich
>
> On Thu, 27 Aug 2020 at 17:11, <kuassi.men...@oracle.com> wrote:
>
>> According to our dev team.
>>
>> From the error it is evident that they are using a jdbc jar which does
>> not support setting tns_admin in URL.
>> They might have some old jar in class-path which is being used instead of
>> 18.3 jar.
>> You can ask them to use either full URL or tns alias format URL with
>> tns_admin path set as either connection property or system property.
>>
>> Regards, Kuassi
>> On 8/26/20 2:11 PM, Mich Talebzadeh wrote:
>>
>> And this is a test using the Oracle-supplied Java program
>> DataSourceSample.java, with slight amendments for login/password and
>> table. It connects OK.
>>
>> hduser@rhes76: /home/hduser/dba/bin/ADW/src> javac -classpath ./ojdbc8.jar:. DataSourceSample.java
>> hduser@rhes76: /home/hduser/dba/bin/ADW/src> java -classpath ./ojdbc8.jar:. DataSourceSample
>> AArray = [B@57d5872c
>> AArray = [B@667a738
>> AArray = [B@2145433b
>> Driver Name: Oracle JDBC driver
>> Driver Version: 18.3.0.0.0
>> Default Row Prefetch Value is: 20
>> Database Username is: SCRATCHPAD
>>
>> DATETAKEN  WEIGHT
>> ---------------------
>> 2017-09-07 07:22:09 74.7
>> 2017-09-08 07:26:18 74.8
>> 2017-09-09 07:15:53 75
>> 2017-09-10 07:53:30 75.9
>> 2017-09-11 07:21:49 75.8
>> 2017-09-12 07:31:27 75.6
>> 2017-09-26 07:11:26 75.4
>> 2017-09-27 07:22:48 75.6
>> 2017-09-28 07:15:52 75.4
>> 2017-09-29 07:30:40 74.9
>>
>>
>>
>> Regards,
>>
>> On Wed, 26 Aug 2020 at 21:58, Mich Talebzadeh <mich.talebza...@gmail.com>
>> wrote:
>>
>>> Hi Kuassi,
>>>
>>> This is the error. I am only testing in local mode.
>>>
>>> scala> val driverName = "oracle.jdbc.OracleDriver"
>>> driverName: String = oracle.jdbc.OracleDriver
>>>
>>> scala> var url = "jdbc:oracle:thin:@mydb_high?TNS_ADMIN=/home/hduser/dba/bin/ADW/DBAccess"
>>> url: String = jdbc:oracle:thin:@mydb_high?TNS_ADMIN=/home/hduser/dba/bin/ADW/DBAccess
>>> scala> var _username = "scratchpad"
>>> _username: String = scratchpad
>>> scala> var _password = "xxxxxxxxxx"  -- no special characters
>>> _password: String = xxxxxxxxxxx
>>> scala> var _dbschema = "SCRATCHPAD"
>>> _dbschema: String = SCRATCHPAD
>>> scala> var _dbtable = "LL_18201960"
>>> _dbtable: String = LL_18201960
>>> scala> var e:SQLException = null
>>> e: java.sql.SQLException = null
>>> scala> var connection:Connection = null
>>> connection: java.sql.Connection = null
>>> scala> var metadata:DatabaseMetaData = null
>>> metadata: java.sql.DatabaseMetaData = null
>>> scala> val prop = new java.util.Properties
>>> prop: java.util.Properties = {}
>>> scala> prop.setProperty("user", _username)
>>> res1: Object = null
>>> scala> prop.setProperty("password",_password)
>>> res2: Object = null
>>> scala> // Check Oracle is accessible
>>>
>>> *scala> try {*
>>> *     |       connection = DriverManager.getConnection(url, _username, _password)*
>>> *     | } catch {*
>>> *     |   case e: SQLException => e.printStackTrace*
>>> *     |   connection.close()*
>>> *     | }*
>>> *java.sql.SQLRecoverableException: IO Error: Invalid connection string format, a valid format is: "host:port:sid"*
>>>         at oracle.jdbc.driver.T4CConnection.logon(T4CConnection.java:489)
>>>         at oracle.jdbc.driver.PhysicalConnection.<init>(PhysicalConnection.java:553)
>>>         at oracle.jdbc.driver.T4CConnection.<init>(T4CConnection.java:254)
>>>         at oracle.jdbc.driver.T4CDriverExtension.getConnection(T4CDriverExtension.java:32)
>>>         at oracle.jdbc.driver.OracleDriver.connect(OracleDriver.java:528)
>>>         at java.sql.DriverManager.getConnection(DriverManager.java:664)
>>>
>>> Is this related to Oracle or Spark? Do I need to set up another
>>> connection parameter etc?
>>>
>>>
>>>
>>> Cheers
>>>
>>> On Wed, 26 Aug 2020 at 21:09, <kuassi.men...@oracle.com> wrote:
>>>
>>>> Mich,
>>>>
>>>> All looks fine.
>>>> Perhaps some special chars in username or password?
>>>>
>>>> It is recommended not to use characters such as '@' or '.' in your
>>>> password.
>>>>
>>>> Best, Kuassi
>>>> On 8/26/20 12:52 PM, Mich Talebzadeh wrote:
>>>>
>>>> Thanks Kuassi.
>>>>
>>>> This is the version of the jar file that works OK with a JDBC connection
>>>> via Java to ADW
>>>>
>>>> unzip -p ojdbc8.jar META-INF/MANIFEST.MF
>>>> Manifest-Version: 1.0
>>>> Implementation-Title: JDBC
>>>> *Implementation-Version: 18.3.0.0.0*
>>>> sealed: true
>>>> Specification-Vendor: Sun Microsystems Inc.
>>>> Specification-Title: JDBC
>>>> Class-Path: oraclepki.jar
>>>> Implementation-Vendor: Oracle Corporation
>>>> Main-Class: oracle.jdbc.OracleDriver
>>>> Ant-Version: Apache Ant 1.7.1
>>>> Repository-Id: JAVAVM_18.1.0.0.0_LINUX.X64_180620
>>>> Created-By: 25.171-b11 (Oracle Corporation)
>>>> Specification-Version: 4.0
>>>>
>>>> And this is the setting for TNS_ADMIN
>>>>
>>>> *echo ${TNS_ADMIN}*
>>>> */home/hduser/dba/bin/ADW/DBAccess*
>>>>
>>>> hduser@rhes76: /home/hduser/dba/bin/ADW/DBAccess> *cat ojdbc.properties*
>>>> *# Connection property while using Oracle wallets.*
>>>> *oracle.net.wallet_location=(SOURCE=(METHOD=FILE)(METHOD_DATA=(DIRECTORY=${TNS_ADMIN})))*
>>>> *# FOLLOW THESE STEPS FOR USING JKS*
>>>> *# (1) Uncomment the following properties to use JKS.*
>>>> *# (2) Comment out the oracle.net.wallet_location property above*
>>>> *# (3) Set the correct password for both trustStorePassword and keyStorePassword.*
>>>> *# It's the password you specified when downloading the wallet from OCI Console or the Service Console.*
>>>> *#javax.net.ssl.trustStore=${TNS_ADMIN}/truststore.jks*
>>>> *#javax.net.ssl.trustStorePassword=<password_from_console>*
>>>> *#javax.net.ssl.keyStore=${TNS_ADMIN}/keystore.jks*
>>>> *#javax.net.ssl.keyStorePassword=<password_from_console>*
>>>> hduser@rhes76: /home/hduser/dba/bin/ADW/DBAccess>
>>>>
>>>> Regards,
>>>>
>>>> Mich
>>>>
>>>> On Wed, 26 Aug 2020 at 20:16, <kuassi.men...@oracle.com> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> Which release is the ojdbc8.jar from? 12c, 18c or 19c? I'd
>>>>> recommend ojdbc8.jar from the latest release.
>>>>> One more thing to pay attention to is the content of the
>>>>> ojdbc.properties file (part of the unzipped wallet)
>>>>> Make sure that ojdbc.properties file has been configured to use Oracle
>>>>> Wallet, as follows (i.e., anything related to JKS commented out)
>>>>>
>>>>>
>>>>> *oracle.net.wallet_location=(SOURCE=(METHOD=FILE)(METHOD_DATA=(DIRECTORY=${TNS_ADMIN})))*
>>>>> *#javax.net.ssl.trustStore=${TNS_ADMIN}/truststore.jks*
>>>>> *#javax.net.ssl.trustStorePassword=<password_from_console>*
>>>>> *#javax.net.ssl.keyStore=${TNS_ADMIN}/keystore.jks*
>>>>> *#javax.net.ssl.keyStorePassword=<password_from_console>*
>>>>>
>>>>> Alternatively, if you want to use JKS, then you need to comment out
>>>>> the first line, un-comment the other lines and set the values.
>>>>>
>>>>> Kuassi
>>>>> On 8/26/20 11:58 AM, Mich Talebzadeh wrote:
>>>>>
>>>>> Hi,
>>>>>
>>>>> Connections from Spark to Oracle 12c etc. are well established using
>>>>> ojdbc6.jar.
>>>>>
>>>>> I am attempting to connect to Oracle Autonomous Data Warehouse (ADW)
>>>>> version
>>>>>
>>>>> *Oracle Database 19c Enterprise Edition Release 19.0.0.0.0*
>>>>>
>>>>> The Oracle documentation suggests using ojdbc8.jar to connect to the
>>>>> database with the following URL format using Oracle Wallet
>>>>>
>>>>> "jdbc:oracle:thin:@mydb_high?TNS_ADMIN=/home/hduser/dba/bin/ADW/DBAccess"
>>>>>
>>>>> This works fine through Java itself but throws an error with
>>>>> Spark version 2.4.3.
>>>>>
>>>>> The connection string is defined as follows
>>>>>
>>>>> val url = "jdbc:oracle:thin:@mydb_high?TNS_ADMIN=/home/hduser/dba/bin/ADW/DBAccess"
>>>>>
>>>>> where the DBAccess directory is the unzipped wallet for Wallet_mydb.zip as
>>>>> created by ADW connection.
>>>>>
>>>>> The thing is that this works through a normal connection via Java
>>>>> code using the same URL.
>>>>>
>>>>> So the question is whether there is a dependency in the Spark JDBC
>>>>> connection on the ojdbc jar.
>>>>>
>>>>> The error I am getting is:
>>>>>
>>>>> java.sql.SQLRecoverableException: IO Error: Invalid connection string format, a valid format is: "host:port:sid"
>>>>>         at oracle.jdbc.driver.T4CConnection.logon(T4CConnection.java:489)
>>>>>         at oracle.jdbc.driver.PhysicalConnection.<init>(PhysicalConnection.java:553)
>>>>>         at oracle.jdbc.driver.T4CConnection.<init>(T4CConnection.java:254)
>>>>>         at oracle.jdbc.driver.T4CDriverExtension.getConnection(T4CDriverExtension.java:32)
>>>>>         at oracle.jdbc.driver.OracleDriver.connect(OracleDriver.java:528)
>>>>>         at java.sql.DriverManager.getConnection(DriverManager.java:664)
>>>>>
>>>>> This Oracle doc
>>>>> <https://docs.oracle.com/en/cloud/paas/autonomous-data-warehouse-cloud/user/connect-jdbc-thin-wallet.html#GUID-5ED3C08C-1A84-4E5A-B07A-A5114951AA9E>
>>>>> explains the connectivity.
>>>>>
>>>>> The unzipped wallet has the following files
>>>>>
>>>>>  ls DBAccess/
>>>>> README  cwallet.sso  ewallet.p12  keystore.jks  ojdbc.properties  sqlnet.ora  tnsnames.ora  truststore.jks
>>>>>
>>>>>
>>>>> Thanks
>>>>>
>>>>> Mich
