[ https://issues.apache.org/jira/browse/PIO-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249353#comment-16249353 ]
Takako Shimamoto commented on PIO-137:
--------------------------------------

The connection pool is initialized on the Spark driver, and JDBCPEvents then tries to use it on a Spark worker to delete the records in the RDD. The correct fix is to initialize the connection pool (or create a connection object) on the worker; a minimal sketch of that pattern is included at the end of this message. I'll handle this issue.

> Connection pool is not yet initialized.(name:'default)
> ------------------------------------------------------
>
>                 Key: PIO-137
>                 URL: https://issues.apache.org/jira/browse/PIO-137
>             Project: PredictionIO
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.11.0-incubating
>            Reporter: Mukesh Gupta
>
> While running the http://github.com/actionml/db-cleaner template, I am facing the following issue:
> {code:java}
> [WARN] [TaskSetManager] Lost task 0.1 in stage 23.0 (TID 75, ip-172-31-105-42.ap-southeast-1.compute.internal): java.lang.IllegalStateException: Connection pool is not yet initialized.(name:'default)
>     at scalikejdbc.ConnectionPool$$anonfun$get$1.apply(ConnectionPool.scala:76)
>     at scalikejdbc.ConnectionPool$$anonfun$get$1.apply(ConnectionPool.scala:74)
>     at scala.collection.MapLike$class.getOrElse(MapLike.scala:128)
>     at scala.collection.AbstractMap.getOrElse(Map.scala:58)
>     at scalikejdbc.ConnectionPool$.get(ConnectionPool.scala:74)
>     at scalikejdbc.ConnectionPool$.apply(ConnectionPool.scala:65)
>     at scalikejdbc.DB$.connectionPool(DB.scala:152)
>     at scalikejdbc.DB$.localTx(DB.scala:262)
>     at org.apache.predictionio.data.storage.jdbc.JDBCPEvents$$anonfun$delete$1$$anonfun$apply$11.apply(JDBCPEvents.scala:182)
>     at org.apache.predictionio.data.storage.jdbc.JDBCPEvents$$anonfun$delete$1$$anonfun$apply$11.apply(JDBCPEvents.scala:181)
>     at scala.collection.Iterator$class.foreach(Iterator.scala:727)
>     at scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
>     at org.apache.predictionio.data.storage.jdbc.JDBCPEvents$$anonfun$delete$1.apply(JDBCPEvents.scala:181)
>     at org.apache.predictionio.data.storage.jdbc.JDBCPEvents$$anonfun$delete$1.apply(JDBCPEvents.scala:179)
>     at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$33.apply(RDD.scala:920)
>     at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$33.apply(RDD.scala:920)
>     at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:1858)
>     at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:1858)
>     at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66)
>     at org.apache.spark.scheduler.Task.run(Task.scala:89)
>     at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:227)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     at java.lang.Thread.run(Thread.java:748)
> {code}
> Following are my PIO settings:
> {code:java}
> PIO_STORAGE_REPOSITORIES_METADATA_NAME=pio_meta
> PIO_STORAGE_SOURCES_ELASTICSEARCH_HOME=/elasticsearch
> PIO_STORAGE_REPOSITORIES_EVENTDATA_NAME=pio_event
> PIO_STORAGE_SOURCES_PGSQL_INDEX=enabled
> PIO_STORAGE_REPOSITORIES_EVENTDATA_SOURCE=PGSQL
> PIO_STORAGE_SOURCES_ELASTICSEARCH_TYPE=elasticsearch
> PIO_STORAGE_SOURCES_ELASTICSEARCH_PORTS=9300
> PIO_STORAGE_REPOSITORIES_METADATA_SOURCE=ELASTICSEARCH
> PIO_STORAGE_REPOSITORIES_MODELDATA_NAME=pio_model
> PIO_HOME=/PredictionIO
> PIO_STORAGE_SOURCES_PGSQL_USERNAME=dbuser
> PIO_STORAGE_REPOSITORIES_MODELDATA_SOURCE=PGSQL
> PIO_STORAGE_SOURCES_PGSQL_URL=jdbc:postgresql://hostname:5432/predictionio
> PIO_STORAGE_SOURCES_ELASTICSEARCH_HOSTS=es-hostname
> PIO_STORAGE_SOURCES_ELASTICSEARCH_CLUSTERNAME=es_cluster
> PIO_STORAGE_SOURCES_PGSQL_TYPE=jdbc
> PIO_STORAGE_SOURCES_PGSQL_PASSWORD=xxxxxxxxxxxxxx
> {code}
> I have tried upgrading the scalikejdbc version to 3.1.0 (as hinted by https://github.com/scalikejdbc/scalikejdbc/commit/b7713a8dbfb72b05f43743a5c281b5d7f7bac824), but the issue still remains.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
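For reference, here is a minimal sketch of the worker-side initialization described in the comment above. It is an illustration of the pattern only, not the actual PIO-137 patch: the helper name, the url/user/password parameters (standing in for the PIO_STORAGE_SOURCES_PGSQL_* values), and the table name pio_event_1 are all hypothetical.

{code:scala}
import org.apache.spark.rdd.RDD
import scalikejdbc._

object WorkerSideDelete {
  // Hypothetical helper: deletes events by id, doing all JDBC work on the workers.
  def deleteEvents(eventIds: RDD[String], url: String, user: String, password: String): Unit = {
    eventIds.foreachPartition { ids =>
      // This closure runs in the executor (worker) JVM, where the pool created
      // on the driver does not exist, so initialize one here before using the DB.
      if (!ConnectionPool.isInitialized()) {
        Class.forName("org.postgresql.Driver")
        ConnectionPool.singleton(url, user, password)
      }
      // DB.localTx can now borrow from the worker-local default pool.
      DB.localTx { implicit session =>
        ids.foreach { id =>
          sql"DELETE FROM pio_event_1 WHERE id = $id".update.apply()
        }
      }
    }
  }
}
{code}

The essential point is that ConnectionPool.singleton runs inside foreachPartition, i.e. in each executor JVM, so the subsequent DB.localTx call finds a pool there; a pool initialized only on the driver is never visible to the workers, which is exactly the IllegalStateException in the stack trace above.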