Hello David, I know about DIH thread problems. Some time ago I did quick fix patch for 3.4, which passes tests. If you have some time pls try it. http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201110.mbox/%3CCANGii8cOrWXsSvP9EYcRFX_mQBoVdatzRW%2BF0Cq2c%3D6sx8czZw%40mail.gmail.com%3E I'm working on fixing it in trunk.
But I've never seen that ClassCastException, it can be an another one bug. Regards On Sat, Dec 10, 2011 at 10:35 PM, David Webb (Commented) (JIRA) < [email protected]> wrote: > > [ > https://issues.apache.org/jira/browse/SOLR-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13166926#comment-13166926] > > David Webb commented on SOLR-2961: > ---------------------------------- > > Weird note, when threads="2", processing continues even though the > stacktraces are output to the logs. When threads="6", when the error > occues, the DIH process immediately stops and performs a rollback. > > This is preventing me from using DIH to load and maintain my production > index. Any help is greatly appreciated since I am now at the 11th hour. :) > > Solr and all components have been stellar up to this point. Great project! > > > DIH with threads and TikaEntityProcessor JDBC ISsue > > --------------------------------------------------- > > > > Key: SOLR-2961 > > URL: https://issues.apache.org/jira/browse/SOLR-2961 > > Project: Solr > > Issue Type: Bug > > Components: contrib - DataImportHandler > > Affects Versions: 3.4, 3.5 > > Environment: Windows Server 2008, Apache Tomcat 6, Oracle 11g, > ojdbc 11.2.0.1 > > Reporter: David Webb > > Labels: dih, tika > > Attachments: data-config.xml > > > > > > I have a DIH Configuration that works great when I dont specify > threads="X" in the root entity. As soon as I give a value for threads, I > get the following error messages in the stacktrace. Please advise. > > SEVERE: JdbcDataSource was not closed prior to finalize(), indicates a > bug -- POSSIBLE RESOURCE LEAK!!! > > Dec 10, 2011 1:18:33 PM > org.apache.solr.handler.dataimport.JdbcDataSource closeConnection > > SEVERE: Ignoring Error when closing connection > > java.sql.SQLRecoverableException: IO Error: Socket closed > > at oracle.jdbc.driver.T4CConnection.logoff(T4CConnection.java:511) > > at > oracle.jdbc.driver.PhysicalConnection.close(PhysicalConnection.java:3931) > > at > org.apache.solr.handler.dataimport.JdbcDataSource.closeConnection(JdbcDataSource.java:401) > > at > org.apache.solr.handler.dataimport.JdbcDataSource.close(JdbcDataSource.java:392) > > at > org.apache.solr.handler.dataimport.JdbcDataSource.finalize(JdbcDataSource.java:380) > > at java.lang.ref.Finalizer.invokeFinalizeMethod(Native Method) > > at java.lang.ref.Finalizer.runFinalizer(Unknown Source) > > at java.lang.ref.Finalizer.access$100(Unknown Source) > > at java.lang.ref.Finalizer$FinalizerThread.run(Unknown Source) > > Caused by: java.net.SocketException: Socket closed > > at java.net.SocketOutputStream.socketWrite(Unknown Source) > > at java.net.SocketOutputStream.write(Unknown Source) > > at oracle.net.ns.DataPacket.send(DataPacket.java:199) > > at oracle.net.ns.NetOutputStream.flush(NetOutputStream.java:211) > > at > oracle.net.ns.NetInputStream.getNextPacket(NetInputStream.java:227) > > at oracle.net.ns.NetInputStream.read(NetInputStream.java:175) > > at oracle.net.ns.NetInputStream.read(NetInputStream.java:100) > > at oracle.net.ns.NetInputStream.read(NetInputStream.java:85) > > at > oracle.jdbc.driver.T4CSocketInputStreamWrapper.readNextPacket(T4CSocketInputStreamWrapper.java:123) > > at > oracle.jdbc.driver.T4CSocketInputStreamWrapper.read(T4CSocketInputStreamWrapper.java:79) > > at > oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1122) > > at > oracle.jdbc.driver.T4CMAREngine.unmarshalSB1(T4CMAREngine.java:1099) > > at oracle.jdbc.driver.T4CTTIfun.receive(T4CTTIfun.java:288) > > at oracle.jdbc.driver.T4CTTIfun.doRPC(T4CTTIfun.java:191) > > at > oracle.jdbc.driver.T4C7Ocommoncall.doOLOGOFF(T4C7Ocommoncall.java:61) > > at oracle.jdbc.driver.T4CConnection.logoff(T4CConnection.java:498) > > ... 8 more > > Dec 10, 2011 1:18:34 PM > org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper nextRow > > SEVERE: Exception in entity : null > > org.apache.solr.handler.dataimport.DataImportHandlerException: Failed to > initialize DataSource: f2 > > at > org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72) > > at > org.apache.solr.handler.dataimport.DataImporter.getDataSourceInstance(DataImporter.java:333) > > at > org.apache.solr.handler.dataimport.ContextImpl.getDataSource(ContextImpl.java:99) > > at > org.apache.solr.handler.dataimport.ThreadedContext.getDataSource(ThreadedContext.java:66) > > at > org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:101) > > at > org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper.nextRow(ThreadedEntityProcessorWrapper.java:84) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:446) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.run(DocBuilder.java:399) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:466) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.run(DocBuilder.java:399) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:466) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.access$000(DocBuilder.java:353) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner$1.run(DocBuilder.java:406) > > at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown > Source) > > at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown > Source) > > at java.lang.Thread.run(Unknown Source) > > Caused by: java.lang.ClassCastException: > org.apache.solr.handler.dataimport.TikaEntityProcessor cannot be cast to > org.apache.solr.handler.dataimport.EntityProcessorWrapper > > at > org.apache.solr.handler.dataimport.FieldStreamDataSource.init(FieldStreamDataSource.java:58) > > at > org.apache.solr.handler.dataimport.DataImporter.getDataSourceInstance(DataImporter.java:331) > > ... 14 more > > Dec 10, 2011 1:18:34 PM > org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper nextRow > > SEVERE: Exception in entity : null > > org.apache.solr.handler.dataimport.DataImportHandlerException: Failed to > initialize DataSource: f2 > > at > org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72) > > at > org.apache.solr.handler.dataimport.DataImporter.getDataSourceInstance(DataImporter.java:333) > > at > org.apache.solr.handler.dataimport.ContextImpl.getDataSource(ContextImpl.java:99) > > at > org.apache.solr.handler.dataimport.ThreadedContext.getDataSource(ThreadedContext.java:66) > > at > org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:101) > > at > org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper.nextRow(ThreadedEntityProcessorWrapper.java:84) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:446) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.run(DocBuilder.java:399) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:466) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.run(DocBuilder.java:399) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.runAThread(DocBuilder.java:466) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner.access$000(DocBuilder.java:353) > > at > org.apache.solr.handler.dataimport.DocBuilder$EntityRunner$1.run(DocBuilder.java:406) > > at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown > Source) > > at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown > Source) > > at java.lang.Thread.run(Unknown Source) > > Caused by: java.lang.ClassCastException: > org.apache.solr.handler.dataimport.TikaEntityProcessor cannot be cast to > org.apache.solr.handler.dataimport.EntityProcessorWrapper > > at > org.apache.solr.handler.dataimport.FieldStreamDataSource.init(FieldStreamDataSource.java:58) > > at > org.apache.solr.handler.dataimport.DataImporter.getDataSourceInstance(DataImporter.java:331) > > ... 14 more > > Dec 10, 2011 1:18:34 PM > org.apache.solr.handler.dataimport.JdbcDataSource finalize > > SEVERE: JdbcDataSource was not closed prior to finalize(), indicates a > bug -- POSSIBLE RESOURCE LEAK!!! > > Dec 10, 2011 1:18:34 PM > org.apache.solr.handler.dataimport.ThreadedEntityProcessorWrapper nextRow > > SEVERE: Exception in entity : null > > -- > This message is automatically generated by JIRA. > If you think it was sent incorrectly, please contact your JIRA > administrators: > https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa > For more information on JIRA, see: http://www.atlassian.com/software/jira > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > > -- Sincerely yours Mikhail Khludnev Developer Grid Dynamics tel. 1-415-738-8644 Skype: mkhludnev <http://www.griddynamics.com> <[email protected]>
