This may help. batchSize: the batch size used for the JDBC connection.
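As a rough sketch of where that attribute goes: batchSize is set on the dataSource element of the DIH data-config.xml. The stack trace in the quoted message ends in com.mysql.jdbc.MysqlIO.readAllResults, which suggests the MySQL driver is reading the whole result set into memory. As far as I know, batchSize="-1" makes DIH pass Integer.MIN_VALUE as the JDBC fetch size, which MySQL Connector/J treats as a request to stream rows one at a time. The URL and the entity name "offer" come from the log; the user, password, query, and field names are placeholders I made up for illustration.

```xml
<dataConfig>
  <!-- batchSize="-1" is the commonly suggested value for MySQL; it should
       make the driver stream rows instead of buffering all of them.
       user/password below are hypothetical placeholders. -->
  <dataSource type="JdbcDataSource"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://domU-12-31-39-10-59-01.compute-1.internal/jounce1"
              user="db_user"
              password="db_password"
              batchSize="-1"/>
  <document>
    <!-- Entity name taken from the log; query and columns are assumed. -->
    <entity name="offer" query="SELECT id, title FROM offer">
      <field column="id" name="id"/>
      <field column="title" name="title"/>
    </entity>
  </document>
</dataConfig>
```

If that is what is happening, raising the heap alone would not help, since the driver would still try to hold all 3 million rows before DIH sees the first one.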
http://wiki.apache.org/solr/DataImportHandler#Configuring_DataSources

From: Geek Gamer [via Lucene] [mailto:ml-node+809069-2054572211-124...@n3.nabble.com]
Sent: Monday, May 10, 2010 9:42 PM
To: caman
Subject: DIH full-import memory issue

Hi,

I am facing issues with DIH full-import. I have a database with 3 million records that will translate into an index size of 6 GB. When I try to do a full import, I get an out-of-memory error like:

INFO: Starting Full Import
May 10, 2010 11:44:06 PM org.apache.solr.handler.dataimport.SolrWriter readIndexerProperties
WARNING: Unable to read: dataimport.properties
May 10, 2010 11:44:06 PM org.apache.solr.update.DirectUpdateHandler2 deleteAll
INFO: [] REMOVING ALL DOCUMENTS FROM INDEX
May 10, 2010 11:44:06 PM org.apache.solr.core.SolrDeletionPolicy onInit
INFO: SolrDeletionPolicy.onInit: commits:num=1
    commit{dir=/home/search/SOLR/solr/data/index,segFN=segments_1,version=1273549043650,generation=1,filenames=[segments_1]
May 10, 2010 11:44:06 PM org.apache.solr.core.SolrDeletionPolicy updateCommits
INFO: newest commit = 1273549043650
May 10, 2010 11:44:06 PM org.apache.solr.handler.dataimport.JdbcDataSource$1 call
INFO: Creating a connection for entity offer with URL: jdbc:mysql://domU-12-31-39-10-59-01.compute-1.internal/jounce1
May 10, 2010 11:44:07 PM org.apache.solr.handler.dataimport.JdbcDataSource$1 call
INFO: Time taken for getConnection(): 301
Exception in thread "Timer-1" java.lang.OutOfMemoryError: Java heap space
    at java.util.HashMap.newValueIterator(HashMap.java:843)
    at java.util.HashMap$Values.iterator(HashMap.java:910)
    at org.mortbay.jetty.servlet.HashSessionManager.scavenge(HashSessionManager.java:180)
    at org.mortbay.jetty.servlet.HashSessionManager.access$000(HashSessionManager.java:36)
    at org.mortbay.jetty.servlet.HashSessionManager$1.run(HashSessionManager.java:144)
    at java.util.TimerThread.mainLoop(Timer.java:512)
    at java.util.TimerThread.run(Timer.java:462)
May 10, 2010 11:54:54 PM org.apache.solr.handler.dataimport.DataImporter doFullImport
SEVERE: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.OutOfMemoryError: Java heap space
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:424)
    at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:242)
    at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:180)
    at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:331)
    at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:389)
    at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:370)
Caused by: java.lang.OutOfMemoryError: Java heap space
    at com.mysql.jdbc.MysqlIO.nextRowFast(MysqlIO.java:1621)
    at com.mysql.jdbc.MysqlIO.nextRow(MysqlIO.java:1398)
    at com.mysql.jdbc.MysqlIO.readSingleRowSet(MysqlIO.java:2816)
    at com.mysql.jdbc.MysqlIO.getResultSet(MysqlIO.java:467)
    at com.mysql.jdbc.MysqlIO.readResultsForQueryOrUpdate(MysqlIO.java:2510)
    at com.mysql.jdbc.MysqlIO.readAllResults(MysqlIO.java:1746)
    at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2135)
    at com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2536)
    at com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2465)
    at com.mysql.jdbc.StatementImpl.execute(StatementImpl.java:734)
    at org.apache.solr.handler.dataimport.JdbcDataSource$ResultSetIterator.<init>(JdbcDataSource.java:246)
    at org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:210)
    at org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:39)
    at org.apache.solr.handler.dataimport.SqlEntityProcessor.initQuery(SqlEntityProcessor.java:58)
    at org.apache.solr.handler.dataimport.SqlEntityProcessor.nextRow(SqlEntityProcessor.java:71)
    at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:357)
    ... 5 more
May 10, 2010 11:54:54 PM org.apache.solr.update.DirectUpdateHandler2 rollback
INFO: start rollback
May 10, 2010 11:54:54 PM org.apache.solr.update.DirectUpdateHandler2 rollback
INFO: end_rollback

I tried allocating 4 GB of memory to the VM, but no luck. Are the records cached before indexing, or streamed? Any pointers to documentation?

Thanks in anticipation,
umar

_____
View message @ http://lucene.472066.n3.nabble.com/DIH-full-import-memory-issue-tp809069p809069.html

--
View this message in context: http://lucene.472066.n3.nabble.com/DIH-full-import-memory-issue-tp809069p809076.html
Sent from the Solr - User mailing list archive at Nabble.com.