DIH still running out of memory for me, with Full Import on a database of size 1.5 GB.
Solr version: 3_5_0 Note that I have already added batchSize="-1" but getting same error. Sharing my DIH config below. ============================================ <dataConfig> <dataSource type="JdbcDataSource" name="jdbc" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost:3306/ib" user="root" password="root" batchSize="-1" /> <document name="content"> <entity name="issue" dataSource="jdbc" transformer="RegexTransformer,DateFormatTransformer, TemplateTransformer" pk="id" query=" select ib_issue.`_id` as id, ib_issue.`_issue_title` as issueTitle, ib_issue.`_issue_descr` as issueDescr, createdBy.`_name` as issueCreatedByName, createdBy.`_email` as issueCreatedByEmail from `ib_issue` inner join `ib_user` as createdBy on createdBy.`_id` = ib_issue.`_created_by_user_id` group by ib_issue.`_id` " > </entity> </document> </dataConfig> Please find the error trace below =================================== 2012-02-20 19:04:40.531:INFO::Started SocketConnector@0.0.0.0:8983 Feb 20, 2012 7:04:57 PM org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/select params={command=status&qt=/dih_ib_jdbc} status=0 QTime=0 Feb 20, 2012 7:04:58 PM org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/select params={command=show-config&qt=/dih_ib_jdbc} status=0 QTime=0 Feb 20, 2012 7:05:30 PM org.apache.solr.handler.dataimport.DataImporter doFullImport INFO: Starting Full Import Feb 20, 2012 7:05:30 PM org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/dih_ib_jdbc params={command=full-import} status=0 QTime=0 Feb 20, 2012 7:05:30 PM org.apache.solr.handler.dataimport.SolrWriter readIndexerProperties INFO: Read dih_ib_jdbc.properties Feb 20, 2012 7:05:30 PM org.apache.solr.update.DirectUpdateHandler2 deleteAll INFO: [] REMOVING ALL DOCUMENTS FROM INDEX Feb 20, 2012 7:05:30 PM org.apache.solr.core.SolrDeletionPolicy onInit INFO: SolrDeletionPolicy.onInit: commits:num=1 commit{dir=E:\workspace\solr_3_5_0\example\solr\data\index,segFN=segments_1,version=1329744880204,generation=1,filenames=[segments_1] Feb 20, 2012 7:05:30 PM org.apache.solr.core.SolrDeletionPolicy updateCommits INFO: newest commit = 1329744880204 Feb 20, 2012 7:05:30 PM org.apache.solr.handler.dataimport.JdbcDataSource$1 call INFO: Creating a connection for entity issue with URL: jdbc:mysql://localhost:3306/issueburner Feb 20, 2012 7:05:30 PM org.apache.solr.handler.dataimport.JdbcDataSource$1 call INFO: Time taken for getConnection(): 172 Feb 20, 2012 7:07:45 PM org.apache.solr.common.SolrException log SEVERE: Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.OutOfMemoryError: Java heap space at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:669) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:268) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:187) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:359) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:427) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:408) Caused by: java.lang.OutOfMemoryError: Java heap space at org.apache.lucene.util.UnicodeUtil.UTF16toUTF8(UnicodeUtil.java:377) at org.apache.lucene.store.DataOutput.writeString(DataOutput.java:103) at org.apache.lucene.index.FieldsWriter.writeField(FieldsWriter.java:200) at org.apache.lucene.index.StoredFieldsWriterPerThread.addField(StoredFieldsWriterPerThread.java:58) at org.apache.lucene.index.DocFieldProcessorPerThread.processDocument(DocFieldProcessorPerThread.java:265) at org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:766) at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:2327) at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:2299) at org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandler2.java:240) at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:61) at org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessorFactory.java:115) at org.apache.solr.handler.dataimport.SolrWriter.upload(SolrWriter.java:73) at org.apache.solr.handler.dataimport.DataImportHandler$1.upload(DataImportHandler.java:293) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:636) ... 5 more Feb 20, 2012 7:07:45 PM org.apache.solr.update.DirectUpdateHandler2 rollback INFO: start rollback Feb 20, 2012 7:07:45 PM org.apache.solr.update.DirectUpdateHandler2 rollback -- View this message in context: http://lucene.472066.n3.nabble.com/DataImportHandler-running-out-of-memory-tp490797p3760755.html Sent from the Solr - User mailing list archive at Nabble.com.