OutOfMemoryError when re-indexing the repository
[ http://issues.apache.org/jira/browse/JCR-550?page=comments#action_12446865 ]

Claus Köll commented on JCR-550:
--------------------------------

OK, I didn't think about this. I will try the re-indexing process with the fixed LazyReader and we will see if it works.

At the moment I start the repository with these arguments:

-Xms1550m -Xmx3000m

I do not think that a single document causes the OutOfMemoryError. I think there are objects that are never garbage collected.

claus

> ObservationManagerFactory) - OutOfMemoryError when re-indexing the repository
> ------------------------------------------------------------------------------
>
> Key: JCR-550
> URL: http://issues.apache.org/jira/browse/JCR-550
> Project: Jackrabbit
> Issue Type: Bug
> Components: indexing
> Affects Versions: 1.0.1
> Environment: tomcat 5.0 [256 up to 512 mb of ram]
> jackrabbit 1.0.1
> jdk 1.4.2_12
> Intel Xeon 3.2GHz with 2Gb of memory
> ----
> poi-3.0-alpha2-20060616.jar
> poi-contrib-3.0-alpha2-20060616.jar
> poi-scratchpad-3.0-alpha2-20060616.jar
> jackrabbit-core-1.0.1.jar
> jackrabbit-index-filters-1.0.1.jar
> jackrabbit-jcr-commons-1.0.1.jar
> jcr-1.0.jar
> tm-extractors-0.4.jar
> lucene-1.4.3.jar
> Reporter: Christian Zanata
> Assigned To: Marcel Reutegger
> Attachments: log_files.zip
>
>
> [ERROR] 20060825 17:06:40 (org.apache.jackrabbit.core.observation.ObservationManagerFactory) - Synchronous EventConsumer threw exception. java.lang.OutOfMemoryError
> This happens when we try to re-index a repository. The repository is quite big (more than 4 GB of disk usage) and it sometimes stores documents of 40 MB.
> I have attached all the latest logs we recorded, with the full stack traces.
> Related to this we also have errors with Lucene:
> [DEBUG] 20060803 08:24:01 (org.apache.jackrabbit.core.query.LazyReader) - Dump:
> java.io.IOException: Invalid header signature; read 8656037701166316554, expected -2226271756974174256
> at org.apache.jackrabbit.core.query.MsWordTextFilter
> and then these:
> [DEBUG] 20060803 08:37:17 (org.apache.jackrabbit.core.ItemManager) - removing item 8637bf5f-4689-4e75-888f-b7b89bef40c8 from cache
> [ WARN] 20060803 08:40:13 (org.apache.jackrabbit.core.RepositoryImpl) - Existing lock file at C:\Wave\Repository\.lock deteteced. Repository was not shut down properly.
> [ERROR] 20060803 09:33:14 (org.apache.jackrabbit.core.observation.ObservationManagerFactory) - Synchronous EventConsumer threw exception.
> java.lang.NullPointerException: null values not allowed
> This is our repository.xml configuration for indexing:
> <SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
>   <param name="path" value="${wsp.home}/index"/>
>   <param name="textFilterClasses"
>          value="org.apache.jackrabbit.core.query.lucene.TextPlainTextFilter,
>                 org.apache.jackrabbit.core.query.MsExcelTextFilter,
>                 org.apache.jackrabbit.core.query.MsPowerPointTextFilter,
>                 org.apache.jackrabbit.core.query.MsWordTextFilter,
>                 org.apache.jackrabbit.core.query.PdfTextFilter,
>                 org.apache.jackrabbit.core.query.HTMLTextFilter,
>                 org.apache.jackrabbit.core.query.XMLTextFilter,
>                 org.apache.jackrabbit.core.query.RTFTextFilter,
>                 org.apache.jackrabbit.core.query.OpenOfficeTextFilter"/>
>   <param name="useCompoundFile" value="true"/>
>   <param name="minMergeDocs" value="100"/>
>   <param name="volatileIdleTime" value="3"/>
>   <param name="maxMergeDocs" value="100000"/>
>   <param name="mergeFactor" value="10"/>
>   <param name="bufferSize" value="10"/>
>   <param name="cacheSize" value="1000"/>
>   <param name="forceConsistencyCheck" value="false"/>
>   <param name="autoRepair" value="true"/>
>   <param name="respectDocumentOrder" value="false"/>
>   <param name="analyzer" value="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> </SearchIndex>

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
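
One way to test the suspicion raised in the comment (objects that are never garbage collected, as opposed to a single oversized document) is to log heap usage at intervals while the re-index runs. The following is only a minimal sketch and not part of Jackrabbit: the class name `HeapProbe` and its methods are hypothetical, and it relies solely on `java.lang.Runtime` calls that are available on JDK 1.4.

```java
/**
 * Hypothetical helper (not a Jackrabbit class) that reports Java heap usage.
 * Uses only java.lang.Runtime, so it also compiles on JDK 1.4.
 */
public class HeapProbe {

    /** Bytes currently in use on the heap: allocated heap minus free space. */
    public static long usedBytes() {
        Runtime rt = Runtime.getRuntime();
        return rt.totalMemory() - rt.freeMemory();
    }

    /** Maximum heap the JVM will attempt to use (reflects -Xmx when set). */
    public static long maxBytes() {
        return Runtime.getRuntime().maxMemory();
    }

    /** Builds a single log line with usage in megabytes. */
    public static String report(String label) {
        long mb = 1024L * 1024L;
        return label + ": used=" + (usedBytes() / mb) + "MB"
                + " max=" + (maxBytes() / mb) + "MB";
    }

    public static void main(String[] args) {
        System.out.println(report("before gc"));
        // System.gc() is only a hint to the JVM; it may or may not collect.
        System.gc();
        System.out.println(report("after gc"));
    }
}
```

If `report(...)` is called periodically during re-indexing and the "used" figure keeps climbing even after garbage collection, that would be consistent with retained objects; a sudden jump around one large document would point at the text filter instead. Either reading is an interpretation aid, not a diagnosis.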