So I've been trying to figure this out for days now and still not even close. So I'm getting this in the log file:

FATAL 2011-06-03 14:26:39,188 [Worker thread '22'] (WorkerThread.java:955) - Error tossed: null
java.lang.NullPointerException
at org.apache.manifoldcf.agents.output.dupfinder.DupFinderConnector.addOrReplaceDocument(DupFinderConnector.java:153) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.addOrReplaceDocument(IncrementalIngester.java:1433) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.performIngestion(IncrementalIngester.java:418) at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:313) at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocument(WorkerThread.java:1565) at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:275) at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423) at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:564)

When I go to line 153 of DupFinderConnector I'm calling:

boolean isDuplicate = dataManager.insertData(timeStamp, rcdCounter++, documentURI, outputDescription, authorityNameString, document.getBinaryLength(), 1, hashsumHexValue, inputStream);

Added a log statement to print out all the parms. The only null one is authorityNameString. First off, this error shows up in a few worker threads and not all or in all crawl jobs. Any explanation or clue as to what I should be looking for?

Thanks!

Reply via email to