I would guess that dataManager is null. The only other possibility is that document is null, and I don't think that can happen.
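
The top frame of the trace is line 153 of addOrReplaceDocument itself, not a frame inside insertData, so the exception comes from something dereferenced on that line. A null argument such as authorityNameString is simply passed through and does not throw at the call site. A quick way to confirm which reference is the culprit is to check both just before the call. This is only a minimal sketch: `NullReceiverCheck` and `nullReceivers` are hypothetical names, not part of ManifoldCF, and plain `Object` parameters stand in for the real connector types.

```java
import java.util.ArrayList;
import java.util.List;

public class NullReceiverCheck {
    /**
     * Returns the names of the references dereferenced on the failing line
     * that are null. Hypothetical helper, for illustration only.
     */
    public static List<String> nullReceivers(Object dataManager, Object document) {
        List<String> found = new ArrayList<>();
        if (dataManager == null) {
            found.add("dataManager");
        }
        if (document == null) {
            found.add("document");
        }
        return found;
    }

    public static void main(String[] args) {
        // Simulate the suspected case: dataManager is null, document is not.
        System.out.println(nullReceivers(null, new Object()));
    }
}
```

In the connector, the same two checks could be logged right before the insertData call, so the log line shows which of the two was null in the worker threads that fail.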

Karl

On Fri, Jun 3, 2011 at 4:11 PM, Farzad Valad <ho...@farzad.net> wrote:
> So I've been trying to figure this out for days now and still not even
> close. So I'm getting this in the log file:
>
> FATAL 2011-06-03 14:26:39,188 [Worker thread '22'] (WorkerThread.java:955) - Error tossed: null
> java.lang.NullPointerException
>     at org.apache.manifoldcf.agents.output.dupfinder.DupFinderConnector.addOrReplaceDocument(DupFinderConnector.java:153)
>     at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.addOrReplaceDocument(IncrementalIngester.java:1433)
>     at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.performIngestion(IncrementalIngester.java:418)
>     at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentIngest(IncrementalIngester.java:313)
>     at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.ingestDocument(WorkerThread.java:1565)
>     at org.apache.manifoldcf.crawler.connectors.filesystem.FileConnector.processDocuments(FileConnector.java:275)
>     at org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
>     at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:564)
>
> When I go to line 153 of DupFinderConnector I'm calling:
>
> boolean isDuplicate = dataManager.insertData(timeStamp,
>     rcdCounter++, documentURI, outputDescription, authorityNameString,
>     document.getBinaryLength(), 1, hashsumHexValue, inputStream);
>
> Added a log statement to print out all the parms. The only null one is
> authorityNameString. First off, this error shows up in a few worker threads
> and not all or in all crawl jobs. Any explanation or clue as to what I
> should be looking for?
>
> Thanks!
>