On Aug 10, 2005, at 4:16 PM, Doug Cutting wrote:
Erik Hatcher wrote:

Nevermind.... I had removed parse-html plugin from my nutch- site.xml inadvertently, which caused this issue. However, it'd be nice to get a clearer error message for this situation. Thoughts?


Perhaps in ParserFactory a synchronized HashMap should be used in place of a Hashtable, so that a null parser for a content type can still be cached. Then Fetcher.java should be changed to gracefully turn a null parser into a ParseStatus.STATUS_NOTPARSED. Does that sound right?

Seems good to me, at least the Fetcher.java change. Is the synchronized HashMap change necessary for the Fetcher.java change to work properly?

I can probably make these changes and commit them. But I'm wondering if there will be merge pains for the MapReduce branch with trunk changes like this.

    Erik



-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to