[ https://issues.apache.org/jira/browse/NUTCH-950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12977934#action_12977934 ]
Julien Nioche edited comment on NUTCH-950 at 1/5/11 2:56 PM: ------------------------------------------------------------- Committed revision 1055604 in 1.3 Committed revision 1055608 for trunk {panel} NUTCH-950 DomainURLFilter throws NPE on bogus urls (Alexis Detreglode via jnioche) {panel} will review the other submissions later was (Author: jnioche): Committed 1055604 in 1.3 NUTCH-950 DomainURLFilter throws NPE on bogus urls (Alexis Detreglode via jnioche) will commit for 2.0 later and review the other submissions > Content-Length limit, URL filter and few minor issues > ----------------------------------------------------- > > Key: NUTCH-950 > URL: https://issues.apache.org/jira/browse/NUTCH-950 > Project: Nutch > Issue Type: Bug > Affects Versions: 2.0 > Reporter: Alexis > Attachments: nutch1.patch, nutch2.patch, nutch3.patch, nutch4.patch > > > 1. crawl command (nutch1.patch) > The class was renamed to Crawler but the references to it were not updated. > 2. URL filter (nutch2.patch) > This avoids a NPE on bogus urls which host do not have a suffix. > 3. Content-Length limit (nutch3.patch) > This is related to NUTCH-899. > The patch avoids the entire flush operation on the Gora datastore to crash > because the MySQL blob limit was exceeded by a few bytes. Both protocol-http > and protocol-httpclient plugins were problematic. > 4. Ivy configuration (nutch4.patch) > - Change xercesImpl and restlet versions. These 2 version changes are > required. The first one currently makes a JUnit test crash, the second one is > missing in default Maven repository. > - Add gora-hbase, zookeeper which is an HBase dependency. Add MySQL > connector. These jars are necesary to run Gora with HBase or MySQL > datastores. (more a suggestion that a requirement here) > - Add com.jcraft/jsch, which is a protocol-sftp plugin dependency. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.