GitHub user dyzsasd opened a pull request: https://github.com/apache/nutch/pull/72
Branch 2.3.1 You can merge this pull request into a Git repository by running: $ git pull https://github.com/apache/nutch branch-2.3.1 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/nutch/pull/72.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #72 ---- commit fa88ac21de22536c7bd464d59204d8fbf034aa53 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-06-27T17:21:35Z prepare for new development git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1497462 13f79535-47bb-0310-9956-ffa450edef68 commit 9728ed2267e359772c6e8aa61f0bde69b7237f2d Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-06-27T18:01:56Z update for release report git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1497480 13f79535-47bb-0310-9956-ffa450edef68 commit e868ed8d22f0ff69f7fa0da60269d09f30698469 Author: lufeng <fen...@apache.org> Date: 2013-07-01T13:34:23Z NUTCH-1594 count variable is never changed in ParseUtil class git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1498437 13f79535-47bb-0310-9956-ffa450edef68 commit fe9ea2aad1e75e419048d454992a5a56ceac8a1d Author: Markus Jelsma <mar...@apache.org> Date: 2013-07-05T10:27:47Z NUTCH-1595 Upgrade to Tika 1.4 (jnioche, markus) git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1499959 13f79535-47bb-0310-9956-ffa450edef68 commit d5cb787bead9589df0fe4f896fbb2ed17f059d9c Author: Julien Nioche <jnio...@apache.org> Date: 2013-07-08T08:50:08Z NUTCH-1604 Protocol-factory not thread-safe git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1500610 13f79535-47bb-0310-9956-ffa450edef68 commit ccd793cd35768377231d77c01c5e9a9b700694f1 Author: Sebastian Nagel <sna...@apache.org> Date: 2013-07-25T21:15:02Z NUTCH-1587 misspelled property "threshold" in conf/log4j.properties git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1507131 13f79535-47bb-0310-9956-ffa450edef68 commit d4deef989ffc41b9dd5e77683e73286d81e1178b Author: Sebastian Nagel <sna...@apache.org> Date: 2013-08-07T21:10:17Z NUTCH-911 protocol-file to return proper protocol status for notmodified, gone, access_denied git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1511496 13f79535-47bb-0310-9956-ffa450edef68 commit 46dae3c0f754f212f7260d897bbd0785c19cd418 Author: lufeng <fen...@apache.org> Date: 2013-08-13T15:17:05Z NUTCH-1294 IndexClean job with solr implementation. git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1513543 13f79535-47bb-0310-9956-ffa450edef68 commit f7a76daaeb0c0f3686ececb1d946529f28f6ff17 Author: lufeng <fen...@apache.org> Date: 2013-08-13T15:21:34Z NUTCH-1294 IndexClean job with solr implementation. git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1513548 13f79535-47bb-0310-9956-ffa450edef68 commit 0508944f9bfbbf5f6b6898a95d156d2977ab3137 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-08-18T23:02:53Z NUTCH-1624 Typo in WebTableReader line 486 git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1515240 13f79535-47bb-0310-9956-ffa450edef68 commit 86c1f5584a49d45ac1d150a8dafedbd2af7351c1 Author: Julien Nioche <jnio...@apache.org> Date: 2013-08-23T08:52:38Z NUTCH-1629 Injector skips empty lines git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1516752 13f79535-47bb-0310-9956-ffa450edef68 commit 936389646645b84816579f30c96077a678de5b1c Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-08-23T19:47:16Z NUTCH-1631 Display Document Count Added to Solr Server git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517003 13f79535-47bb-0310-9956-ffa450edef68 commit 33bed204bb922e9d5b3f3d67f2b61757ce3fdd9e Author: lufeng <fen...@apache.org> Date: 2013-08-24T15:21:20Z NUTCH-1619 Writes Dmoz Description and Title information to db with snippet argument. git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517147 13f79535-47bb-0310-9956-ffa450edef68 commit a0030f4ef10f2866ccae90afadc8f3460911f88d Author: lufeng <fen...@apache.org> Date: 2013-08-24T15:50:01Z NUTCH-1619 Writes Dmoz Description and Title information to db with snippet argument. git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517155 13f79535-47bb-0310-9956-ffa450edef68 commit 1d62b185abbd6f98c3dd644861bfb44d036bde8a Author: lufeng <fen...@apache.org> Date: 2013-09-05T14:40:25Z NUTCH-1556 enabling updatedb to accept batchId git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1520332 13f79535-47bb-0310-9956-ffa450edef68 commit 3a0eb5bdcb2a3ab14c4cf1093e50e3e5dc5ffd8b Author: lufeng <fen...@apache.org> Date: 2013-09-12T13:23:24Z NUTCH-1556 enabling updatedb to accept batchId git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1522566 13f79535-47bb-0310-9956-ffa450edef68 commit 3a63fe35fb5c1e07d061a33980070110b30660cd Author: Julien Nioche <jnio...@apache.org> Date: 2013-09-20T08:03:24Z NUTCH-1641 Log timings for main jobs git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1524931 13f79535-47bb-0310-9956-ffa450edef68 commit 25d97ee80cf5815bba35ff929619e5f00f74d39b Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-10-27T11:54:36Z NUTCH-1124 JUnit tests for OPIC Scoring git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1536106 13f79535-47bb-0310-9956-ffa450edef68 commit 0162aef1ef287292394a2ef078381dbb0f73a659 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-01T18:44:23Z NUTCH-1125 JUnit test for TLD git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538023 13f79535-47bb-0310-9956-ffa450edef68 commit 9bfd1fdd00aee4cb6f5c049e305bde6d3917f573 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-02T14:03:57Z NUTCH-1413 Record response time git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538193 13f79535-47bb-0310-9956-ffa450edef68 commit 295ea6bf338a8fd762ca6ba855011bd966222aba Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-02T14:11:04Z NUTCH-1650 Adaptive Fetch Scheduler interval Wrong Set git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538195 13f79535-47bb-0310-9956-ffa450edef68 commit de47b8e2e39c250b9cf2c76070b57aa739494d3a Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-02T14:16:28Z NUTCH-1588 Port NUTCH-1245 URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again to 2.x git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538200 13f79535-47bb-0310-9956-ffa450edef68 commit 1b15606816bf76dca22df7ff644e36db0e145eb6 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-02T20:52:19Z NUTCH-1360 Suport the storing of IP address connected to when web crawling git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538280 13f79535-47bb-0310-9956-ffa450edef68 commit 0429d858a2379cac33fb8f64a39e6d9c0fce5d02 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-04T19:11:16Z NUTCH-1651 modifiedTime and prevmodifiedTime never set git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538723 13f79535-47bb-0310-9956-ffa450edef68 commit 01d5123ace23143974d1b9b5d364764c6c073b93 Author: Julien Nioche <jnio...@apache.org> Date: 2013-11-14T12:12:32Z Removed all in one Crawl class (NUTCH-1621) git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1541886 13f79535-47bb-0310-9956-ffa450edef68 commit 38aa2dc51a869215aac52ead46274da582635a37 Author: Julien Nioche <jnio...@apache.org> Date: 2013-11-15T09:20:03Z Removed all in one Crawl class (NUTCH-1621) git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1542208 13f79535-47bb-0310-9956-ffa450edef68 commit c96a55308d639de95083f090494f0a4a36be54e0 Author: Sebastian Nagel <sna...@apache.org> Date: 2013-11-21T22:04:13Z NUTCH-1587 misspelled property "threshold" in log4j.properties git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1544341 13f79535-47bb-0310-9956-ffa450edef68 commit 7232641bba876a1b423061331bb047be2c4cbf2a Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-11-27T10:14:18Z NUTCH-1673 Title isn't reset in MoreIndexingFilter git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1545982 13f79535-47bb-0310-9956-ffa450edef68 commit 0ab335e9f73194e30dcd5d2065996853067b42f1 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-12-23T15:06:41Z NUTCH-1681 In URLUtil.java, toUNICODE method does not work correctly git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1553125 13f79535-47bb-0310-9956-ffa450edef68 commit f6cd10fa70e757e53aef8be8c179ad638dd73e94 Author: Lewis John McGibbney <lewi...@apache.org> Date: 2013-12-23T17:17:53Z NUTCH-1360 Support the storing of IP address connected to when web crawling git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1553154 13f79535-47bb-0310-9956-ffa450edef68 ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---