Build failed in Jenkins: Nutch-trunk #1527

2011-06-25 Thread Apache Jenkins Server
See -- [...truncated 985 lines...] A src/plugin/subcollection/src/java/org/apache/nutch/collection A src/plugin/subcollection/src/java/org/apache/nutch/collection/Subcollection.java A

[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney

2011-06-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=196&rev2=197 == General Information == * [[http://nutch.apach

[Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney

2011-06-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "Archive and Legacy" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/Archive%20and%20Legacy?action=diff&rev1=10&rev2=11 === Internal Nutch Documentation =

[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney

2011-06-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=195&rev2=196 == Nutch 2.0 == * Nutch2Roadmap -- Discussions o

[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney

2011-06-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=194&rev2=195 * [[GORA_HBase]] -- Configuring Nutch 2.0 with GORA a

[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney

2011-06-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=193&rev2=194 * Current CommandLineOptions /!\ :TODO:Missing pages

Build failed in Jenkins: Nutch-trunk #1526

2011-06-25 Thread Apache Jenkins Server
See Changes: [markus] NUTCH-1006 MetaEquiv with single quotes not accepted [markus] NUTCH-1010 ContentLength not trimmed -- [...truncated 984 lines...] A src/plugin/subcollection/src/java/or

[jira] [Updated] (NUTCH-295) More description for fetcher.threads.fetch property

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-295: Patch Info: [Patch Available] Fix Version/s: 2.0 1.4 > More description fo

[jira] [Updated] (NUTCH-956) solrindex issues

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-956: Fix Version/s: 2.0 1.4 back on radar. > solrindex issues > > >

[jira] [Closed] (NUTCH-994) Fine tune Solr schema

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-994. --- Bulk close of resolved issues for 1.3. > Fine tune Solr schema > - > >

[jira] [Closed] (NUTCH-948) Remove Lucene dependencies

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-948. --- Bulk close of resolved issues for 1.3. > Remove Lucene dependencies > -- > >

[jira] [Closed] (NUTCH-957) fetcher.timelimit.mins is invalid when depth is greater than 1

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-957. --- Bulk close of resolved issues for 1.3. > fetcher.timelimit.mins is invalid when depth is greater than 1 >

[jira] [Closed] (NUTCH-997) IndexingFitlers to store Date objects instead of Strings

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-997. --- Bulk close of resolved issues for 1.3. > IndexingFitlers to store Date objects instead of Strings > -

[jira] [Closed] (NUTCH-962) max. redirects not handled correctly: fetcher stops at max-1 redirects

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-962. --- Bulk close of resolved issues for 1.3. > max. redirects not handled correctly: fetcher stops at max-1 red

[jira] [Closed] (NUTCH-1003) 'package' task does not reflect the new organisation of the code

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-1003. Bulk close of resolved issues for 1.3. > 'package' task does not reflect the new organisation of the c

[jira] [Closed] (NUTCH-972) Mergedb doesn't merge with empty directory, as is the case with merge (for indexes)

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-972. --- Bulk close of resolved issues for 1.3. > Mergedb doesn't merge with empty directory, as is the case with

[jira] [Closed] (NUTCH-954) Bugfix for Content-Length limit in http protocols

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-954. --- Bulk close of resolved issues for 1.3. > Bugfix for Content-Length limit in http protocols >

[jira] [Closed] (NUTCH-975) Fix missing/wrong headers in source files

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-975. --- Bulk close of resolved issues for 1.3. > Fix missing/wrong headers in source files >

[jira] [Closed] (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-939. --- Bulk close of resolved issues for 1.3. > Added -dir command line option to Indexer and SolrIndexer, allo

[jira] [Closed] (NUTCH-984) Parse-tika throws some URL's away

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-984. --- Bulk close of resolved issues for 1.3. > Parse-tika throws some URL's away >

[jira] [Closed] (NUTCH-995) Generate POM file using the Ivy makepom task

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-995. --- Bulk close of resolved issues for 1.3. > Generate POM file using the Ivy makepom task >

[jira] [Closed] (NUTCH-824) Crawling - File Error 404 when fetching file with an hexadecimal character in the file name.

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma closed NUTCH-824. --- Bulk close of resolved issues for 1.3. > Crawling - File Error 404 when fetching file with an hexadecimal

[jira] [Updated] (NUTCH-717) Make Nutch Solr integration easier

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-717: Fix Version/s: 1.4 > Make Nutch Solr integration easier > -- > >

[jira] [Updated] (NUTCH-965) Skip parsing for truncated documents

2011-06-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-965: Fix Version/s: 2.0 1.4 > Skip parsing for truncated documents > -