[Nutch Wiki] Update of "ContributorsGroup" by JulienNioche

2014-09-17 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "ContributorsGroup" page has been changed by JulienNioche: https://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=13&rev2=14 * ShakehKhudikyan * riverma * JorgeLui

[jira] [Commented] (NUTCH-1841) Two nits with developer wiki page

2014-09-17 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136994#comment-14136994 ] Julien Nioche commented on NUTCH-1841: I gave you edit rights on the Wiki. Could you

Re: Nutch won't fetch the whole page if the Transfer Encoding is chunked

2014-09-17 Thread Julien Nioche
Hi, isn't that an effect of http.content.limit? Value: 65536. Description: The length limit for downloaded content using the http:// protocol, in bytes. If this value is nonnegative (>=0), content longer than it will be truncated; otherwise, no truncation at all. Do not confuse this setting with the file.content.limit
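
The truncation rule quoted in the property description can be sketched as follows. This is only an illustration of the described semantics, not Nutch's actual code: the function name `apply_content_limit` and the sample page are hypothetical.

```python
def apply_content_limit(content: bytes, limit: int) -> bytes:
    """Illustrate the http.content.limit rule described above.

    Hypothetical helper, not part of Nutch: a nonnegative limit
    truncates longer content; a negative limit disables truncation.
    """
    if limit >= 0 and len(content) > limit:
        return content[:limit]
    return content


# A made-up 100,000-byte page, larger than the 65536-byte limit.
page = b"x" * 100_000
truncated = apply_content_limit(page, 65536)
untouched = apply_content_limit(page, -1)
print(len(truncated), len(untouched))
```

Under this reading, a page larger than 65536 bytes loses everything past the limit, while a limit of -1 keeps the whole page.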

[jira] [Updated] (NUTCH-841) Create a Wicket-based Web Application for Nutch

2014-09-17 Thread Fjodor Vershinin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fjodor Vershinin updated NUTCH-841: Attachment: webui.patch. GSOC patch

[jira] [Commented] (NUTCH-1084) ReadDB url throws exception

2014-09-17 Thread Edoardo Causarano (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137156#comment-14137156 ] Edoardo Causarano commented on NUTCH-1084: Hi, I also noticed that setting HADO

[Nutch Wiki] Trivial Update of "FirstReport" by LewisJohnMcgibbney

2014-09-17 Thread Apache Wiki
The "FirstReport" page has been changed by LewisJohnMcgibbney: https://wiki.apache.org/nutch/FirstReport?action=diff&rev1=8&rev2=9 '''Mentor Name''': Lewis John McGibbney (lewismc)

[jira] [Resolved] (NUTCH-1841) Two nits with developer wiki page

2014-09-17 Thread Arthur Cinader (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arthur Cinader resolved NUTCH-1841. Resolution: Fixed

[Nutch Wiki] Trivial Update of "Becoming_A_Nutch_Developer" by ArthurCinader

2014-09-17 Thread Apache Wiki
The "Becoming_A_Nutch_Developer" page has been changed by ArthurCinader: https://wiki.apache.org/nutch/Becoming_A_Nutch_Developer?action=diff&rev1=13&rev2=14 Comment: Fix two non-critica

[jira] [Commented] (NUTCH-1841) Two nits with developer wiki page

2014-09-17 Thread Arthur Cinader (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137484#comment-14137484 ] Arthur Cinader commented on NUTCH-1841: Fixed. Failed to put this bug number in th

[jira] [Updated] (NUTCH-1832) Make Nutch work without an indexer

2014-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1832: Attachment: NUTCH-1832-2.x.patch. Patch for 2.X

[jira] [Commented] (NUTCH-1832) Make Nutch work without an indexer

2014-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137631#comment-14137631 ] Lewis John McGibbney commented on NUTCH-1832: Committed @revision 1625715 2.X

[jira] [Updated] (NUTCH-1832) Make Nutch work without an indexer

2014-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1832: Fix Version/s: 2.3

[jira] [Updated] (NUTCH-841) Create a Wicket-based Web Application for Nutch

2014-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-841: Attachment: NUTCH-841.patch. This patch includes a complete update of [~fjodor.vershinin

Re: [jira] [Updated] (NUTCH-841) Create a Wicket-based Web Application for Nutch

2014-09-17 Thread Mattmann, Chris A (3980)
Awesome job Lewis ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nas

Re: Nutch won't fetch the whole page if the Transfer Encoding is chunked

2014-09-17 Thread Sebastian Nagel
Hi, afaics, Julien is right. It's possible to check it via: bin/nutch parsechecker -Dhttp.content.limit=-1 -dumpText \ 'http://search.dangdang.com/?key=%CA%FD%BE%DD%BF%E2' With -Dhttp.content.limit=65536 (also the default) the content is truncated. Best, Sebastian On 09/17/2014 11:32 AM, J

[jira] [Commented] (NUTCH-1832) Make Nutch work without an indexer

2014-09-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137726#comment-14137726 ] Hudson commented on NUTCH-1832: SUCCESS: Integrated in Nutch-nutchgora #1158 (See [https:/

[Nutch Wiki] Update of "Release_HOWTO" by SebastianNagel

2014-09-17 Thread Apache Wiki
The "Release_HOWTO" page has been changed by SebastianNagel: https://wiki.apache.org/nutch/Release_HOWTO?action=diff&rev1=38&rev2=39 Comment: how to update apidoc links for the new relea