Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "ContributorsGroup" page has been changed by JulienNioche:
https://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=13&rev2=14
* ShakehKhudikyan
* riverma
* JorgeLui
[
https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136994#comment-14136994
]
Julien Nioche commented on NUTCH-1841:
--
I gave you edit rights on the Wiki. Could you
Hi
Isn't that an effect of
http.content.limit 65536
The length limit for downloaded content using the http://
protocol, in bytes. If this value is nonnegative (>=0), content longer than
it will be truncated; otherwise, no truncation at all. Do not confuse this
setting with the file.content.limit
[
https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Fjodor Vershinin updated NUTCH-841:
---
Attachment: webui.patch
GSOC patch
> Create a Wicket-based Web Application for Nutch
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137156#comment-14137156
]
Edoardo Causarano commented on NUTCH-1084:
--
Hi,
I also noticed that setting HADO
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "FirstReport" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/FirstReport?action=diff&rev1=8&rev2=9
'''Mentor Name''': Lewis John McGibbney (lewismc)
[
https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arthur Cinader resolved NUTCH-1841.
---
Resolution: Fixed
> Two nits with developer wiki page
> -
>
>
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "Becoming_A_Nutch_Developer" page has been changed by ArthurCinader:
https://wiki.apache.org/nutch/Becoming_A_Nutch_Developer?action=diff&rev1=13&rev2=14
Comment:
Fix two non-critica
[
https://issues.apache.org/jira/browse/NUTCH-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137484#comment-14137484
]
Arthur Cinader commented on NUTCH-1841:
---
fixed. Failed to put this bug number in th
[
https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1832:
Attachment: NUTCH-1832-2.x.patch
Patch for 2.X
> Make Nutch work without an indexer
[
https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137631#comment-14137631
]
Lewis John McGibbney commented on NUTCH-1832:
-
Committed @revision 1625715 2.X
[
https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1832:
Fix Version/s: 2.3
> Make Nutch work without an indexer
> --
[
https://issues.apache.org/jira/browse/NUTCH-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-841:
---
Attachment: NUTCH-841.patch
This patch includes a complete update of [~fjodor.vershinin
Awesome job Lewis
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nas
Hi,
afaics, Julien is right. It's possible to check it via:
bin/nutch parsechecker -Dhttp.content.limit=-1 -dumpText \
'http://search.dangdang.com/?key=%CA%FD%BE%DD%BF%E2'
With -Dhttp.content.limit=65534 (also the default) the content
is truncated.
Best,
Sebastian
On 09/17/2014 11:32 AM, J
[
https://issues.apache.org/jira/browse/NUTCH-1832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14137726#comment-14137726
]
Hudson commented on NUTCH-1832:
---
SUCCESS: Integrated in Nutch-nutchgora #1158 (See
[https:/
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "Release_HOWTO" page has been changed by SebastianNagel:
https://wiki.apache.org/nutch/Release_HOWTO?action=diff&rev1=38&rev2=39
Comment:
how to update apidoc links for the new relea
17 matches
Mail list logo