Markus Jelsma created NUTCH-1932:
Summary: Automatically remove orphaned pages
Key: NUTCH-1932
URL: https://issues.apache.org/jira/browse/NUTCH-1932
Project: Nutch
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/NUTCH-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1932:
-
Attachment: NUTCH-1932.patch
Dirty patch!
> Automatically remove orphaned pages
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1930:
Fix Version/s: 2.3.1
> Fetcher erases Markers for certain URLs / documents
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1930:
Fix Version/s: 2.4 (was: 2.3.1)
> Fetcher erases Markers for
Lewis John McGibbney created NUTCH-1933:
---
Summary: nutch-selenium plugin
Key: NUTCH-1933
URL: https://issues.apache.org/jira/browse/NUTCH-1933
Project: Nutch
Issue Type: Bug
C
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1933:
Attachment: NUTCH-selenium-trunk.patch
Patch for trunk
> nutch-selenium plugin
> --
Lewis John McGibbney created NUTCH-1934:
---
Summary: Refactor Fetcher in trunk
Key: NUTCH-1934
URL: https://issues.apache.org/jira/browse/NUTCH-1934
Project: Nutch
Issue Type: Improvement
yuanyun.cn created NUTCH-1935:
-
Summary: too many open files
Key: NUTCH-1935
URL: https://issues.apache.org/jira/browse/NUTCH-1935
Project: Nutch
Issue Type: Bug
Affects Versions: 2.2
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "ContributorsGroup" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=18&rev2=19
* ArthurCinader
* MaziyarBoustani
The "FrontPage" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/FrontPage?action=diff&rev1=292&rev2=293
* NutchMeetUps - Records of previous Nutch community
Hi Folks,
Does anyone have any good ideas for GSoC?
Seb mentioned moving Nutch towards Spark so potentially a pluggable runtime
execution engine abstraction?
I am currently working on a lot of security and authentication related work
so I would possibly be tempted to overhaul and improve that aspec
The "FrontPage" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/FrontPage?action=diff&rev1=293&rev2=294
* NutchMeetUps - Records of previous Nutch community
[
https://issues.apache.org/jira/browse/NUTCH-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306165#comment-14306165
]
stack commented on NUTCH-1935:
--
What did you have ulimit set to? See 'Limits on Number of Fi
[
https://issues.apache.org/jira/browse/NUTCH-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306189#comment-14306189
]
yuanyun.cn commented on NUTCH-1935:
---
Thanks, stack.
The limit is 4096.
cat /proc/17849/l
[
https://issues.apache.org/jira/browse/NUTCH-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14306194#comment-14306194
]
stack commented on NUTCH-1935:
--
The hbase refguide says "It is recommended to raise the ulimi
The "AdvancedAjaxInteraction" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/AdvancedAjaxInteraction
New page:
= AdvancedAjaxInteraction =
This page provides
Moving to Hadoop 2.x?
On 4 February 2015 at 14:42, Lewis John Mcgibbney wrote:
> Hi Folks,
> Does anyone have any good ideas for GSoC?
> Seb mentioned moving Nutch towards Spark so potentially a pluggable
> runtime execution engine abstraction?
> I am currently working on a lot of security and
[
https://issues.apache.org/jira/browse/NUTCH-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1934:
Attachment: NUTCH-1934.patch
Patch for trunk.
Some early observations:
* Existing N
[
https://issues.apache.org/jira/browse/NUTCH-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1934:
Attachment: (was: NUTCH-1934.patch)
> Refactor Fetcher in trunk
> --
[
https://issues.apache.org/jira/browse/NUTCH-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1934:
Patch Info: Patch Available
> Refactor Fetcher in trunk
> -
[
https://issues.apache.org/jira/browse/NUTCH-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1934:
Attachment: NUTCH-1934.patch
> Refactor Fetcher in trunk
> -
[
https://issues.apache.org/jira/browse/NUTCH-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-827:
---
Fix Version/s: 1.10 (was: 1.11)
> HTTP POST Authentication
> ---