[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465700
]
Armel Nene commented on NUTCH-61:
-
I have attached a new patch as the old one need updating before using with
Nutch 0.
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465540
]
Sami Siren commented on NUTCH-61:
-
ok, so in my usual use case where there are far more urls than I can fetch this
sho
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465517
]
Andrzej Bialecki commented on NUTCH-61:
Actually, there is a way to do this, and this patch implements it.
We
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465493
]
Sami Siren commented on NUTCH-61:
-
Havent looked the patch (tm)
How would one manage segments after something linke th
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12464725
]
Armel Nene commented on NUTCH-61:
-
I was able to apply the patch to Nutch 0.8.1 and have it successfully running.
I th
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449332 ]
Armel Nene commented on NUTCH-61:
-
In the fetcher source code : src\java\org\apache\nutch\fetcher.java there is
this condition which checks to see status of file or
Armel T. Nene wrote:
Andrzej, the feature that I am after can be implemented by this patch if I
just adapt it right. I am not sure of this but the patch seems a little bit
old to be implemented in the latest release of Nutch 0.8.1.
Right, that's why I wrote it needs to be brought up-to-date
h-dev@lucene.apache.org
Subject: [jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting
umodified content
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449170
]
Andrzej Bialecki commented on NUTCH-61:
Unfortun
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449170 ]
Andrzej Bialecki commented on NUTCH-61:
Unfortunately, this patch hasn't been applied yet, due to its complexity and
lack of testing.
But it will be, soone
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12449128 ]
Armel Nene commented on NUTCH-61:
-
Has this patch by any chance been included in the newer release of nucth or is
any one using as Otis asked. The reason is I am abo
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12444514 ]
Otis Gospodnetic commented on NUTCH-61:
---
Has anyone been using the code with this patch applied? Just wondering if/how
well it works.
> Adaptive re-fetch int
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12368051 ]
Andrzej Bialecki commented on NUTCH-61:
I contemplated this for a while, and then decided against it.
The main reason was that currently most of the "pluggable" extensi
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12368050 ]
Jerome Charron commented on NUTCH-61:
-
Not an objection, but a simple comment.
Why not making FetchSchedule a new ExtensionPoint and then DefaultFetchSchedule
and AdaptiveFe
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361346 ]
byron miller commented on NUTCH-61:
---
Most definately! I'll be happy to give it a whirl!
> Adaptive re-fetch interval. Detecting umodified content
> ---
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361311 ]
Andrzej Bialecki commented on NUTCH-61:
I'm working on this, the patch will be available in a couple of days. I could
use then your help with review and testing... ;-)
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361302 ]
byron miller commented on NUTCH-61:
---
Is there a patch modified for the current branch or should i take a stab at
this?
> Adaptive re-fetch interval. Detecting umodified conte
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361133 ]
Andrzej Bialecki commented on NUTCH-61:
This patch already supports this. Anyway, it needs to be significantly
re-worked to fit into the current development version.
>
[
http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361131 ]
raghavendra prabhu commented on NUTCH-61:
-
Will the same thing work for a filesystem
For a file system , We can directly get the modified date store it in the db
The pl
18 matches
Mail list logo