[
https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12854767#action_12854767
]
Chris A. Mattmann commented on NUTCH-570:
-
Hi Otis:
I think your logic perfectly ra
[
https://issues.apache.org/jira/browse/NUTCH-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12854665#action_12854665
]
Otis Gospodnetic commented on NUTCH-570:
I'm tempted to close this issue as Won't Fi
Just a question ?
Will the new HBase implementation allow more sophisticated crawling
strategies than the current score based.
Give you a few example of what I'd like to do :
Define different crawling frequency for different set of URLs, say
weekly for some url, monthly or more for others.
Selec
On 2010-04-07 19:24, Enis Söztutar wrote:
>>> Also, the goal of the crawler-commons project is to provide APIs and
>>> implementations of stuff that is needed for every open source crawler
>>> project, like: robots handling, url filtering and url normalization, URL
>>> state management, perhaps de
On 2010-04-07 18:54, Doğacan Güney wrote:
> Hey everyone,
>
> On Tue, Apr 6, 2010 at 20:23, Andrzej Bialecki wrote:
>> On 2010-04-06 15:43, Julien Nioche wrote:
>>> Hi guys,
>>>
>>> I gather that we'll jump straight to 2.0 after 1.1 and that 2.0 will be
>>> based on what is currently referred to
Forgot to say that, at Hadoop, it is the convention that big issues,
like the ones under discussion come with a design document. So that a
solid design is agreed upon for the work. We can apply the same pattern
at Nutch.
On 04/07/2010 07:54 PM, Doğacan Güney wrote:
Hey everyone,
On Tue, Apr
Hi,
On 04/07/2010 07:54 PM, Doğacan Güney wrote:
Hey everyone,
On Tue, Apr 6, 2010 at 20:23, Andrzej Bialecki wrote:
On 2010-04-06 15:43, Julien Nioche wrote:
Hi guys,
I gather that we'll jump straight to 2.0 after 1.1 and that 2.0 will be
based on what is currently referred to a
Hey everyone,
On Tue, Apr 6, 2010 at 20:23, Andrzej Bialecki wrote:
> On 2010-04-06 15:43, Julien Nioche wrote:
>> Hi guys,
>>
>> I gather that we'll jump straight to 2.0 after 1.1 and that 2.0 will be
>> based on what is currently referred to as NutchBase. Shall we create a
>> branch for 2.0 in
[
https://issues.apache.org/jira/browse/NUTCH-650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-650:
Affects Version/s: (was: 1.0.0)
Fix Version/s: 2.0
> Hbase Integration
> ---
[
https://issues.apache.org/jira/browse/NUTCH-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-808:
Fix Version/s: 2.0
> Evaluate ORM Frameworks which support non-relational column-oriented
> datasto
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "Nutch2Roadmap" page has been changed by JulienNioche.
http://wiki.apache.org/nutch/Nutch2Roadmap?action=diff&rev1=1&rev2=2
--
*
Hi,
I'm not sure what is the status of the nutchbase - it's missed a lot of
> fixes and changes in trunk since it's been last touched ...
>
yes, maybe we should start the 2.0 branch from 1.1 instead
Dogacan - what do you think?
BTW I see there is now a 2.0 label under JIRA, thanks to whoever add
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "Nutch2Roadmap" page has been changed by JulienNioche.
http://wiki.apache.org/nutch/Nutch2Roadmap
--
New page:
= Nutch2Roadmap =
Her
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "FrontPage" page has been changed by JulienNioche.
http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=128&rev2=129
--
* [[Mai
..and here is to a Vote: +1
> Oh, per usual, forgot to throw in my +1. So, +1!
>
> Cheers,
> Chris
>
>
> On 4/7/10 1:14 AM, "Mattmann, Chris A (388J)"
> wrote:
>
> Hi Folks,
>
> I have posted a candidate for the Apache Nutch 1.1 release. The source
> code
> is at:
>
> http://people.apache.org/~ma
15 matches
Mail list logo