[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978748#comment-14978748
]
Chris A. Mattmann commented on NUTCH-2153:
--
Can you be more specific here, [~ahmadia]?
> Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978854#comment-14978854
]
Chris A. Mattmann commented on NUTCH-2153:
--
Yeah I think we may want to do something async here
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978902#comment-14978902
]
Aron Ahmadia commented on NUTCH-2132:
-
[~sujenshah] - I'm reviewing this again now. One issue I'm
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978938#comment-14978938
]
Aron Ahmadia commented on NUTCH-2132:
-
I'm observing crashes when fetcher.publisher is set to false.
[
https://issues.apache.org/jira/browse/NUTCH-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated NUTCH-2154:
-
Fix Version/s: 1.11
> Nutch REST API (DB) suffering NullPointerException
>
[
https://issues.apache.org/jira/browse/NUTCH-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann reassigned NUTCH-2154:
Assignee: Chris A. Mattmann
> Nutch REST API (DB) suffering NullPointerException
>
[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978841#comment-14978841
]
Aron Ahmadia commented on NUTCH-2153:
-
If it's asynchronous, use a POST and return a crawldb_job
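A minimal sketch of the asynchronous pattern suggested here, assuming hypothetical names throughout (JobRegistry, submit, status); none of them are taken from the Nutch REST API. The idea is that a POST handler queues the work and hands back a job id at once, and the client polls for the result later.

```java
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical job registry; these names are illustrative, not Nutch classes.
class JobRegistry {
    private final ExecutorService pool = Executors.newSingleThreadExecutor();
    private final Map<String, Future<?>> jobs = new ConcurrentHashMap<>();

    // What a POST handler would call: queue the work, return an id immediately.
    String submit(Runnable work) {
        String id = UUID.randomUUID().toString();
        jobs.put(id, pool.submit(work));
        return id;
    }

    // What a GET handler would call: report the job state without blocking.
    // FINISHED also covers jobs that ended with an exception or were cancelled.
    String status(String id) {
        Future<?> job = jobs.get(id);
        if (job == null) {
            return "UNKNOWN";
        }
        return job.isDone() ? "FINISHED" : "RUNNING";
    }

    // Stop accepting new jobs; work that is already queued still runs.
    void shutdown() {
        pool.shutdown();
    }
}
```

Returning the id rather than the result keeps the request short even when the underlying crawldb operation is slow.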
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978911#comment-14978911
]
Sujen Shah commented on NUTCH-2132:
---
[~ahmadia],
bq. One issue I'm having is that if I start a Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978769#comment-14978769
]
Chris A. Mattmann commented on NUTCH-2153:
--
Gotcha, thanks [~ahmadia]
> Nutch REST API (DB) uses
[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978764#comment-14978764
]
Aron Ahmadia commented on NUTCH-2153:
-
The API from https://wiki.apache.org/nutch/Nutch_1.X_RESTAPI:
[
https://issues.apache.org/jira/browse/NUTCH-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978826#comment-14978826
]
Sujen Shah commented on NUTCH-2153:
---
Hi [~ahmadia] and [~chrismattmann],
Currently, while using Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978945#comment-14978945
]
Aron Ahmadia commented on NUTCH-2132:
-
got it.
> Publisher/Subscriber model for Nutch to emit events
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978944#comment-14978944
]
Aron Ahmadia commented on NUTCH-2132:
-
I think the protection belongs in
public void
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978942#comment-14978942
]
Sujen Shah commented on NUTCH-2132:
---
Yes the first patch does not have that property, it was
Aron Ahmadia created NUTCH-2154:
---
Summary: Nutch REST API (DB) suffering NullPointerException
Key: NUTCH-2154
URL: https://issues.apache.org/jira/browse/NUTCH-2154
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978811#comment-14978811
]
Chris A. Mattmann commented on NUTCH-2154:
--
I have to respin 1.11 anyways, so I'll take a look at
[
https://issues.apache.org/jira/browse/NUTCH-2154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978824#comment-14978824
]
Aron Ahmadia commented on NUTCH-2154:
-
Looks like it's assumed that "args" is passed in to the REST
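One way to guard against a missing "args" object, sketched under the assumption that the NullPointerException comes from dereferencing an absent map. The truncated comment above suggests this but does not confirm it. ArgsGuard, orEmpty and lookup are hypothetical names, not Nutch code.

```java
import java.util.Collections;
import java.util.Map;

// Hypothetical helper; not part of the Nutch REST service.
class ArgsGuard {
    // Treat a missing "args" object as empty instead of dereferencing null.
    static Map<String, String> orEmpty(Map<String, String> args) {
        return args == null ? Collections.<String, String>emptyMap() : args;
    }

    // Read one argument, falling back when the map or the key is absent.
    static String lookup(Map<String, String> args, String key, String fallback) {
        String value = orEmpty(args).get(key);
        return value == null ? fallback : value;
    }
}
```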
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978908#comment-14978908
]
Aron Ahmadia commented on NUTCH-2132:
-
Also, the vice-versa situation is important as well. Can I
[
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14978952#comment-14978952
]
Sujen Shah commented on NUTCH-2132:
---
Yes this is taken care of in the second patch. And, apply the
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "NewScoringIndexingExample" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/NewScoringIndexingExample?action=diff=5=6
bin/nutch
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979831#comment-14979831
]
Lewis John McGibbney commented on NUTCH-1800:
-
For those who want to see the docs you can see
Good Evening Yves,
I'm contacting you on behalf of the Apache Nutch project management team.
Apache Nutch [0] is a top level open source software project at the Apache
Software Foundation licensed under the Apache License v2.0.
I am writing to request a license key for our project. I
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979830#comment-14979830
]
Sujen Shah commented on NUTCH-1800:
---
Thanks Lewis for this, it is going to be really helpful to the
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1800:
Flags: Patch
Patch Info: Patch Available
> Documentation for Nutch 1.X and
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney reassigned NUTCH-1800:
---
Assignee: Lewis John McGibbney
> Documentation for Nutch 1.X and 2.X REST
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1800:
Fix Version/s: (was: 2.4)
2.3.1
1.11
>
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979828#comment-14979828
]
Lewis John McGibbney commented on NUTCH-1800:
-
[~sujenshah]
> Documentation for Nutch 1.X and
[
https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1800:
Attachment: NUTCH-1800.patch
Patch for trunk. Currently uses my own license key
Dear Wiki user,
You have subscribed to a wiki page "NewScoringIndexingExample" for change
notification. An attachment has been added to that page by LewisJohnMcgibbney.
Following detailed information is available:
Attachment name: NutchWebGraph.png
Attachment size: 859412
Attachment link:
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979258#comment-14979258
]
ASF GitHub Bot commented on NUTCH-2155:
---
GitHub user MJJoyce opened a pull request:
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "NewScoringIndexingExample" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/NewScoringIndexingExample?action=diff=6=7
'''N.B.''' This page and the
Michael Joyce created NUTCH-2155:
Summary: Create a "crawl completeness" utility
Key: NUTCH-2155
URL: https://issues.apache.org/jira/browse/NUTCH-2155
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979196#comment-14979196
]
Michael Joyce commented on NUTCH-2155:
--
Should have a first patch up shortly for review folks
>
GitHub user MJJoyce opened a pull request:
https://github.com/apache/nutch/pull/83
NUTCH-2155 - Add crawl completion utility
- Add a simple crawl completion utility that reports the count of fetched and
unfetched pages per domain or host.
- Update "nutch" helper script with new
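To illustrate the kind of report described in the pull request, here is a small in-memory sketch that tallies fetched and unfetched URLs per host. It is not the CrawlCompletionStats implementation from the patch. HostCompletion and tally are hypothetical names, and the input map stands in for whatever the real utility reads from the crawl data.

```java
import java.net.URI;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; the utility in the patch is a separate class.
class HostCompletion {
    // Returns host -> {fetched count, unfetched count}.
    // Assumes every value in the input map is non-null.
    static Map<String, int[]> tally(Map<String, Boolean> urlToFetched) {
        Map<String, int[]> byHost = new TreeMap<>();
        for (Map.Entry<String, Boolean> entry : urlToFetched.entrySet()) {
            String host = URI.create(entry.getKey()).getHost();
            if (host == null) {
                continue; // skip URLs that have no host part
            }
            int[] counts = byHost.computeIfAbsent(host, h -> new int[2]);
            counts[entry.getValue() ? 0 : 1]++;
        }
        return byHost;
    }
}
```

Grouping by domain instead of host would only change how the key is derived from the URL.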
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "NewScoringIndexingExample" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/NewScoringIndexingExample?action=diff=7=8
= Class Diagram =
+ Below is
Github user MJJoyce commented on a diff in the pull request:
https://github.com/apache/nutch/pull/83#discussion_r43324656
--- Diff: src/java/org/apache/nutch/util/CrawlCompletionStats.java ---
@@ -0,0 +1,189 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979327#comment-14979327
]
ASF GitHub Bot commented on NUTCH-2155:
---
Github user lewismc commented on a diff in the pull
Github user lewismc commented on a diff in the pull request:
https://github.com/apache/nutch/pull/83#discussion_r43324772
--- Diff: src/java/org/apache/nutch/util/CrawlCompletionStats.java ---
@@ -0,0 +1,189 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979325#comment-14979325
]
ASF GitHub Bot commented on NUTCH-2155:
---
Github user MJJoyce commented on a diff in the pull
Github user MJJoyce commented on a diff in the pull request:
https://github.com/apache/nutch/pull/83#discussion_r43325287
--- Diff: src/java/org/apache/nutch/util/CrawlCompletionStats.java ---
@@ -0,0 +1,189 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979319#comment-14979319
]
ASF GitHub Bot commented on NUTCH-2155:
---
Github user lewismc commented on a diff in the pull
Github user lewismc commented on a diff in the pull request:
https://github.com/apache/nutch/pull/83#discussion_r43324357
--- Diff: src/java/org/apache/nutch/util/CrawlCompletionStats.java ---
@@ -0,0 +1,189 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979335#comment-14979335
]
ASF GitHub Bot commented on NUTCH-2155:
---
Github user MJJoyce commented on a diff in the pull
Sujen Shah created NUTCH-2156:
-
Summary: Dump via Services end point
Key: NUTCH-2156
URL: https://issues.apache.org/jira/browse/NUTCH-2156
Project: Nutch
Issue Type: Sub-task