[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118689#comment-15118689
]
Chris A. Mattmann commented on NUTCH-2206:
--
+1 please commit
> Provide example s
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The "ContributorsGroup" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=36&rev2=37
* PeterCiuffetti
* ayeshahasan
*
Hi Ammar,
I've given you write permissions for the wiki.
Feel free to create a page for your proposed work at the URL below
https://wiki.apache.org/nutch/GoogleSummerOfCode#A2016
On Fri, Jan 22, 2016 at 4:49 PM, Lewis John Mcgibbney <
lewis.mcgibb...@gmail.com> wrote:
> Hi Ammar,
> CC dev@
> Apo
[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118286#comment-15118286
]
Lewis John McGibbney commented on NUTCH-2206:
-
+1 [~sujenshah], thanks
> Pro
[
https://issues.apache.org/jira/browse/NUTCH-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117963#comment-15117963
]
ASF GitHub Bot commented on NUTCH-1712:
---
GitHub user sebastian-nagel opened a pull r
GitHub user sebastian-nagel opened a pull request:
https://github.com/apache/nutch/pull/86
NUTCH-1712 Injector to use MultipleInputs (new MR API)
Tested inject in combination with other CrawlDb tools (readdb, updatedb,
mergedb): everything seems to work smoothly, although output fil
[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sujen Shah updated NUTCH-2206:
--
Attachment: NUTCH-2206.patch
Added example for the property in nutch-default.xml
> Provide example scor
[
https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117839#comment-15117839
]
Hudson commented on NUTCH-1741:
---
SUCCESS: Integrated in Nutch-nutchgora #1548 (See
[https:/
[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117810#comment-15117810
]
Sujen Shah commented on NUTCH-2206:
---
Ohh yes, will do it now, missed it in the patch.
[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117800#comment-15117800
]
Lewis John McGibbney commented on NUTCH-2206:
-
We should most likely also prov
[
https://issues.apache.org/jira/browse/NUTCH-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sujen Shah updated NUTCH-2206:
--
Attachment: NUTCH-2206.patch
Hey [~lewismc], here's the patch providing an example for the stopword file
[
https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney resolved NUTCH-1741.
-
Resolution: Fixed
Committed revision 1726853 in 2.X
Thank you to everyone that con
[
https://issues.apache.org/jira/browse/NUTCH-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2208:
Attachment: TEST-org.apache.nutch.crawl.TestGenerator.txt
Attached is full test log
Lewis John McGibbney created NUTCH-2208:
---
Summary: Fix 4 skipped tests in TestGenerator
Key: NUTCH-2208
URL: https://issues.apache.org/jira/browse/NUTCH-2208
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1741:
Attachment: NUTCH-1741v7.patch
Managed to update this at the weekend and forgot to u
[
https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1465:
-
Fix Version/s: 1.13
> Support sitemaps in Nutch
> -
>
> Ke
[
https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117024#comment-15117024
]
Markus Jelsma commented on NUTCH-961:
-
Yes! :)
> Expose Tika's boilerpipe support
> --
[
https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117020#comment-15117020
]
Tien Nguyen Manh commented on NUTCH-961:
Can NUTCH-1233: use tika to extract outlin
[
https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15116975#comment-15116975
]
Markus Jelsma commented on NUTCH-961:
-
With boilerpipe, you get only a very few outlink
19 matches
Mail list logo