Markus Jelsma created NUTCH-1955:
Summary: ByteWritable missing in NutchWritable
Key: NUTCH-1955
URL: https://issues.apache.org/jira/browse/NUTCH-1955
Project: Nutch
Issue Type: Task
[
https://issues.apache.org/jira/browse/NUTCH-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1955:
-
Attachment: NUTCH-1955.patch
Patch for trunk.
ByteWritable missing in NutchWritable
Good Afternoon Ashwini,
You can find out information about the project at the Nutch project wiki,
which is here -
https://wiki.apache.org/nutch/GoogleSummerOfCode#NUTCH-1936_GSoC_2015_-_Move_Nutch_to_Hadoop_2.X
We are looking for students to provide input to their project proposals
based on the
[
https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ashwini Tokekar updated NUTCH-1936:
---
Comment: was deleted
(was: Thanks Lewis)
GSoC 2015 - Move Nutch to Hadoop 2.X
[
https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356294#comment-14356294
]
Ashwini Tokekar commented on NUTCH-1936:
Thanks Lewis
GSoC 2015 - Move Nutch to
[
https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356295#comment-14356295
]
Ashwini Tokekar commented on NUTCH-1936:
Thanks Lewis
GSoC 2015 - Move Nutch to
Great thanks. I'll add you to the wiki tomorrow.
Best
Lewis
On Tuesday, March 10, 2015, ASHWINI TOKEKAR tokekar.ashw...@gmail.com
wrote:
Thanks, Lewis, for your prompt reply. My wiki username is:
ashwinitokekar. I will send you a project proposal in the format mentioned
by you by 15th March.
[
https://issues.apache.org/jira/browse/NUTCH-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1956:
-
Attachment: NUTCH-1956.patch
Patch for trunk.
Members to be public in URLCrawlDatum
Dear Wiki user,
You have subscribed to a wiki page GiuseppeTotaro for change notification. An
attachment has been added to that page by GiuseppeTotaro. Following detailed
information is available:
Attachment name: CommonCrawlDataDumper_v02.pdf
Attachment size: 99670
Attachment link:
Dear Wiki user,
You have subscribed to a wiki page CommonCrawlDataDumper for change
notification. An attachment has been added to that page by GiuseppeTotaro.
Following detailed information is available:
Attachment name: CommonCrawlDataDumper_v02.png
Attachment size: 771605
Attachment link:
Dear Wiki user,
You have subscribed to a wiki page CommonCrawlDataDumper for change
notification. An attachment has been added to that page by GiuseppeTotaro.
Following detailed information is available:
Attachment name: CommonCrawlDataDumper_v02.png
Attachment size: 325140
Attachment link:
Dear Wiki user,
You have subscribed to a wiki page GiuseppeTotaro for change notification. An
attachment has been added to that page by GiuseppeTotaro. Following detailed
information is available:
Attachment name: CommonCrawlDataDumper_v02.png
Attachment size: 771605
Attachment link:
Dear Wiki user,
You have subscribed to a wiki page CommonCrawlDataDumper for change
notification. An attachment has been added to that page by GiuseppeTotaro.
Following detailed information is available:
Attachment name: CommonCrawlDataDumper_v02.png
Attachment size: 234312
Attachment link:
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The CommonCrawlDataDumper page has been changed by GiuseppeTotaro:
https://wiki.apache.org/nutch/CommonCrawlDataDumper?action=diff&rev1=1&rev2=2
- The CommonCrawlDataDumper is a Nutch tool
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The ContributorsGroup page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=23&rev2=24
* JayavanthShenoy
* GiuseppeTotaro
*
[
https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355939#comment-14355939
]
Lewis John McGibbney commented on NUTCH-1936:
-
Hi [~ashwini.tokekar]
bq. I
Hi Zein,
On Mon, Mar 9, 2015 at 4:53 PM, dev-digest-h...@nutch.apache.org wrote:
I am using Nutch 2.3 and faced a problem with some Arabic content sites
this url displays the title by a tag in the body
and getTitle code will stop after /head and consider that there is no
title
I thought
[
https://issues.apache.org/jira/browse/NUTCH-1948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355926#comment-14355926
]
Lewis John McGibbney commented on NUTCH-1948:
-
bq. Would something like
[
https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355931#comment-14355931
]
Lewis John McGibbney commented on NUTCH-1936:
-
[~petr.shypila], you know can
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The CommandLineOptions page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/CommandLineOptions?action=diff&rev1=59&rev2=60
||[[bin/nutch nutchserver]]||run a (local)