[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339609#comment-14339609
]
Lewis John McGibbney commented on NUTCH-1946:
-
bq. Where can I see which tests
unsubscribe
unsubscribe
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339252#comment-14339252
]
Henry Saputra edited comment on NUTCH-1946 at 2/26/15 9:56 PM:
-
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339252#comment-14339252
]
Henry Saputra edited comment on NUTCH-1946 at 2/26/15 9:56 PM:
-
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339252#comment-14339252
]
Henry Saputra commented on NUTCH-1946:
--
Try to replicate the error stack but when I r
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339224#comment-14339224
]
Lewis John McGibbney commented on NUTCH-1946:
-
Grand
> Upgrade to Gora 0.6
>
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339203#comment-14339203
]
Henry Saputra commented on NUTCH-1946:
--
Ah thanks, it compiled now =)
> Upgrade to G
Massimo,
http://nutch.apache.org/mailing_lists.html
=> dev-unsubscr...@nutch.apache.org
Thanks
On 26 February 2015 at 19:11, Massimo Miccoli
wrote:
>
>
> Massimo
>
> > Il giorno 26/feb/2015, alle ore 19:31, lewi...@apache.org ha scritto:
> >
> > Author: lewismc
> > Date: Thu Feb 26 18:31:39 2
Ya. I know about that. But I just thought that because Parse_Data already
does that for us, I did not want to do tthe same processing again. I will
try to figure something out. Thanks a lot.
Regards,
Ami Parikh
(213)590-0005
On Thu, Feb 26, 2015 at 12:39 PM, Renxia Wang wrote:
> Not sure how yo
Not sure how you implement it so it is hard to tell. You may want to take a
look at the SegmentReader's get and getMapRecords methods. Those may give
you ideas. You can use SegmentReader.get directly to get the segment data
too. While it is slow as it slepp(5000) at every time you call it, so slow
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339119#comment-14339119
]
Lewis John McGibbney commented on NUTCH-1933:
-
Hi [~jorgelbg] thanks for notic
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339103#comment-14339103
]
Lewis John McGibbney commented on NUTCH-1946:
-
Hi [~hsaputra], can you try cle
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339080#comment-14339080
]
Jorge Luis Betancourt Gonzalez commented on NUTCH-1933:
---
I see a {{t
[
https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339053#comment-14339053
]
Henry Saputra commented on NUTCH-1946:
--
Tried to run ant in the 2.0 branch with your
I am using the MapFileReader to iterate through the file. And I read the
key into a Text object and the MetaData into a ParseData object. I get the
following exception:
Exception in thread "main" java.io.EOFException
at java.io.DataInputStream.readFully(DataInputStream.java:197)
at org.apache.hado
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338951#comment-14338951
]
Hudson commented on NUTCH-1933:
---
SUCCESS: Integrated in Nutch-trunk #2991 (See
[https://bui
Massimo
> Il giorno 26/feb/2015, alle ore 19:31, lewi...@apache.org ha scritto:
>
> Author: lewismc
> Date: Thu Feb 26 18:31:39 2015
> New Revision: 1662530
>
> URL: http://svn.apache.org/r1662530
> Log:
> NUTCH-1933 nutch-selenium plugin
>
> Added:
>nutch/trunk/src/plugin/lib-selenium/
>
Hi Ami,
What method of what class do you use to get the meta data? Please provide
more info about this, log etc.
Zhique
On Thu, Feb 26, 2015 at 10:53 AM, Ami Akshay Parikh
wrote:
> Hello,
>
> When I try to use the parse_data from the segment directory for getting
> the MetaData for finding nea
Hello,
When I try to use the parse_data from the segment directory for getting the
MetaData for finding near duplicates, My code runs into a EOFException. I
found something about a bug in nutch in the archives, but I wanted to know
if anyone else is facing this problem and how can I possibly resol
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney resolved NUTCH-1933.
-
Resolution: Fixed
Committed @revision 1662530 in trunk
> nutch-selenium plugin
>
[
https://issues.apache.org/jira/browse/NUTCH-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1933:
Assignee: Mohammad Al-Mohsin (was: Lewis John McGibbney)
> nutch-selenium plugin
>
[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338684#comment-14338684
]
Sebastian Nagel commented on NUTCH-1950:
Great! For a MD5 calculation, see o.a.had
[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338211#comment-14338211
]
Chong Li commented on NUTCH-1950:
-
I have thought about that and at first we just wanted e
[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338198#comment-14338198
]
Sebastian Nagel commented on NUTCH-1950:
Is it really a good idea to take the syst
26 matches
Mail list logo