[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce resolved NUTCH-2155.
--
Resolution: Fixed
Latest patch committed in r1713885
> Create a "crawl completeness" utility
>
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000658#comment-15000658
]
Lewis John McGibbney commented on NUTCH-2165:
-
It means that the remaining data is not dumped.
[
https://issues.apache.org/jira/browse/NUTCH-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce resolved NUTCH-2150.
--
Resolution: Fixed
Resolved in r1713892
> Add ProtocolStatus Utility
>
[
https://issues.apache.org/jira/browse/NUTCH-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000787#comment-15000787
]
Hudson commented on NUTCH-2150:
---
SUCCESS: Integrated in Nutch-trunk #3305 (See
[
https://issues.apache.org/jira/browse/NUTCH-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000788#comment-15000788
]
Hudson commented on NUTCH-1911:
---
SUCCESS: Integrated in Nutch-trunk #3305 (See
[
https://issues.apache.org/jira/browse/NUTCH-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000841#comment-15000841
]
Michael Joyce commented on NUTCH-2167:
--
Hi folks,
All looks good and tests run fine after moving
[
https://issues.apache.org/jira/browse/NUTCH-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2167 started by Michael Joyce.
> Backport TableUtil from 2.x for URL reversing
>
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000667#comment-15000667
]
Hudson commented on NUTCH-2155:
---
SUCCESS: Integrated in Nutch-trunk #3304 (See
[
https://issues.apache.org/jira/browse/NUTCH-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-1911 started by Michael Joyce.
> Improve DomainStatistics tool command line parsing
>
[
https://issues.apache.org/jira/browse/NUTCH-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce resolved NUTCH-1911.
--
Resolution: Fixed
Resolved in r1713890
> Improve DomainStatistics tool command line parsing
>
[
https://issues.apache.org/jira/browse/NUTCH-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2150 started by Michael Joyce.
> Add ProtocolStatus Utility
> --
>
>
[
https://issues.apache.org/jira/browse/NUTCH-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2155 started by Michael Joyce.
> Create a "crawl completeness" utility
> -
Michael Joyce created NUTCH-2166:
Summary: Add reverse URL format to dump tool
Key: NUTCH-2166
URL: https://issues.apache.org/jira/browse/NUTCH-2166
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-2166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2166 started by Michael Joyce.
> Add reverse URL format to dump tool
> ---
>
>
Michael Joyce created NUTCH-2165:
Summary: FileDumper Util hard codes part-# folder name
Key: NUTCH-2165
URL: https://issues.apache.org/jira/browse/NUTCH-2165
Project: Nutch
Issue Type: Bug
Hi folks,
It seems like our usual workflow is to update CHANGES on commit (correct me
if I'm wrong here). What do we think about pulling the CHANGES updates from
JIRA as part of our release prep instead? Seems like it would be a bit less
error prone, although I do understand people's desires to
Michael Joyce created NUTCH-2167:
Summary: Backport TableUtil from 2.x for URL reversing
Key: NUTCH-2167
URL: https://issues.apache.org/jira/browse/NUTCH-2167
Project: Nutch
Issue Type:
Mike, I honestly prefer just having it as a text file. If you search
way back in the logs Doug talked about this long ago, but I generally
agree. JIRA would be nice but I just like to keep it up to date in text
and in JIRA.
Sorry for the dupe work but it pays off.
[
https://issues.apache.org/jira/browse/NUTCH-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000912#comment-15000912
]
Lewis John McGibbney commented on NUTCH-2167:
-
Yes, an example of this being useful is within
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2165 started by Michael Joyce.
> FileDumper Util hard codes part-# folder name
>
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce reassigned NUTCH-2165:
Assignee: Michael Joyce
> FileDumper Util hard codes part-# folder name
>
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000910#comment-15000910
]
Michael Joyce commented on NUTCH-2165:
--
Oh aye
> FileDumper Util hard codes part-# folder name
>
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Joyce updated NUTCH-2165:
-
Attachment: NUTCH-2165_joyce_11Nov2015.patch
Patch attached
> FileDumper Util hard codes part-#
[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000923#comment-15000923
]
Michael Joyce commented on NUTCH-2165:
--
Note, the diff looks massive here. This is really just adding
[
https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2120:
Issue Type: Task (was: Bug)
> Remove MapWritable from trunk codebase
>
[
https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2120:
Flags: Patch
Patch Info: Patch Available
> Remove MapWritable from trunk
[
https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15001105#comment-15001105
]
Lewis John McGibbney commented on NUTCH-2160:
-
Will commit by EoB today unless there are
[
https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2120:
Attachment: NUTCH-2120.patch
Patch which removes this class from Trunk.
> Remove