[
https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157831#comment-15157831
]
Sebastian Nagel commented on NUTCH-2220:
0 / +1
Since this breaks existing crawl c
[
https://issues.apache.org/jira/browse/NUTCH-2221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157816#comment-15157816
]
Sebastian Nagel commented on NUTCH-2221:
+1
Just to consider: the additional argum
[
https://issues.apache.org/jira/browse/NUTCH-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1515#comment-1515
]
Sebastian Nagel commented on NUTCH-2216:
* this was the case before, but shouldn't
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2228:
---
Attachment: NUTCH-2228.patch
> index-replace unit test fails
> -
>
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2228:
---
Patch Info: Patch Available
> index-replace unit test fails
> -
>
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157655#comment-15157655
]
Sebastian Nagel edited comment on NUTCH-2228 at 2/22/16 8:38 PM:
---
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157655#comment-15157655
]
Sebastian Nagel commented on NUTCH-2228:
The name of the failing test "testInvalid
[
https://issues.apache.org/jira/browse/NUTCH-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157632#comment-15157632
]
Sebastian Nagel commented on NUTCH-2228:
That's only a problem if Nutch is built w
Markus Jelsma created NUTCH-2228:
Summary: index-replace unit test fails
Key: NUTCH-2228
URL: https://issues.apache.org/jira/browse/NUTCH-2228
Project: Nutch
Issue Type: Bug
Compone
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2227 stopped by Markus Jelsma.
> RegexParseFilter
>
>
> Key: NUTCH-2227
>
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2227:
-
Attachment: NUTCH-2227.patch
Updated patch, added negative test. Which works. Will commit sometime
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2227:
-
Attachment: NUTCH-2227.patch
Updated patch, build.xml was missing
> RegexParseFilter
> --
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2227:
-
Attachment: NUTCH-2227.patch
Patch for trunk! Tests pass.
> RegexParseFilter
>
>
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-2227 started by Markus Jelsma.
> RegexParseFilter
>
>
> Key: NUTCH-2227
>
[
https://issues.apache.org/jira/browse/NUTCH-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2227:
-
Description:
A parse filter that takes a regex and a field name. If regex matches via
matcher.fin
Markus Jelsma created NUTCH-2227:
Summary: RegexParseFilter
Key: NUTCH-2227
URL: https://issues.apache.org/jira/browse/NUTCH-2227
Project: Nutch
Issue Type: New Feature
Components:
[
https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157091#comment-15157091
]
Hudson commented on NUTCH-2219:
---
SUCCESS: Integrated in Nutch-trunk #3350 (See
[https://bui
[
https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2219:
-
Fix Version/s: 1.12
> Criteria order to be configurable in DeduplicationJob
>
[
https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2219:
-
Affects Version/s: 1.11
> Criteria order to be configurable in DeduplicationJob
>
[
https://issues.apache.org/jira/browse/NUTCH-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma resolved NUTCH-2219.
--
Resolution: Fixed
Committed to trunk in revision 1731651. Thanks Ron van der Vegt
> Criteria or
[
https://issues.apache.org/jira/browse/NUTCH-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157027#comment-15157027
]
Markus Jelsma commented on NUTCH-2226:
--
Hello - how is this related? Are you using tr
[
https://issues.apache.org/jira/browse/NUTCH-2220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156711#comment-15156711
]
Markus Jelsma commented on NUTCH-2220:
--
Any comments to this change, e.g. separate db
Can someone please put up a small howto somewhere? I need to know how to:
* check out trunk
* check out a specific tag
* do a svn up
* create a patch, e.g. svn diff
* perform a commit
Thanks,
Markus
-Original message-
> From:Mattmann, Chris A (3980)
> Sent: Sunday 21st February 2016 1
23 matches
Mail list logo