[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1113:
---
Attachment: (was: NUTCH-1113-trunk-junit-fail.patch)
> Merging segments causes URLs to va
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1113:
---
Attachment: NUTCH-1113-trunk-junit-fail.patch
Fixed also second problem in junit test: segmen
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1113:
---
Attachment: NUTCH-1113-trunk-junit-fail.patch
> Merging segments causes URLs to vanish from c
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Fix Version/s: (was: 1.9)
1.8
> Merging segments causes URLs to vanish fro
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-trunk-junit-final.patch
Final patch including the stuff mentioned by Sebas
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-trunk.patch
Includes STATUS_FETCH_NOTMODIFIED in the check. But are you su
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-junit.patch
Attached patch seems to completely fix the issue, finally!
* d
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1113:
---
Attachment: NUTCH-1113-junit.patch
* extended Junit test to fail if both linked and fetch dat
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-junit.patch
Slightly updated patch. I have no merged and indexed a large n
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-junit.patch
New patch! Previous patch had an error in the checks. With thi
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-junit.patch
New patch that actually works for Apache Nutch current trunk.
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Priority: Blocker (was: Major)
> Merging segments causes URLs to vanish from crawldb/index?
> --
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-junit.patch
Alright, manual testing did not go very well and it takes hour
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Attachment: NUTCH-1113-trunk.patch
Patch for trunk with Edward's fix. That fix at least solves a
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Fix Version/s: (was: 1.5)
1.6
20120304-push-1.6
> Merging
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Fix Version/s: (was: 1.4)
1.5
> Merging segments causes URLs to vanish
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1113:
-
Fix Version/s: 1.4
Thanks! It's marked for 1.4 now so it, at least, doesn't slip of the radar. Ca
[
https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Edward Drapkin updated NUTCH-1113:
--
Attachment: merged_segment_output.txt
unmerged_segment_output.txt
Output for se
18 matches
Mail list logo