[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017405#comment-13017405
]
Markus Jelsma commented on NUTCH-963:
-
Yes!
> Add support for deleting Solr documents
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017403#comment-13017403
]
Julien Nioche commented on NUTCH-963:
-
Shall we create a new issue to track the progres
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008469#comment-13008469
]
Markus Jelsma commented on NUTCH-963:
-
Committed for branch-1.3 in rev 1082944.
- new c
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008421#comment-13008421
]
Markus Jelsma commented on NUTCH-963:
-
Solr deduplication makes its own (fuzzy) hashes
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008410#comment-13008410
]
Julien Nioche commented on NUTCH-963:
-
Re-dedup on SOLR side : good point, although the
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008402#comment-13008402
]
Markus Jelsma commented on NUTCH-963:
-
Julien, shouldn't the deduplicate mechanism kept
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12988722#comment-12988722
]
Julien Nioche commented on NUTCH-963:
-
{quote}
@Julien: you mean to use the signature o
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987652#action_12987652
]
Claudio Martella commented on NUTCH-963:
there's a little problem in where you put t
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987617#action_12987617
]
Claudio Martella commented on NUTCH-963:
@Markus: about the commit, i did also consi
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987574#action_12987574
]
Julien Nioche commented on NUTCH-963:
-
It would be nice to couple that with the deduplic
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987559#action_12987559
]
Markus Jelsma commented on NUTCH-963:
-
The class works fine although i did add a commit
[
https://issues.apache.org/jira/browse/NUTCH-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987132#action_12987132
]
Markus Jelsma commented on NUTCH-963:
-
Thanks Claudio. I'll fix the formatting and add a
12 matches
Mail list logo