[jira] [Updated] (NUTCH-1414) Date extraction parse filter

2014-04-10 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1414: - Patch Info: Patch Available Fix Version/s: (was: 1.9) Assignee: (was:

[jira] [Updated] (NUTCH-1422) reset signature for redirects

2014-04-10 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1422: - Priority: Critical (was: Major) reset signature for redirects -

[jira] [Updated] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents

2014-04-10 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1732: - Priority: Critical (was: Major) IndexerMapReduce to delete explicitly not indexable documents

[jira] [Updated] (NUTCH-1694) Consider removing URL filter attribute warnings.

2014-04-10 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1694: - Priority: Minor (was: Major) Consider removing URL filter attribute warnings.

[jira] [Updated] (NUTCH-1748) urlfilter-validator to allow .. (two dots) inside file names (path elements)

2014-04-10 Thread Sertac TURKEL (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sertac TURKEL updated NUTCH-1748: - Attachment: NUTCH-1748.patch Hi [~wastl-nagel], I prepared a patch file. Could you review my

[jira] [Updated] (NUTCH-1748) urlfilter-validator to allow .. (two dots) inside file names (path elements)

2014-04-10 Thread Sertac TURKEL (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sertac TURKEL updated NUTCH-1748: - Attachment: (was: NUTCH-1748.patch) urlfilter-validator to allow .. (two dots) inside file

[jira] [Updated] (NUTCH-1748) urlfilter-validator to allow .. (two dots) inside file names (path elements)

2014-04-10 Thread Sertac TURKEL (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sertac TURKEL updated NUTCH-1748: - Attachment: NUTCH-1748.patch urlfilter-validator to allow .. (two dots) inside file names (path

Re: Pushing content to Solr from Nutch

2014-04-10 Thread Julien Nioche
Hi Xavier Your config file looks a bit outdated. Here are the values set by default (see http://svn.apache.org/repos/asf/nutch/trunk/conf/nutch-default.xml) property nameplugin.includes/name

Re: Pushing content to Solr from Nutch

2014-04-10 Thread Xavier Morera
Wait, ignore my last email. The issue is on the solr side! On Thu, Apr 10, 2014 at 1:13 PM, Xavier Morera xav...@familiamorera.comwrote: Thanks Julien and Sebastian. Tried that and got the exception below. Is there a way of knowing more in detail what is the exception so that I can continue