IntelliJ & Eclipse Lucene code styles available

2007-05-22 Thread Otis Gospodnetic
Those using IntelliJ or Eclipse may want to grab code styles for Lucene (and Solr, Nutch, and Hadoop) that Grant and I put in https://issues.apache.org/jira/browse/SOLR-245 . I hope they are helpful. The plan is to stick them on the Wiki (and link from HowToContribute pages?). Otis . . .

[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters

2007-05-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498113 ] Doğacan Güney commented on NUTCH-489: Hmm.. Won't it now cause Nutch to filter on path on a line like this: -(jpg

[jira] Updated: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters

2007-05-22 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emmanuel Joke updated NUTCH-489: Attachment: SuffixURLFilter_v2.java.patch My mistake... I've added a new patch which is supposed to:

[jira] Commented: (NUTCH-25) needs 'character encoding' detector

2007-05-22 Thread Doug Cook (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-25?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498041 ] Doug Cook commented on NUTCH-25: Thanks! I'll take a look at your proposed patch... (that was fast! ask and ye shall r

[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implme

2007-05-22 Thread Vadim Bauer (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12497851 ] Vadim Bauer commented on NUTCH-427: There is an error in the plugin.xml file: the plugin id should be protocol-smb

[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch)

2007-05-22 Thread Marcin Okraszewski (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcin Okraszewski updated NUTCH-490: Attachment: nutch-extensionpoins_plugin.xml.diff Patch for plugin.xml in nutch-extensionpo

[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch)

2007-05-22 Thread Marcin Okraszewski (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcin Okraszewski updated NUTCH-490: Attachment: HtmlParser.java.diff Patch for HtmlParser. > Extension point with filters for

[jira] Created: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch)

2007-05-22 Thread Marcin Okraszewski (JIRA)
Extension point with filters for Neko HTML parser (with patch) -- Key: NUTCH-490 URL: https://issues.apache.org/jira/browse/NUTCH-490 Project: Nutch Issue Type: Improvement

[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters

2007-05-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12497770 ] Doğacan Güney commented on NUTCH-489: This is obviously useful but: * Your patches both in this issue and in NUT

[jira] Updated: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters

2007-05-22 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emmanuel Joke updated NUTCH-489: Attachment: SuffixURLFilter.java.patch suffix-urlfilter.txt.patch > URLFilter-suffix

[jira] Created: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters

2007-05-22 Thread Emmanuel Joke (JIRA)
URLFilter-suffix management of the url path when the url contains some query parameters --- Key: NUTCH-489 URL: https://issues.apache.org/jira/browse/NUTCH-489 Projec

[jira] Updated: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-05-22 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emmanuel Joke updated NUTCH-488: Attachment: nutch-default.xml.patch > Avoid parsing uneccessary links and get a more relevant outlin

[jira] Updated: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-05-22 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emmanuel Joke updated NUTCH-488: Attachment: DOMContentUtils.patch > Avoid parsing uneccessary links and get a more relevant outlink

[jira] Created: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-05-22 Thread Emmanuel Joke (JIRA)
Avoid parsing uneccessary links and get a more relevant outlink list Key: NUTCH-488 URL: https://issues.apache.org/jira/browse/NUTCH-488 Project: Nutch Issue Type: Improvem