[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-06-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13045524#comment-13045524 ] Lewis John McGibbney commented on NUTCH-623: Having checked branch-1.3 in

[jira] [Commented] (NUTCH-802) Problems managing outlinks with large url length

2011-06-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13052293#comment-13052293 ] Lewis John McGibbney commented on NUTCH-802: From recent user list

[jira] [Commented] (NUTCH-1000) Add option not to commit to Solr

2011-06-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054132#comment-13054132 ] Lewis John McGibbney commented on NUTCH-1000: - Hi Markus, I'm not on a work

[jira] [Updated] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-06-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1019: Summary: Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

[jira] [Created] (NUTCH-1019) Edit comment in org.apache.nutc.crawl.Crawl to reflect removal of legacy

2011-06-27 Thread Lewis John McGibbney (JIRA)
Edit comment in org.apache.nutc.crawl.Crawl to reflect removal of legacy Key: NUTCH-1019 URL: https://issues.apache.org/jira/browse/NUTCH-1019 Project: Nutch Issue

[jira] [Created] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-28 Thread Lewis John McGibbney (JIRA)
Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter - Key: NUTCH-1020 URL: https://issues.apache.org/jira/browse/NUTCH-1020 Project: Nutch Issue

[jira] [Commented] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13056344#comment-13056344 ] Lewis John McGibbney commented on NUTCH-1020: - I tagged this as linkdb (which

[jira] [Commented] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13056715#comment-13056715 ] Lewis John McGibbney commented on NUTCH-1019: - Yes I will do when I get home

[jira] [Created] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader

2011-06-28 Thread Lewis John McGibbney (JIRA)
Trivial error in error message for org.apache.nutch.crawl.LinkDbReader -- Key: NUTCH-1023 URL: https://issues.apache.org/jira/browse/NUTCH-1023 Project: Nutch Issue Type:

[jira] [Commented] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader

2011-06-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13056750#comment-13056750 ] Lewis John McGibbney commented on NUTCH-1023: - I will submitt a patch in a

[jira] [Commented] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-06-30 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13058075#comment-13058075 ] Lewis John McGibbney commented on NUTCH-1020: - I think you are correct Markus,

[jira] [Commented] (NUTCH-628) Host database to keep track of host-level information

2011-07-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13058993#comment-13058993 ] Lewis John McGibbney commented on NUTCH-628: From previous discussion on this

[jira] [Commented] (NUTCH-1043) Add pattern for filtering .js in default url filters

2011-07-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13064045#comment-13064045 ] Lewis John McGibbney commented on NUTCH-1043: - I think some discussion on this

[jira] [Commented] (NUTCH-1054) Make linkDB optional during indexing

2011-07-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066143#comment-13066143 ] Lewis John McGibbney commented on NUTCH-1054: - Just catching up on this one,

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066173#comment-13066173 ] Lewis John McGibbney commented on NUTCH-1048: - This affects more than one

[jira] [Resolved] (NUTCH-916) Project Naming And Descriptions

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-916. Resolution: Fixed Assignee: Lewis John McGibbney Fixed as per ASF

[jira] [Closed] (NUTCH-915) project website basics

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-915. -- project website basics -- Key: NUTCH-915

[jira] [Closed] (NUTCH-917) Website Navigation Links

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-917. -- Website Navigation Links Key: NUTCH-917

[jira] [Assigned] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1019: --- Assignee: Lewis John McGibbney Edit comment in org.apache.nutch.crawl.Crawl

[jira] [Updated] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1019: Attachment: crawl-comment.patch Patch to address the trivial task of

[jira] [Assigned] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-672: -- Assignee: Lewis John McGibbney allow unit tests to be run from bin/nutch

[jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066426#comment-13066426 ] Lewis John McGibbney commented on NUTCH-657: I have been unsuccessful in

[jira] [Created] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
upgrade package.html file in language identifier plugin --- Key: NUTCH-1055 URL: https://issues.apache.org/jira/browse/NUTCH-1055 Project: Nutch Issue Type: Improvement

[jira] [Updated] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1055: Attachment: NUTCH-1055-package-html.patch patch attached to update relative URL

[jira] [Updated] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1055: Attachment: europarl.ps the attached document is referred to in package.html and

[jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066467#comment-13066467 ] Lewis John McGibbney commented on NUTCH-657: I opened a separate issue for the

[jira] [Created] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-07-16 Thread Lewis John McGibbney (JIRA)
Write a new plugin example for inclusion on the wiki Key: NUTCH-1056 URL: https://issues.apache.org/jira/browse/NUTCH-1056 Project: Nutch Issue Type: Task Components:

[jira] [Closed] (NUTCH-16) boost documents matching a url pattern

2011-07-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-16?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-16. - Resolution: Won't Fix Assignee: Lewis John McGibbney (was: Dennis Kubes) Agreed

[jira] [Commented] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy

2011-07-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066731#comment-13066731 ] Lewis John McGibbney commented on NUTCH-1019: - Committed at revision 1147712.

[jira] [Updated] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1059: Attachment: NUTCH-1059-remove-convdb.patch The patch simply removes both the

[jira] [Closed] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1059. --- Committed and closed at revision 1147813 Remove convdb command from /bin/nutch

[jira] [Resolved] (NUTCH-1059) Remove convdb command from /bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1059. - Resolution: Fixed Fix Version/s: (was: 2.0) Remove convdb command

[jira] [Resolved] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1055. - Resolution: Fixed upgrade package.html file in language identifier plugin

[jira] [Updated] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-672: --- Priority: Minor (was: Trivial) allow unit tests to be run from bin/nutch

[jira] [Closed] (NUTCH-1055) upgrade package.html file in language identifier plugin

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1055. --- Committed in revision 1147817 (trunk) Committed in revision 1147818 (branch-1.4)

[jira] [Commented] (NUTCH-1049) Add classes to bin/nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066912#comment-13066912 ] Lewis John McGibbney commented on NUTCH-1049: - I would be happy to add

[jira] [Resolved] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1020. - Resolution: Fixed Fix Version/s: (was: 2.0) Assignee: Lewis

[jira] [Closed] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1020. --- Fixed and committed as NUTCH-1059 Remove convdb command from /bin/nutch (lewismc)

[jira] [Commented] (NUTCH-881) Good quality documentation for Nutch

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066919#comment-13066919 ] Lewis John McGibbney commented on NUTCH-881: What is the current state of this

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066927#comment-13066927 ] Lewis John McGibbney commented on NUTCH-865: My feelings are that this could

[jira] [Commented] (NUTCH-910) Cached.jsp has a bug with encoding

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066928#comment-13066928 ] Lewis John McGibbney commented on NUTCH-910: Mmmm... can we mark this as won't

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067139#comment-13067139 ] Lewis John McGibbney commented on NUTCH-1048: - Committed at revision 1147969.

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067217#comment-13067217 ] Lewis John McGibbney commented on NUTCH-920: A new file should be created as

[jira] [Assigned] (NUTCH-920) Project Metadata

2011-07-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-920: -- Assignee: Lewis John McGibbney Project Metadata

[jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067883#comment-13067883 ] Lewis John McGibbney commented on NUTCH-1048: - Thanks for this Julien

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067893#comment-13067893 ] Lewis John McGibbney commented on NUTCH-865: I'm happy to have a crack at

[jira] [Commented] (NUTCH-865) Format source code in unique style

2011-07-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067950#comment-13067950 ] Lewis John McGibbney commented on NUTCH-865: agreed :0) Format source code in

[jira] [Updated] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-920: --- Attachment: doap_Apache_Nutch.rdf DOAP attachment. It does not contain any of the

[jira] [Commented] (NUTCH-919) Logos and Graphics

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068995#comment-13068995 ] Lewis John McGibbney commented on NUTCH-919: So it looks like a new image

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069056#comment-13069056 ] Lewis John McGibbney commented on NUTCH-920: Committed @ revision 1149263.

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069156#comment-13069156 ] Lewis John McGibbney commented on NUTCH-920: yes Julien I'll get it committed

[jira] [Updated] (NUTCH-920) Project Metadata

2011-07-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-920: --- Attachment: doap_Nutch_trunk.rdf DOAP file for Nutch 2.0 (trunk). Release date has

[jira] [Commented] (NUTCH-919) Logos and Graphics

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069453#comment-13069453 ] Lewis John McGibbney commented on NUTCH-919: sorted and committed @ revision

[jira] [Commented] (NUTCH-920) Project Metadata

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069462#comment-13069462 ] Lewis John McGibbney commented on NUTCH-920: If deemed suitable I could commit

[jira] [Created] (NUTCH-1065) New mvn.template

2011-07-22 Thread Lewis John McGibbney (JIRA)
New mvn.template Key: NUTCH-1065 URL: https://issues.apache.org/jira/browse/NUTCH-1065 Project: Nutch Issue Type: Task Components: build Affects Versions: 1.4, 2.0 Reporter: Lewis John McGibbney

[jira] [Created] (NUTCH-1066) trivial correction of

2011-07-22 Thread Lewis John McGibbney (JIRA)
trivial correction of -- Key: NUTCH-1066 URL: https://issues.apache.org/jira/browse/NUTCH-1066 Project: Nutch Issue Type: Task Components: documentation Affects Versions: 1.4, 2.0 Reporter:

[jira] [Updated] (NUTCH-1066) trivial correction of domain-urlfilter.txt

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1066: Summary: trivial correction of domain-urlfilter.txt (was: trivial correction of )

[jira] [Updated] (NUTCH-1066) trivial correction of

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1066: Attachment: NUTCH-1066-domain-urlfilter-trivial-branch.patch

[jira] [Resolved] (NUTCH-1066) trivial correction of domain-urlfilter.txt

2011-07-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1066. - Resolution: Fixed trivial correction of domain-urlfilter.txt

[jira] [Commented] (NUTCH-914) Implement Apache Project Branding Requirements

2011-07-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072417#comment-13072417 ] Lewis John McGibbney commented on NUTCH-914: How are we doing with this. As far

[jira] [Commented] (NUTCH-917) Website Navigation Links

2011-08-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13076174#comment-13076174 ] Lewis John McGibbney commented on NUTCH-917: Committed @ revision 1153108. I

[jira] [Commented] (NUTCH-208) http: proxy exception list:

2011-08-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13076212#comment-13076212 ] Lewis John McGibbney commented on NUTCH-208: This is an interesting (if

[jira] [Commented] (NUTCH-1049) Add classes to bin/nutch

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079308#comment-13079308 ] Lewis John McGibbney commented on NUTCH-1049: - I'm glad to see that there have

[jira] [Closed] (NUTCH-1065) New mvn.template

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1065. --- New mvn.template Key: NUTCH-1065

[jira] [Assigned] (NUTCH-1056) Write a new plugin example for inclusion on the wiki

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1056: --- Assignee: Lewis John McGibbney Write a new plugin example for inclusion on

[jira] [Commented] (NUTCH-431) Move plugin specific properties out of nutch-site.xml and into specific conf files for plugins

2011-08-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079318#comment-13079318 ] Lewis John McGibbney commented on NUTCH-431: Can this issue be closed and

[jira] [Commented] (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080443#comment-13080443 ] Lewis John McGibbney commented on NUTCH-666: Chris excuse my naivety but I am

[jira] [Updated] (NUTCH-1035) Tune Solr config for Nutch users

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1035: Attachment: solrconfig.xml Attached solrconfig.xml to get the ball rolling on this

[jira] [Commented] (NUTCH-717) Make Nutch Solr integration easier

2011-08-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080461#comment-13080461 ] Lewis John McGibbney commented on NUTCH-717: Are we to provide any support for

[jira] [Commented] (NUTCH-713) Config options for webgraph Scoring not documented

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080571#comment-13080571 ] Lewis John McGibbney commented on NUTCH-713: Is it deemed necessary to add

[jira] [Commented] (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080572#comment-13080572 ] Lewis John McGibbney commented on NUTCH-342: What is the current status with

[jira] [Assigned] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-208: -- Assignee: Lewis John McGibbney http: proxy exception list:

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Attachment: NUTCH-208-branch-1.4-20110807.patch Attached patch to be tested on branch

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Priority: Trivial (was: Minor) Patch Info: [Patch Available]

[jira] [Assigned] (NUTCH-881) Good quality documentation for Nutch

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-881: -- Assignee: Lewis John McGibbney Good quality documentation for Nutch

[jira] [Commented] (NUTCH-881) Good quality documentation for Nutch

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081639#comment-13081639 ] Lewis John McGibbney commented on NUTCH-881: In Nutch trunk we currently only

[jira] [Assigned] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-623: -- Assignee: Lewis John McGibbney Change plugin source directory

[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081642#comment-13081642 ] Lewis John McGibbney commented on NUTCH-623: On second thoughts, and taking

[jira] [Commented] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081677#comment-13081677 ] Lewis John McGibbney commented on NUTCH-623: If we wished to fix this, then it

[jira] [Commented] (NUTCH-463) Nutch powerpoint parser plugin fails to parse ppt with images

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081695#comment-13081695 ] Lewis John McGibbney commented on NUTCH-463: Can we close this issue? .ppt

[jira] [Commented] (NUTCH-978) [GSoC 2011] A Plugin for extracting certain element of a web page on html page parsing.

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081703#comment-13081703 ] Lewis John McGibbney commented on NUTCH-978: If there has been a plugin written

[jira] [Commented] (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081708#comment-13081708 ] Lewis John McGibbney commented on NUTCH-342: OK well I think that sets a

[jira] [Commented] (NUTCH-296) Image Search

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081714#comment-13081714 ] Lewis John McGibbney commented on NUTCH-296: The parsing and extraction of

[jira] [Commented] (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081731#comment-13081731 ] Lewis John McGibbney commented on NUTCH-849: I checked out the latest trunk 2.0

[jira] [Commented] (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081757#comment-13081757 ] Lewis John McGibbney commented on NUTCH-666: Thank you Dennis for confirming.

[jira] [Closed] (NUTCH-296) Image Search

2011-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-296. -- Resolution: Won't Fix Assignee: Lewis John McGibbney As there has been no

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Attachment: NUTCH-208-trunk-2.0-20110810.patch Patch attached for trunk 2.0. I am

[jira] [Commented] (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082292#comment-13082292 ] Lewis John McGibbney commented on NUTCH-258: When I was viewing

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Attachment: NUTCH-208-trunk-2.0-20110810-v2.patch new patch for trunk 2.0.

[jira] [Created] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-08-10 Thread Lewis John McGibbney (JIRA)
Upgrade all instances of commons logging to slf4j (with log4j backend) -- Key: NUTCH-1078 URL: https://issues.apache.org/jira/browse/NUTCH-1078 Project: Nutch Issue Type:

[jira] [Commented] (NUTCH-296) Image Search

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082388#comment-13082388 ] Lewis John McGibbney commented on NUTCH-296: Hi Simão, any chance we could

[jira] [Updated] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-623: --- Attachment: NUTCH-623-branch-1.4-20110810.patch This patch for branch-1.4 simply

[jira] [Updated] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-623: --- Attachment: NUTCH-623-branch-1.4-20110810.patch patch for trunk. Both of the above

[jira] [Reopened] (NUTCH-296) Image Search

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-296: This issue is back open... The code developed was for integration on nutchwax. The

[jira] [Commented] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082558#comment-13082558 ] Lewis John McGibbney commented on NUTCH-672: OK having tried to get this

[jira] [Commented] (NUTCH-1075) Delegate language identification to Tika

2011-08-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082595#comment-13082595 ] Lewis John McGibbney commented on NUTCH-1075: - Hi Julien, Would it be

[jira] [Assigned] (NUTCH-914) Implement Apache Project Branding Requirements

2011-08-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-914: -- Assignee: Lewis John McGibbney Implement Apache Project Branding Requirements

[jira] [Closed] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-08-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-623. -- Change plugin source directory languageidentifier to language-identifier

[jira] [Updated] (NUTCH-987) Support HTTP auth for Solr communication

2011-08-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-987: --- Comment: was deleted (was: Hi Markus, the patch for 2.0 does not apply cleanly for me

[jira] [Commented] (NUTCH-1004) Do not index empty values for title field

2011-08-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13085055#comment-13085055 ] Lewis John McGibbney commented on NUTCH-1004: - no objections from me Markus.

  1   2   3   4   5   6   7   8   9   10   >