[jira] [Updated] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2120: - Fix Version/s: (was: 1.11) 1.12 > Remove MapWritable from trunk cod

[jira] [Updated] (NUTCH-2139) Basic plugin to index inlinks and outlinks

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2139: - Fix Version/s: (was: 1.11) 1.12 > Basic plugin to index inlinks and

[jira] [Updated] (NUTCH-2122) Implement Javadoc package.html for service packages

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2122: - Fix Version/s: (was: 1.11) 1.12 > Implement Javadoc package.html fo

[jira] [Updated] (NUTCH-2135) Ant Eclipse build does not include protocol-interactiveselenium

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2135: - Fix Version/s: (was: 1.11) 1.12 > Ant Eclipse build does not includ

[jira] [Updated] (NUTCH-2128) Refactor configuration end point

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2128: - Fix Version/s: (was: 1.11) 1.12 > Refactor configuration end point

[jira] [Updated] (NUTCH-1943) Form authentication should not be global and ignore

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-1943: - Fix Version/s: (was: 1.11) 1.12 > Form authentication should not be

[jira] [Updated] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2064: - Fix Version/s: (was: 1.11) 1.12 > URLNormalizer basic to properly e

[jira] [Resolved] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2141. -- Resolution: Fixed Fix Version/s: 1.11 Thanks [~BalaJira] [~jo...@apache.org] plen

[jira] [Work started] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2141 started by Chris A. Mattmann. > Change the InteractiveSelenium plugin handler Interface to return page

[jira] [Resolved] (NUTCH-2129) Track Protocol Status in Crawl Datum

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2129. -- Resolution: Fixed Thanks [~jo...@apache.org]! {noformat} [chipotle:~/tmp/nutch1.11] mat

[jira] [Assigned] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2141: Assignee: Chris A. Mattmann > Change the InteractiveSelenium plugin handler Interfa

[jira] [Work started] (NUTCH-2129) Track Protocol Status in Crawl Datum

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2129 started by Chris A. Mattmann. > Track Protocol Status in Crawl Datum >

[jira] [Resolved] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2142. -- Resolution: Fixed Thanks [~karanjeets]! {noformat} [chipotle:~/tmp/nutch1.11] mattmann%

[jira] [Assigned] (NUTCH-2129) Track Protocol Status in Crawl Datum

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2129: Assignee: Chris A. Mattmann > Track Protocol Status in Crawl Datum > --

[jira] [Work started] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2142 started by Chris A. Mattmann. > Nutch File Dump - FileNotFoundException (Invalid Argument) Error >

[jira] [Assigned] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error

2015-10-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2142: Assignee: Chris A. Mattmann > Nutch File Dump - FileNotFoundException (Invalid Argu

[jira] [Commented] (NUTCH-2136) Implement a different version of Naive Bayes Parse Filter

2015-10-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953367#comment-14953367 ] Chris A. Mattmann commented on NUTCH-2136: -- [~asitang]: 1. ALv2 headers missing

[jira] [Commented] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium"

2015-10-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14951409#comment-14951409 ] Chris A. Mattmann commented on NUTCH-2110: -- Great so can you link this to those i

[jira] [Commented] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium"

2015-10-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14951384#comment-14951384 ] Chris A. Mattmann commented on NUTCH-2110: -- Asitang where are we on this? > Crea

[jira] [Commented] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14944426#comment-14944426 ] Chris A. Mattmann commented on NUTCH-2108: -- see my comments on Github, please +1

[jira] [Resolved] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2123. -- Resolution: Fixed - fixed thanks guys! {noformat} [chipotle:~/tmp/nutch1.11] mattmann% s

[jira] [Assigned] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2123: Assignee: Chris A. Mattmann > Seed List REST API returns Text but headers indicate/

[jira] [Work started] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2123 started by Chris A. Mattmann. > Seed List REST API returns Text but headers indicate/require JSON > ---

[jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14943880#comment-14943880 ] Chris A. Mattmann commented on NUTCH-2132: -- Hey Julien, yeah to be honest we thou

[jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14943786#comment-14943786 ] Chris A. Mattmann commented on NUTCH-2132: -- True Julien, but that locks us into u

[jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14943658#comment-14943658 ] Chris A. Mattmann commented on NUTCH-2132: -- Great comments, Seb, agree. > Publis

[jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14943625#comment-14943625 ] Chris A. Mattmann commented on NUTCH-2132: -- Right now here are a few comments: h

[jira] [Comment Edited] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2015-10-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14943625#comment-14943625 ] Chris A. Mattmann edited comment on NUTCH-2132 at 10/5/15 4:42 PM: -

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14909303#comment-14909303 ] Chris A. Mattmann commented on NUTCH-2086: -- I have reviewed both patches. They lo

[jira] [Resolved] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2104. -- Resolution: Fixed Looks great, thanks Kim! {noformat} [chipotle:~/tmp/nutch1.11] mattma

[jira] [Updated] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2104: - Component/s: protocol > Add documentation to the protocol-selenium plugin Readme file re:

[jira] [Assigned] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2104: Assignee: Chris A. Mattmann > Add documentation to the protocol-selenium plugin Rea

[jira] [Updated] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2104: - Component/s: documentation > Add documentation to the protocol-selenium plugin Readme file

[jira] [Updated] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2104: - Fix Version/s: 1.11 > Add documentation to the protocol-selenium plugin Readme file re: se

[jira] [Work started] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2104 started by Chris A. Mattmann. > Add documentation to the protocol-selenium plugin Readme file re: selen

[jira] [Updated] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2104: - Labels: memex (was: ) > Add documentation to the protocol-selenium plugin Readme file re:

[jira] [Resolved] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2094. -- Resolution: Fixed I committed this to 2.x branch but Github auto closing integration isn

[jira] [Reopened] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reopened NUTCH-2094: -- > Stopping and Restarting a crawl has issues in the Web UI > ---

[jira] [Updated] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2094: - Fix Version/s: 2.4 > Stopping and Restarting a crawl has issues in the Web UI > --

[jira] [Work started] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2094 started by Chris A. Mattmann. > Stopping and Restarting a crawl has issues in the Web UI >

[jira] [Updated] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2094: - Component/s: web gui > Stopping and Restarting a crawl has issues in the Web UI >

[jira] [Updated] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2094: - Summary: Stopping and Restarting a crawl has issues in the Web UI (was: When stopping a c

[jira] [Commented] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again.

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876885#comment-14876885 ] Chris A. Mattmann commented on NUTCH-2094: -- Lewis, doesn't look like it. prernasa

[jira] [Resolved] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2099. -- Resolution: Fixed Thanks Sujen and Lewis! {noformat} [chipotle:~/tmp/nutch1.11] mattman

[jira] [Work started] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2099 started by Chris A. Mattmann. > Refactoring the REST endpoints for integration with webui > ---

[jira] [Assigned] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2099: Assignee: Chris A. Mattmann > Refactoring the REST endpoints for integration with w

[jira] [Updated] (NUTCH-2091) Increase robustness and crawling versatility of Nutch for the Deep Web

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2091: - Summary: Increase robustness and crawling versatility of Nutch for the Deep Web (was: Mak

[jira] [Commented] (NUTCH-2091) Make Nutch more robust and smart

2015-09-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14875894#comment-14875894 ] Chris A. Mattmann commented on NUTCH-2091: -- This is a fantastic summary of many o

[jira] [Commented] (NUTCH-2011) Endpoint to support realtime JSON output from the fetcher

2015-09-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14805010#comment-14805010 ] Chris A. Mattmann commented on NUTCH-2011: -- [~sujenshah] [~asitang] > Endpoint t

[jira] [Resolved] (NUTCH-2098) Add null SeedUrl constructor

2015-09-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2098. -- Resolution: Fixed Thanks [~ahmadia] fixed in trunk! {noformat} [mattmann-0420740:~/tmp/

[jira] [Assigned] (NUTCH-2098) Add null SeedUrl constructor

2015-09-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2098: Assignee: Chris A. Mattmann > Add null SeedUrl constructor > --

[jira] [Work started] (NUTCH-2098) Add null SeedUrl constructor

2015-09-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2098 started by Chris A. Mattmann. > Add null SeedUrl constructor > > >

[jira] [Commented] (NUTCH-2100) Nutch dump command doesnt dump anything

2015-09-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14746141#comment-14746141 ] Chris A. Mattmann commented on NUTCH-2100: -- Kim I think that the directory expect

[jira] [Assigned] (NUTCH-2100) Nutch dump command doesnt dump anything

2015-09-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2100: Assignee: Chris A. Mattmann > Nutch dump command doesnt dump anything > --

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-14 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743765#comment-14743765 ] Chris A. Mattmann commented on NUTCH-2086: -- yep: https://github.com/apache/nutch/

[jira] [Resolved] (NUTCH-2092) Unit Test for NutchServer

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2092. -- Resolution: Fixed thanks Sujen! {noformat} [chipotle:~/tmp/nutch1.11] mattmann% svn com

[jira] [Commented] (NUTCH-2092) Unit Test for NutchServer

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14742149#comment-14742149 ] Chris A. Mattmann commented on NUTCH-2092: -- All tests pass. {noformat} deploy:

[jira] [Commented] (NUTCH-2092) Unit Test for NutchServer

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14742151#comment-14742151 ] Chris A. Mattmann commented on NUTCH-2092: -- commiting > Unit Test for NutchServe

[jira] [Assigned] (NUTCH-2092) Unit Test for NutchServer

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2092: Assignee: Chris A. Mattmann > Unit Test for NutchServer > -

[jira] [Work started] (NUTCH-2092) Unit Test for NutchServer

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2092 started by Chris A. Mattmann. > Unit Test for NutchServer > - > >

[jira] [Resolved] (NUTCH-2090) Refactor Seed Resource in REST API

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2090. -- Resolution: Fixed fixed! > Refactor Seed Resource in REST API > ---

[jira] [Resolved] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2096. -- Resolution: Fixed Committed, thanks Kim! {noformat} [chipotle:~/tmp/nutch1.11] mattmann

[jira] [Commented] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14742131#comment-14742131 ] Chris A. Mattmann commented on NUTCH-2096: -- all tests pass, commiting now. {nofo

[jira] [Commented] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14742115#comment-14742115 ] Chris A. Mattmann commented on NUTCH-2096: -- PR is here: https://github.com/apache

[jira] [Updated] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2096: - Labels: memex (was: ) > Explicitly indicate broswer binary to use when selecting selenium

[jira] [Commented] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14742113#comment-14742113 ] Chris A. Mattmann commented on NUTCH-2096: -- I added you to the contributors group

[jira] [Assigned] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2096: Assignee: Chris A. Mattmann > Explicitly indicate broswer binary to use when select

[jira] [Work started] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config

2015-09-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2096 started by Chris A. Mattmann. > Explicitly indicate broswer binary to use when selecting selenium remot

[jira] [Commented] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again.

2015-09-11 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14741901#comment-14741901 ] Chris A. Mattmann commented on NUTCH-2094: -- no problem just switch to branch-2.3

[jira] [Commented] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again.

2015-09-11 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14741292#comment-14741292 ] Chris A. Mattmann commented on NUTCH-2094: -- Hi [~prernasatija] would you be willi

[jira] [Work started] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again.

2015-09-11 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2094 started by Chris A. Mattmann. > When stopping a crawl in Nutch 2.3, I was having trouble when I start a

[jira] [Reopened] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again.

2015-09-11 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reopened NUTCH-2094: -- Assignee: Chris A. Mattmann > When stopping a crawl in Nutch 2.3, I was having trouble

[jira] [Work started] (NUTCH-2090) Refactor Seed Resource in REST API

2015-09-06 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2090 started by Chris A. Mattmann. > Refactor Seed Resource in REST API > --

[jira] [Assigned] (NUTCH-2090) Refactor Seed Resource in REST API

2015-09-06 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2090: Assignee: Chris A. Mattmann > Refactor Seed Resource in REST API >

[jira] [Work stopped] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing.

2015-08-30 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-978 stopped by Chris A. Mattmann. --- > A Plugin for extracting certain element of a web page on html page parsing

[jira] [Work started] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing.

2015-08-30 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-978 started by Chris A. Mattmann. --- > A Plugin for extracting certain element of a web page on html page parsing

[jira] [Resolved] (NUTCH-2088) Add Optional Execution to Interactive Selenium Handlers

2015-08-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2088. -- Resolution: Fixed - thanks [~mjoyce]! {noformat} [guest-wireless-207-151-035-079:~/tmp/

[jira] [Work started] (NUTCH-2088) Add Optional Execution to Interactive Selenium Handlers

2015-08-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2088 started by Chris A. Mattmann. > Add Optional Execution to Interactive Selenium Handlers > -

[jira] [Updated] (NUTCH-2088) Add Optional Execution to Interactive Selenium Handlers

2015-08-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2088: - Labels: memex (was: ) > Add Optional Execution to Interactive Selenium Handlers > ---

[jira] [Assigned] (NUTCH-2088) Add Optional Execution to Interactive Selenium Handlers

2015-08-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2088: Assignee: Chris A. Mattmann > Add Optional Execution to Interactive Selenium Handle

[jira] [Updated] (NUTCH-2088) Add Optional Execution to Interactive Selenium Handlers

2015-08-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2088: - Fix Version/s: 1.11 > Add Optional Execution to Interactive Selenium Handlers > --

[jira] [Assigned] (NUTCH-2086) Nutch 1.X Webui

2015-08-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-2086: Assignee: Chris A. Mattmann > Nutch 1.X Webui > > >

[jira] [Work started] (NUTCH-2086) Nutch 1.X Webui

2015-08-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2086 started by Chris A. Mattmann. > Nutch 1.X Webui > > > Key: NUTCH-2086

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-21 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706884#comment-14706884 ] Chris A. Mattmann commented on NUTCH-2049: -- +1 to commit this. Great work team.

[jira] [Commented] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X

2015-08-21 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706883#comment-14706883 ] Chris A. Mattmann commented on NUTCH-1936: -- +1 > GSoC 2015 - Move Nutch to Hadoo

[jira] [Commented] (NUTCH-2081) outseq and vectors directories pollute $NUTCH_HOME

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702314#comment-14702314 ] Chris A. Mattmann commented on NUTCH-2081: -- I would suggest: model/bayes/

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701497#comment-14701497 ] Chris A. Mattmann commented on NUTCH-2049: -- Asitang, if you recall, we discussed

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701315#comment-14701315 ] Chris A. Mattmann commented on NUTCH-2049: -- Great, thanks Lewis. The introduction

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14700551#comment-14700551 ] Chris A. Mattmann commented on NUTCH-2049: -- Thanks Lewis. [~asitang] please creat

[jira] [Resolved] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins

2015-08-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2059. -- Resolution: Fixed OK [~pet...@knowledgesite.com] I went ahead and committed your fixes,

[jira] [Work started] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins

2015-08-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2059 started by Chris A. Mattmann. > protocol-httpclient, protocol-http unit test errors on Jenkins > --

[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins

2015-08-03 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14652673#comment-14652673 ] Chris A. Mattmann commented on NUTCH-2059: -- [~PeterCiuffetti] > protocol-httpcli

[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14651315#comment-14651315 ] Chris A. Mattmann commented on NUTCH-2059: -- we have a failed build - https://bui

[jira] [Resolved] (NUTCH-2066) Parameterize Generate REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2066. -- Resolution: Fixed Committed to trunk: {noformat} [chipotle:~/tmp/nutch-trunk] mattmann%

[jira] [Commented] (NUTCH-2066) Parameterize Generate REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14651297#comment-14651297 ] Chris A. Mattmann commented on NUTCH-2066: -- All tests pass: {noformat} test:

[jira] [Updated] (NUTCH-2066) Parameterize Generate REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2066: - Description: Allow user to specify crawldb and segment db in the Generate Job REST endpoin

[jira] [Work started] (NUTCH-2066) Allow user to specify crawldb and segment db in the Generate JOb REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2066 started by Chris A. Mattmann. > Allow user to specify crawldb and segment db in the Generate JOb REST

[jira] [Updated] (NUTCH-2066) Allow user to specify crawldb and segment db in the Generate Job REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2066: - Labels: memex (was: ) > Allow user to specify crawldb and segment db in the Generate Job

[jira] [Updated] (NUTCH-2066) Parameterize Generate REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2066: - Summary: Parameterize Generate REST endpoint (was: Allow user to specify crawldb and segm

[jira] [Updated] (NUTCH-2066) Allow user to specify crawldb and segment db in the Generate Job REST endpoint

2015-08-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-2066: - Summary: Allow user to specify crawldb and segment db in the Generate Job REST endpoint

<    1   2   3   4   5   6   7   8   9   >