[jira] [Work started] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Chris A. Mattmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on NUTCH-2239 started by Chris A. Mattmann.

> Selenium Handlers for Ajax Patterns from Student submissions
> 
>
> Key: NUTCH-2239
> URL: https://issues.apache.org/jira/browse/NUTCH-2239
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher, protocol
> Reporter: Raghav Bharadwaj Jayasimha Rao
> Assignee: Chris A. Mattmann
> Labels: memex
> Fix For: 1.12
>
>
> - Refactor student submissions from the USC CSCI 572 class into a
> comprehensive set of Selenium handlers for various Ajax patterns.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Chris A. Mattmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann updated NUTCH-2239:
-
Labels: memex  (was: )

> Selenium Handlers for Ajax Patterns from Student submissions
> 
>
> Key: NUTCH-2239
> URL: https://issues.apache.org/jira/browse/NUTCH-2239
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher, protocol
> Reporter: Raghav Bharadwaj Jayasimha Rao
> Assignee: Chris A. Mattmann
> Labels: memex
> Fix For: 1.12
>
>
> - Refactor student submissions from the USC CSCI 572 class into a
> comprehensive set of Selenium handlers for various Ajax patterns.





[jira] [Updated] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Chris A. Mattmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann updated NUTCH-2239:
-
Component/s: protocol, fetcher

> Selenium Handlers for Ajax Patterns from Student submissions
> 
>
> Key: NUTCH-2239
> URL: https://issues.apache.org/jira/browse/NUTCH-2239
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher, protocol
> Reporter: Raghav Bharadwaj Jayasimha Rao
> Assignee: Chris A. Mattmann
> Labels: memex
> Fix For: 1.12
>
>
> - Refactor student submissions from the USC CSCI 572 class into a
> comprehensive set of Selenium handlers for various Ajax patterns.





[jira] [Assigned] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Chris A. Mattmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann reassigned NUTCH-2239:


Assignee: Chris A. Mattmann

> Selenium Handlers for Ajax Patterns from Student submissions
> 
>
> Key: NUTCH-2239
> URL: https://issues.apache.org/jira/browse/NUTCH-2239
> Project: Nutch
> Issue Type: Improvement
> Reporter: Raghav Bharadwaj Jayasimha Rao
> Assignee: Chris A. Mattmann
> Fix For: 1.12
>
>
> - Refactor student submissions from the USC CSCI 572 class into a
> comprehensive set of Selenium handlers for various Ajax patterns.





[jira] [Updated] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Chris A. Mattmann (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann updated NUTCH-2239:
-
Fix Version/s: 1.12

> Selenium Handlers for Ajax Patterns from Student submissions
> 
>
> Key: NUTCH-2239
> URL: https://issues.apache.org/jira/browse/NUTCH-2239
> Project: Nutch
> Issue Type: Improvement
> Reporter: Raghav Bharadwaj Jayasimha Rao
> Assignee: Chris A. Mattmann
> Fix For: 1.12
>
>
> - Refactor student submissions from the USC CSCI 572 class into a
> comprehensive set of Selenium handlers for various Ajax patterns.





[jira] [Created] (NUTCH-2239) Selenium Handlers for Ajax Patterns from Student submissions

2016-03-13 Thread Raghav Bharadwaj Jayasimha Rao (JIRA)
Raghav Bharadwaj Jayasimha Rao created NUTCH-2239:
-

Summary: Selenium Handlers for Ajax Patterns from Student submissions
Key: NUTCH-2239
URL: https://issues.apache.org/jira/browse/NUTCH-2239
Project: Nutch
Issue Type: Improvement
Reporter: Raghav Bharadwaj Jayasimha Rao


- Refactor student submissions from the USC CSCI 572 class into a
comprehensive set of Selenium handlers for various Ajax patterns.





[jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events

2016-03-13 Thread Chris A. Mattmann (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15192488#comment-15192488
 ] 

Chris A. Mattmann commented on NUTCH-2132:
--

Example of this in action in MEMEX-Explorer: 
https://github.com/memex-explorer/nutch-python/pull/15
Another example in MEMEX-Explorer: 
https://github.com/memex-explorer/memex-explorer/pull/720#issuecomment-150004911

> Publisher/Subscriber model for Nutch to emit events 
> 
>
> Key: NUTCH-2132
> URL: https://issues.apache.org/jira/browse/NUTCH-2132
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher, REST_api
> Reporter: Sujen Shah
> Assignee: Chris A. Mattmann
> Labels: memex
> Fix For: 1.12
>
> Attachments: NUTCH-2132.patch, NUTCH-2132.v2.patch, 
> PubSub_routingkey.patch
>
>
> It would be nice to have a Pub/Sub model in Nutch to emit certain events (e.g.,
> fetcher events such as fetch-start and fetch-end, or a fetch report containing
> data such as the outlinks of the currently fetched URL, its score, etc.).
> A consumer of this functionality could use this data to generate real-time
> visualizations and crawl statistics without having to wait for
> the fetch round to finish.
> The REST API could contain an endpoint that responds with a URL to
> which a client could subscribe to receive the fetcher events.
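
For readers unfamiliar with the pattern, the publish/subscribe idea described in the issue can be sketched in a few lines. This is only an illustrative, in-process sketch and not Nutch code or the attached patch: the EventBus class, the fetch() helper, the "fetch-report" topic name, and the event fields are all hypothetical. A real deployment would presumably sit behind a message broker and the REST endpoint mentioned above.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, List

Event = Dict[str, Any]
Handler = Callable[[Event], None]


class EventBus:
    """Hypothetical in-process publish/subscribe hub (not a Nutch class)."""

    def __init__(self) -> None:
        # Maps a topic name to the handlers subscribed to it.
        self._subscribers: Dict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Handler) -> None:
        """Register a handler to be called for every event on a topic."""
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, event: Event) -> int:
        """Deliver an event to all subscribers; return how many received it."""
        handlers = list(self._subscribers.get(topic, []))
        for handler in handlers:
            handler(event)
        return len(handlers)


def fetch(bus: EventBus, url: str, outlinks: List[str], score: float) -> None:
    """Stand-in for a fetcher: emits events around a (simulated) fetch."""
    bus.publish("fetch-start", {"url": url})
    # Hypothetical report payload; the field names are assumptions.
    bus.publish("fetch-report", {"url": url, "outlinks": outlinks, "score": score})
    bus.publish("fetch-end", {"url": url})


# A consumer that collects fetch reports as they arrive, without
# waiting for the whole fetch round to finish.
bus = EventBus()
reports: List[Event] = []
bus.subscribe("fetch-report", reports.append)
fetch(bus, "http://example.com/", ["http://example.com/a"], 1.0)
```

The publisher never learns who is listening, so a visualization or statistics consumer can be attached or removed without touching the fetcher itself.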


