[Nutch Wiki] Update of "bin/nutch webapp" by kamaci

2017-01-18 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "bin/nutch webapp" page has been changed by kamaci:
https://wiki.apache.org/nutch/bin/nutch%20webapp?action=diff&rev1=3&rev2=4

  
  = Nutch 2.X =
  Invoking the webapp starts a NutchUiServer on port 8080. There is a 
[[https://issues.apache.org/jira/browse/NUTCH-1859|Jira ticket available for 
making the port configurable]]. The WebApp interacts directly with the Nutch 
2.X REST API for configuring and administering your Nutch crawler. In its 
current form, this webapp is designed to provide Nutch administration and 
configuration. The webapp is not designed to be an end-to-end system or a 
search application (yet).
+ 
+ The webapp supports authentication (see the 
[[https://issues.apache.org/jira/browse/NUTCH-2344|Jira ticket for 
Authentication Support for Web GUI]]). The default username and password 
pairs are:
+ 
+ {{{
+ admin|admin
+ user|user
+ }}}
+ 
+ These defaults can be customised in nutch-site.xml.
  
  If you feel like contributing towards improving the WebApp, please check out 
the 
[[https://issues.apache.org/jira/browse/NUTCH/component/11630/?selectedTab=com.atlassian.jira.jira-projects-plugin:component-summary-panel|webapp Jira issues]].
  


[jira] [Updated] (NUTCH-2353) Create seed file with metadata using the REST API

2017-01-18 Thread Jorge Luis Betancourt Gonzalez (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jorge Luis Betancourt Gonzalez updated NUTCH-2353:
--
Description: 
At the moment it's not possible to create a seed file and specify any metadata 
when using the REST API. The file gets created but there is no option to add 
any metadata to the seed URLs.

If we use a payload like this:

{code}
{
  "name": "name-of-seedlist",
  "seedUrls": [
    {
      "url": "http://example.com",
      "metadata": {
        "key1": "value1",
        "key2": "value2",
        "key3": "value3"
      }
    }
  ]
}
{code}

It should be easy to specify the desired metadata. This should also keep 
backward compatibility (BC) with the previous array syntax if we only want to 
specify the list of URLs without any metadata at all.
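
A minimal sketch of how such an entry could map onto a seed file line, assuming 
the injector's plain-text seed format of a URL followed by tab-separated 
key=value pairs. SeedEntry and its methods are hypothetical names used for 
illustration, not Nutch's actual REST model classes:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical model of one "seedUrls" entry; not Nutch's actual REST class.
class SeedEntry {
    final String url;
    final Map<String, String> metadata;

    SeedEntry(String url, Map<String, String> metadata) {
        this.url = url;
        this.metadata = metadata;
    }

    // Old array syntax: a bare URL with no metadata.
    static SeedEntry ofUrl(String url) {
        return new SeedEntry(url, new LinkedHashMap<String, String>());
    }

    // Renders one seed file line: the URL, then tab-separated key=value pairs
    // (assumed seed format, see the note above).
    String toSeedLine() {
        StringBuilder line = new StringBuilder(url);
        for (Map.Entry<String, String> e : metadata.entrySet()) {
            line.append('\t').append(e.getKey()).append('=').append(e.getValue());
        }
        return line.toString();
    }
}

class SeedEntryDemo {
    public static void main(String[] args) {
        Map<String, String> meta = new LinkedHashMap<String, String>();
        meta.put("key1", "value1");
        meta.put("key2", "value2");
        // New object syntax: URL plus metadata.
        System.out.println(new SeedEntry("http://example.com", meta).toSeedLine());
        // Old array syntax: URL only.
        System.out.println(SeedEntry.ofUrl("http://example.com").toSeedLine());
    }
}
```

A bare URL from the old array syntax maps to an entry with empty metadata, so 
both forms produce a usable seed line.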

  was:
At the moment it's not possible to create a seed file and specify any metadata 
when using the REST API. The file gets created but there is no option to add 
any metadata to the seed URLs.

If we use a payload like this:

{code}
{
  "name": "name-of-seedlist",
  "seedUrls": [
    {
      "url": "http://example.com",
      "metadata": {
        "key1": "value1",
        "key2": "value2",
        "key3": "value3"
      }
    }
  ]
}
{code}

It should be easy to specify the desired metadata.


> Create seed file with metadata using the REST API
> -
>
> Key: NUTCH-2353
> URL: https://issues.apache.org/jira/browse/NUTCH-2353
> Project: Nutch
>  Issue Type: Improvement
>  Components: injector, REST_api
>Affects Versions: 1.12
>Reporter: Jorge Luis Betancourt Gonzalez
>Assignee: Jorge Luis Betancourt Gonzalez
>Priority: Minor
>  Labels: rest_api
> Fix For: 1.13
>
>
> At the moment it's not possible to create a seed file and specify any metadata 
> when using the REST API. The file gets created but there is no option to add 
> any metadata to the seed URLs.
> If we use a payload like this:
> {code}
> {
>   "name": "name-of-seedlist",
>   "seedUrls": [
>     {
>       "url": "http://example.com",
>       "metadata": {
>         "key1": "value1",
>         "key2": "value2",
>         "key3": "value3"
>       }
>     }
>   ]
> }
> {code}
> It should be easy to specify the desired metadata. This should also keep 
> backward compatibility (BC) with the previous array syntax if we only want to 
> specify the list of URLs without any metadata at all.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (NUTCH-2353) Create seed file with metadata using the REST API

2017-01-18 Thread Jorge Luis Betancourt Gonzalez (JIRA)
Jorge Luis Betancourt Gonzalez created NUTCH-2353:
-

 Summary: Create seed file with metadata using the REST API
 Key: NUTCH-2353
 URL: https://issues.apache.org/jira/browse/NUTCH-2353
 Project: Nutch
  Issue Type: Improvement
  Components: injector, REST_api
Affects Versions: 1.12
Reporter: Jorge Luis Betancourt Gonzalez
Assignee: Jorge Luis Betancourt Gonzalez
Priority: Minor
 Fix For: 1.13


At the moment it's not possible to create a seed file and specify any metadata 
when using the REST API. The file gets created but there is no option to add 
any metadata to the seed URLs.

If we use a payload like this:

{code}
{
  "name": "name-of-seedlist",
  "seedUrls": [
    {
      "url": "http://example.com",
      "metadata": {
        "key1": "value1",
        "key2": "value2",
        "key3": "value3"
      }
    }
  ]
}
{code}

It should be easy to specify the desired metadata.





[jira] [Commented] (NUTCH-2352) Log with Generic Class Name at Nutch 1.x

2017-01-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828683#comment-15828683
 ] 

ASF GitHub Bot commented on NUTCH-2352:
---

GitHub user kamaci opened a pull request:

https://github.com/apache/nutch/pull/172

NUTCH-2352 Logging with generic class name.



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/kamaci/nutch NUTCH-2352

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/nutch/pull/172.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #172


commit ea921b71ea2e1bf480b7417443fa04829c31cf82
Author: kamaci 
Date:   2017-01-18T21:00:06Z

NUTCH-2352 Logging with generic class name.




> Log with Generic Class Name at Nutch 1.x
> 
>
> Key: NUTCH-2352
> URL: https://issues.apache.org/jira/browse/NUTCH-2352
> Project: Nutch
>  Issue Type: Improvement
>Affects Versions: 1.12
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
>Priority: Minor
> Fix For: 1.13
>
>
> Mistakes are common when reference code is copied to create a new class and 
> the logger declaration is not updated. We can avoid this by obtaining the 
> logger through a generic class lookup, like this:
> {code:java}
> private static final Logger LOG = 
> LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
> {code}
> (cf. SOLR-8324)





[GitHub] nutch pull request #172: NUTCH-2352 Logging with generic class name.

2017-01-18 Thread kamaci
GitHub user kamaci opened a pull request:

https://github.com/apache/nutch/pull/172

NUTCH-2352 Logging with generic class name.



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/kamaci/nutch NUTCH-2352

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/nutch/pull/172.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #172


commit ea921b71ea2e1bf480b7417443fa04829c31cf82
Author: kamaci 
Date:   2017-01-18T21:00:06Z

NUTCH-2352 Logging with generic class name.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (NUTCH-2352) Log with Generic Class Name at Nutch 1.x

2017-01-18 Thread Furkan KAMACI (JIRA)
Furkan KAMACI created NUTCH-2352:


 Summary: Log with Generic Class Name at Nutch 1.x
 Key: NUTCH-2352
 URL: https://issues.apache.org/jira/browse/NUTCH-2352
 Project: Nutch
  Issue Type: Improvement
Affects Versions: 1.12
Reporter: Furkan KAMACI
Assignee: Furkan KAMACI
Priority: Minor
 Fix For: 1.13


Mistakes are common when reference code is copied to create a new class and 
the logger declaration is not updated. We can avoid this by obtaining the 
logger through a generic class lookup, like this:

{code:java}
private static final Logger LOG = 
LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
{code}

(cf. SOLR-8324)
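
A minimal sketch of the failure mode and the fix. It uses java.util.logging 
rather than SLF4J so that it runs without extra dependencies; OriginalParser 
and CopiedParser are hypothetical classes, not Nutch code:

```java
import java.lang.invoke.MethodHandles;
import java.util.logging.Logger;

// Hypothetical classes for illustration; neither exists in Nutch.
class OriginalParser {
    // Hard-coded class literal: correct here, but easy to leave behind when copied.
    static final Logger LOG = Logger.getLogger(OriginalParser.class.getName());
}

class CopiedParser {
    // Copy-paste mistake: this logger still reports under OriginalParser's name.
    static final Logger STALE_LOG = Logger.getLogger(OriginalParser.class.getName());

    // Generic lookup: resolves to the class that declares the field,
    // so the line can be copied verbatim into any class.
    static final Logger LOG =
        Logger.getLogger(MethodHandles.lookup().lookupClass().getName());
}

class GenericLoggerDemo {
    public static void main(String[] args) {
        // Logger name left over from the copied declaration.
        System.out.println(CopiedParser.STALE_LOG.getName());
        // Logger name derived from the declaring class.
        System.out.println(CopiedParser.LOG.getName());
    }
}
```

Because the generic declaration never names the class, it stays correct no 
matter where it is pasted.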





[jira] [Commented] (NUTCH-2344) Authentication Support for Web GUI

2017-01-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828445#comment-15828445
 ] 

Hudson commented on NUTCH-2344:
---

SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1577 (See 
[https://builds.apache.org/job/Nutch-nutchgora/1577/])
NUTCH-2344 Authentication support for Web GUI (kamaci: rev 
def067735c5a6dc46d867c4c89cb176a275b1967)
* (add) src/java/org/apache/nutch/webui/pages/auth/SignInPage.html
* (add) src/java/org/apache/nutch/webui/pages/auth/SignInPage.java
* (edit) ivy/ivy.xml
* (edit) src/java/org/apache/nutch/webui/pages/assets/nutch-style.css
* (add) src/java/org/apache/nutch/webui/pages/auth/SignInSession.java
* (add) src/java/org/apache/nutch/webui/pages/auth/AuthenticatedWebPage.java
* (add) src/java/org/apache/nutch/webui/pages/auth/package-info.java
* (edit) conf/nutch-default.xml
* (edit) src/java/org/apache/nutch/webui/pages/AbstractBasePage.java
* (edit) src/java/org/apache/nutch/webui/NutchUiApplication.properties
* (add) src/java/org/apache/nutch/webui/pages/auth/User.java
* (edit) src/java/org/apache/nutch/webui/NutchUiApplication.java
* (edit) src/java/org/apache/nutch/webui/pages/LogOutPage.java
* (add) src/java/org/apache/nutch/webui/pages/auth/AuthorizationStrategy.java


> Authentication Support for Web GUI
> --
>
> Key: NUTCH-2344
> URL: https://issues.apache.org/jira/browse/NUTCH-2344
> Project: Nutch
>  Issue Type: New Feature
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
> Attachments: Firefox_Screenshot_2017-01-13T19-10-49.499Z.png
>
>
> We should implement authentication support for the Web GUI.





[jira] [Commented] (NUTCH-2350) Add Missing activeConfId Field to NutchStatus Object

2017-01-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828446#comment-15828446
 ] 

Hudson commented on NUTCH-2350:
---

SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1577 (See 
[https://builds.apache.org/job/Nutch-nutchgora/1577/])
NUTCH-2350 Added missing activeConfId field to NutchStatus. (kamaci: rev 
6e074fc0b61f421cb7bc516e92dea33c3ce23fd5)
* (edit) src/java/org/apache/nutch/webui/client/model/NutchStatus.java


> Add Missing activeConfId Field to NutchStatus Object
> 
>
> Key: NUTCH-2350
> URL: https://issues.apache.org/jira/browse/NUTCH-2350
> Project: Nutch
>  Issue Type: Bug
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
>






[jira] [Commented] (NUTCH-2346) Check Types at Object Equality

2017-01-18 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828417#comment-15828417
 ] 

Lewis John McGibbney commented on NUTCH-2346:
-

[~kamaci] can you please elaborate on what the issue is here? We need Jira to 
be the canonical record for tracking the provenance of issues. Thanks.

> Check Types at Object Equality
> --
>
> Key: NUTCH-2346
> URL: https://issues.apache.org/jira/browse/NUTCH-2346
> Project: Nutch
>  Issue Type: Bug
>  Components: generator, metadata
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
>Priority: Minor
> Fix For: 2.4
>
>






[jira] [Resolved] (NUTCH-2344) Authentication Support for Web GUI

2017-01-18 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-2344.
-
Resolution: Fixed

Nice work [~kamaci] thank you

> Authentication Support for Web GUI
> --
>
> Key: NUTCH-2344
> URL: https://issues.apache.org/jira/browse/NUTCH-2344
> Project: Nutch
>  Issue Type: New Feature
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
> Attachments: Firefox_Screenshot_2017-01-13T19-10-49.499Z.png
>
>
> We should implement authentication support for the Web GUI.





[jira] [Resolved] (NUTCH-2350) Add Missing activeConfId Field to NutchStatus Object

2017-01-18 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-2350.
-
Resolution: Fixed

Nice work [~kamaci] thank you

> Add Missing activeConfId Field to NutchStatus Object
> 
>
> Key: NUTCH-2350
> URL: https://issues.apache.org/jira/browse/NUTCH-2350
> Project: Nutch
>  Issue Type: Bug
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
>






[jira] [Commented] (NUTCH-2350) Add Missing activeConfId Field to NutchStatus Object

2017-01-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828411#comment-15828411
 ] 

ASF GitHub Bot commented on NUTCH-2350:
---

Github user asfgit closed the pull request at:

https://github.com/apache/nutch/pull/170


> Add Missing activeConfId Field to NutchStatus Object
> 
>
> Key: NUTCH-2350
> URL: https://issues.apache.org/jira/browse/NUTCH-2350
> Project: Nutch
>  Issue Type: Bug
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
>






[jira] [Commented] (NUTCH-2344) Authentication Support for Web GUI

2017-01-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828410#comment-15828410
 ] 

ASF GitHub Bot commented on NUTCH-2344:
---

Github user asfgit closed the pull request at:

https://github.com/apache/nutch/pull/163


> Authentication Support for Web GUI
> --
>
> Key: NUTCH-2344
> URL: https://issues.apache.org/jira/browse/NUTCH-2344
> Project: Nutch
>  Issue Type: New Feature
>  Components: web gui
>Affects Versions: 2.3.1
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
> Fix For: 2.4
>
> Attachments: Firefox_Screenshot_2017-01-13T19-10-49.499Z.png
>
>
> We should implement authentication support for the Web GUI.





[GitHub] nutch pull request #170: NUTCH-2350 Added missing activeConfId field to Nutc...

2017-01-18 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/nutch/pull/170




[GitHub] nutch pull request #163: NUTCH-2344 Authentication support for Web GUI

2017-01-18 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/nutch/pull/163




ApacheCon CFP closing soon (11 February)

2017-01-18 Thread Rich Bowen
Hello, fellow Apache enthusiast. Thanks for your participation in, and
interest in, the projects of the Apache Software Foundation.

I wanted to remind you that the Call For Papers (CFP) for ApacheCon
North America, and Apache: Big Data North America, closes in less than a
month. If you've been putting it off because there was lots of time
left, it's time to dig for that inspiration and get those talk proposals in.

It's also time to discuss with your developer and user community whether
there's a track of talks that you might want to propose, so that you
have more complete coverage of your project than a talk or two.

We're looking for talks directly, and indirectly, related to projects at
the Apache Software Foundation. These can be anything from in-depth
technical discussions of the projects you work with, to talks about
community, documentation, legal issues, marketing, and so on. We're also
very interested in talks about projects and services built on top of
Apache projects, and case studies of how you use Apache projects to
solve real-world problems.

We are particularly interested in presentations from Apache projects
either in the Incubator, or recently graduated. ApacheCon is where
people come to find out what technology they'll be using this time next
year.

Important URLs are:

To submit a talk for Apache: Big Data -
http://events.linuxfoundation.org/events/apache-big-data-north-america/program/cfp
To submit a talk for ApacheCon -
http://events.linuxfoundation.org/events/apachecon-north-america/program/cfp

To register for Apache: Big Data -
http://events.linuxfoundation.org/events/apache-big-data-north-america/attend/register-
To register for ApacheCon -
http://events.linuxfoundation.org/events/apachecon-north-america/attend/register-

Early Bird registration rates end March 12th, but if you're a committer
on an Apache project, you get the low committer rate, which is less than
half of the early bird rate!

For further updates about ApacheCon, follow us on Twitter, @ApacheCon,
or drop by our IRC channel, #apachecon on the Freenode IRC network. Or
contact me - rbo...@apache.org - with any questions or concerns.

Thanks!

Rich Bowen, VP Conferences, Apache Software Foundation

-- 
(You've received this email because you're on a dev@ or users@ mailing
list of an Apache Software Foundation project. For subscription and
unsubscription information, consult the headers of this email message,
as this varies from one list to another.)


[jira] [Commented] (NUTCH-2333) Indexer for RabbitMQ

2017-01-18 Thread Sebastian Nagel (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828234#comment-15828234
 ] 

Sebastian Nagel commented on NUTCH-2333:


+1 looks good, although I haven't tested it.

Yes, there is some overlap between indexer-rabbit and publish-rabbitmq, mostly 
regarding configuration of and connection to RabbitMQ. Eventually, code could 
be shared in a lib-rabbitmq plugin, now or as a later improvement.

To implement indexer-rabbit as a NutchPublisher seems difficult: the 
IndexWriter and NutchPublisher interfaces are different, esp. how objects are 
serialized (a specific object "NutchDocument" vs. an unknown but universally 
JSON serializable object). Of course, one could think of indexing as an event, 
but in reality it's likely that different consumers/queues are used for 
monitoring and indexing content.

> Indexer for RabbitMQ
> 
>
> Key: NUTCH-2333
> URL: https://issues.apache.org/jira/browse/NUTCH-2333
> Project: Nutch
>  Issue Type: New Feature
>  Components: indexer
>Affects Versions: 1.12
>Reporter: Roannel Fernández Hernández
>Priority: Minor
> Fix For: 1.13
>
>
> A plugin to send the documents to a RabbitMQ server.


