Re: [VOTE] Accept the nutch-helm project into Apache Nutch project

2021-07-07 Thread Sebastian Nagel

> under a separate repository

Ok, got it. Thanks!

On 7/7/21 6:17 AM, Lewis John McGibbney wrote:
> Hi Seb,
>
> On 2021/07/05 20:29:43, Sebastian Nagel wrote:
>> +1
>
> Thanks for casting your VOTE
>
>> From recent questions of users on the mailing list and on stackoverflow,
>> it's evident that some are required to run Nutch on K8s or similar deployments.
>> So, it's definitely worth to support this out-of-the-box.
>
> I agree!
>
>> @Lewis, could you explain why an IP CLEARANCE is required? When looking at the
>> commit history of [0], there are only contributions from a Nutch committer (you).
>
> Definitely, I am proposing to bring in the project under a separate repository
> i.e. https://github.com/apache/nutch-helm.git rather than donate directly to
> the https://github.com/apache/nutch.git. In this case, IP Clearance is required
> as per the Incubator PMC guidance
> http://incubator.apache.org/ip-clearance/index.html
>
> I actually did more or less the exact same thing with tika-helm recently cf.
>
> I hope this is clear.
> lewismc





[GitHub] [nutch] sebastian-nagel commented on a change in pull request #691: NUTCH-2883 Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread GitBox


sebastian-nagel commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665509195



##
File path: docker/config/supervisord_startserver.conf
##
@@ -0,0 +1,47 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+[supervisord]
+childlogdir=/var/log/supervisord/
+logfile=/tmp/supervisord.log ; (main log file;default $CWD/supervisord.log)
+logfile_maxbytes=50MB   ; (max main logfile bytes b4 rotation;default 50MB)
+logfile_backups=10  ; (num of main logfile rotation backups;default 10)
+loglevel=info   ; (log level;default info; others: debug,warn,trace)
+minfds=1024 ; (min. avail startup file descriptors;default 1024)
+minprocs=200 ; (min. avail process descriptors;default 200)
+nodaemon=false  ; (start in foreground if true;default false)
+pidfile=/tmp/supervisord.pid ; (supervisord pidfile;default supervisord.pid)

Review comment:
   ... and pid files I'd expect in `/run/` or `/var/run/`

##
File path: docker/config/supervisord_startserver.conf
##
@@ -0,0 +1,47 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+[supervisord]
+childlogdir=/var/log/supervisord/
+logfile=/tmp/supervisord.log ; (main log file;default $CWD/supervisord.log)

Review comment:
   Not a common place for log files. Is there a reason why the log file 
isn't also placed in `/var/log/supervisord/`?
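
A minimal sketch of what the two review comments above seem to ask for, assuming the main log file moves next to the child logs and the pid file moves under `/var/run/`. The helper name `write_supervisord_section` and the exact file names are hypothetical; the thread does not show the values that were eventually committed.

```shell
#!/bin/sh
# Hypothetical helper (not part of the pull request): prints a [supervisord]
# section with the main log file next to the child logs and the pid file in
# a runtime directory, as suggested in the review.
write_supervisord_section() {
  cat <<'EOF'
[supervisord]
childlogdir=/var/log/supervisord/
logfile=/var/log/supervisord/supervisord.log ; assumed file name
pidfile=/var/run/supervisord.pid ; assumed file name
EOF
}
```

Running `write_supervisord_section > supervisord.conf` would write the fragment to a file; both directories are assumed to exist and to be writable inside the container.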




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Commented] (NUTCH-2883) Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376680#comment-17376680
 ] 

ASF GitHub Bot commented on NUTCH-2883:
---

sebastian-nagel commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665509195





> Provide means to run server and webapp as persistent services in Docker 
> container
> -
>
> Key: NUTCH-2883
> URL: https://issues.apache.org/jira/browse/NUTCH-2883
> Project: Nutch
>  Issue Type: Improvement
>  Components: docker, nutch server, web gui
>Affects Versions: 1.18
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Major
> Fix For: 1.19
>
>
> Some conditional logic and some Docker build arguments would allow a user to 
> run the nutch server alone or both the nutch server and webapp as long 
> running services.
> This could be useful for users who want to remotely interact with Nutch via 
> REST where jobs may actually be executed on yet another remote deployment.
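
The conditional logic described in the issue could be sketched roughly as below. This is an illustration under assumptions, not the logic in the pull request: the function name `select_supervisord_conf`, the meaning of each `BUILD_MODE` value, and the file name `supervisord_startserver_webapp.conf` are hypothetical. Only the `BUILD_MODE` build argument and `supervisord_startserver.conf` appear in the pull request discussion.

```shell
#!/bin/sh
# Hypothetical mapping from a BUILD_MODE value to a supervisord config file.
# Assumed meaning: 1 = Nutch server only, 2 = Nutch server and webapp.
select_supervisord_conf() {
  case "$1" in
    1) echo "supervisord_startserver.conf" ;;
    2) echo "supervisord_startserver_webapp.conf" ;;  # assumed file name
    *) echo "no long-running services for BUILD_MODE=$1" >&2
       return 1 ;;
  esac
}
```

For example, `select_supervisord_conf 2` prints the name of the config that would start both services, while any other value returns a non-zero status so that a calling script can skip starting supervisord.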



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: [VOTE] Accept the nutch-helm project into Apache Nutch project

2021-07-07 Thread Furkan KAMACI
Hi,

+1 from me too!

Kind Regards,
Furkan KAMACI

On Wed, Jul 7, 2021 at 4:48 PM Sebastian Nagel 
wrote:

>  > under a separate repository
>
> Ok, got it. Thanks!
>
> On 7/7/21 6:17 AM, Lewis John McGibbney wrote:
> > Hi Seb,
> >
> > On 2021/07/05 20:29:43, Sebastian Nagel 
> wrote:
> >> +1
> >
> > Thanks for casting your VOTE
> >
> >>
> >>   From recent questions of users on the mailing list and on
> stackoverflow,
> >> it's evident that some are required to run Nutch on K8s or similar
> deployments.
> >> So, it's definitely worth to support this out-of-the-box.
> >
> > I agree!
> >
> >>
> >> @Lewis, could you explain why an IP CLEARANCE is required? When looking
> at the commit
> >> history of [0], there are only contributions from a Nutch committer
> (you).
> >
> > Definitely, I am proposing to bring in the project under a separate
> repository i.e. https://github.com/apache/nutch-helm.git rather than
> donate directly to the https://github.com/apache/nutch.git. In this case,
> IP Clearance is required as per the Incubator PMC guidance
> http://incubator.apache.org/ip-clearance/index.html
> >
> > I actually did more or less the exact same thing with tika-helm recently
> cf.
> >
> > I hope this is clear.
> > lewismc
> >
>
>


Re: [VOTE] Accept the nutch-helm project into Apache Nutch project

2021-07-07 Thread BlackIce
Hi everybody!

+1 as well!

Cheers

Ralf Kotowski

On Wed, Jul 7, 2021 at 7:37 PM Furkan KAMACI  wrote:
>
> Hi,
>
> +1 from me too!
>
> Kind Regards,
> Furkan KAMACI
>
> On Wed, Jul 7, 2021 at 4:48 PM Sebastian Nagel  
> wrote:
>>
>>  > under a separate repository
>>
>> Ok, got it. Thanks!
>>
>> On 7/7/21 6:17 AM, Lewis John McGibbney wrote:
>> > Hi Seb,
>> >
>> > On 2021/07/05 20:29:43, Sebastian Nagel  wrote:
>> >> +1
>> >
>> > Thanks for casting your VOTE
>> >
>> >>
>> >>   From recent questions of users on the mailing list and on stackoverflow,
>> >> it's evident that some are required to run Nutch on K8s or similar 
>> >> deployments.
>> >> So, it's definitely worth to support this out-of-the-box.
>> >
>> > I agree!
>> >
>> >>
>> >> @Lewis, could you explain why an IP CLEARANCE is required? When looking 
>> >> at the commit
>> >> history of [0], there are only contributions from a Nutch committer (you).
>> >
>> > Definitely, I am proposing to bring in the project under a separate 
>> > repository i.e. https://github.com/apache/nutch-helm.git rather than 
>> > donate directly to the https://github.com/apache/nutch.git. In this case, 
>> > IP Clearance is required as per the Incubator PMC guidance 
>> > http://incubator.apache.org/ip-clearance/index.html
>> >
>> > I actually did more or less the exact same thing with tika-helm recently 
>> > cf.
>> >
>> > I hope this is clear.
>> > lewismc
>> >
>>


Re: [VOTE] Accept the nutch-helm project into Apache Nutch project

2021-07-07 Thread Matei Miroslav
Hi everybody!

+1 as well

Cheers,
M

Sent from my iPhone

> On 7 Jul 2021, at 21:16, BlackIce  wrote:
> 
> Hi everybody!
> 
> +1 as well!
> 
> Cheers
> 
> Ralf Kotowski
> 
>> On Wed, Jul 7, 2021 at 7:37 PM Furkan KAMACI  wrote:
>> 
>> Hi,
>> 
>> +1 from me too!
>> 
>> Kind Regards,
>> Furkan KAMACI
>> 
>>> On Wed, Jul 7, 2021 at 4:48 PM Sebastian Nagel  
>>> wrote:
>>> 
>>>> under a separate repository
>>> 
>>> Ok, got it. Thanks!
>>> 
>>> On 7/7/21 6:17 AM, Lewis John McGibbney wrote:
>>>> Hi Seb,
>>>>
>>>> On 2021/07/05 20:29:43, Sebastian Nagel wrote:
>>>>> +1
>>>>
>>>> Thanks for casting your VOTE
>>>>
>>>>>
>>>>> From recent questions of users on the mailing list and on stackoverflow,
>>>>> it's evident that some are required to run Nutch on K8s or similar
>>>>> deployments.
>>>>> So, it's definitely worth to support this out-of-the-box.
>>>>
>>>> I agree!
>>>>
>>>>>
>>>>> @Lewis, could you explain why an IP CLEARANCE is required? When looking
>>>>> at the commit
>>>>> history of [0], there are only contributions from a Nutch committer (you).
>>>>
>>>> Definitely, I am proposing to bring in the project under a separate
>>>> repository i.e. https://github.com/apache/nutch-helm.git rather than
>>>> donate directly to the https://github.com/apache/nutch.git. In this case,
>>>> IP Clearance is required as per the Incubator PMC guidance
>>>> http://incubator.apache.org/ip-clearance/index.html
>>>>
>>>> I actually did more or less the exact same thing with tika-helm recently
>>>> cf.
>>>>
>>>> I hope this is clear.
>>>> lewismc
>>>>
>>> 


[GitHub] [nutch] lewismc commented on a change in pull request #691: NUTCH-2883 Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread GitBox


lewismc commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665828450



##
File path: docker/config/supervisord_startserver.conf
##
@@ -0,0 +1,47 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+[supervisord]
+childlogdir=/var/log/supervisord/
+logfile=/tmp/supervisord.log ; (main log file;default $CWD/supervisord.log)
+logfile_maxbytes=50MB   ; (max main logfile bytes b4 rotation;default 50MB)
+logfile_backups=10  ; (num of main logfile rotation backups;default 10)
+loglevel=info   ; (log level;default info; others: debug,warn,trace)
+minfds=1024 ; (min. avail startup file descriptors;default 1024)
+minprocs=200 ; (min. avail process descriptors;default 200)
+nodaemon=false  ; (start in foreground if true;default false)
+pidfile=/tmp/supervisord.pid ; (supervisord pidfile;default supervisord.pid)

Review comment:
   Same, I am happy to update this.








[GitHub] [nutch] lewismc commented on a change in pull request #691: NUTCH-2883 Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread GitBox


lewismc commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665828374



##
File path: docker/config/supervisord_startserver.conf
##
@@ -0,0 +1,47 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+[supervisord]
+childlogdir=/var/log/supervisord/
+logfile=/tmp/supervisord.log ; (main log file;default $CWD/supervisord.log)

Review comment:
   I am happy to update this... 








[jira] [Commented] (NUTCH-2883) Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376963#comment-17376963
 ] 

ASF GitHub Bot commented on NUTCH-2883:
---

lewismc commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665828374





[jira] [Commented] (NUTCH-2883) Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376964#comment-17376964
 ] 

ASF GitHub Bot commented on NUTCH-2883:
---

lewismc commented on a change in pull request #691:
URL: https://github.com/apache/nutch/pull/691#discussion_r665828450





[RESULT] WAS Re: [VOTE] Accept the nutch-helm project into Apache Nutch project

2021-07-07 Thread Lewis John McGibbney
Hi All,
The usual 72 hours has come and gone so I will close this thread. Thanks for 
VOTE'ing.

[5] +1 Accept nutch-helm donation into Nutch PMC
Sebastian Nagel*
Furkan Kamaci*
Ralf Kotowski*
Matei Miroslav

[0] +/-0 Not strong either way about the nutch-helm donation into Nutch PMC
[0] -1 DO NOT accept the nutch-helm donation into Nutch PMC... please state why

*Nutch PMC-binding

I'll go ahead with the remaining IP CLEARANCE.
Thanks

lewismc

On 2021/07/03 03:45:49, Lewis John McGibbney  wrote: 
> dev@,
> This is a VOTE to accept the nutch-helm project [0] as a donation to the 
> Apache Nutch PMC.
> nutch-helm is a Helm chart I wrote to deploy Apache Nutch (containers) on 
> Kubernetes. It makes it really easy to establish a Nutch REST server and 
> WebApp (should you wish) as well as a Nutch installation.
> If this VOTE were to pass I would go ahead and complete an IP CLEARANCE 
> effort with the Apache Incubator PMC.
> Thanks for your consideration in this VOTE.
> 
> [ ] +1 Accept nutch-helm donation into Nutch PMC
> [ ] +/-0 Not strong either way about the nutch-helm donation into Nutch PMC
> [ ] -1 DO NOT accept the nutch-helm donation into Nutch PMC... please state 
> why
> 
> Thanks
> lewismc
> 
> [0] https://github.com/lewismc/nutch-helm
> 


Suggestions for 9-month senior capstone project with Oregon State University

2021-07-07 Thread Lewis John McGibbney
Hi dev@,
I've been presented with an opportunity to propose a project to the senior 
computer science capstone program at Oregon State University. It is 
anticipated to last 9 months, which means that a significant project could be 
undertaken.
My thoughts are as follows:

1. Convert the Ant + Ivy build system to Gradle. This would also involve 
upgrading Fireant to work with a Gradle build system instead of Ant + Ivy.
2. Continue the effort to port Nutch to run on Apache Tez (as opposed to 
MapReduce).

Are there any other suggestions?
Thanks

lewismc


[GitHub] [nutch] lewismc commented on pull request #691: NUTCH-2883 Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread GitBox


lewismc commented on pull request #691:
URL: https://github.com/apache/nutch/pull/691#issuecomment-876084540


   @sebastian-nagel here's what I tried. First prune entire Docker development 
cache
   ```
   docker system prune -a
   ...
   Total reclaimed space: 14.1GB
   ```
   The build
   ```
   docker % docker build -t apache/nutch . --build-arg BUILD_MODE=2 --build-arg SERVER_PORT=8081 --build-arg SERVER_HOST=0.0.0.0 --build-arg WEBAPP_PORT=8080
   ...
   [+] Building 743.6s (17/17) FINISHED
    => [internal] load build definition from Dockerfile    0.0s
    => => transferring dockerfile: 4.42kB    0.0s
    => [internal] load .dockerignore    0.0s
    => => transferring context: 2B    0.0s
    => [internal] load metadata for docker.io/library/alpine:3.13    1.4s
    => [base 1/8] FROM docker.io/library/alpine:3.13@sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc    0.6s
    => => resolve docker.io/library/alpine:3.13@sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc    0.0s
    => => sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc 1.64kB / 1.64kB    0.0s
    => => sha256:def822f9851ca422481ec6fee59a9966f12b351c62ccb9aca841526ffaa9f748 528B / 528B    0.0s
    => => sha256:6dbb9cc54074106d46d4ccb330f2a40a682d49dda5f4844962b7dce9fe44aaec 1.47kB / 1.47kB    0.0s
    => => sha256:540db60ca9383eac9e418f78490994d0af424aab7bf6d0e47ac8ed4e2e9bcbba 2.81MB / 2.81MB    0.4s
    => => extracting sha256:540db60ca9383eac9e418f78490994d0af424aab7bf6d0e47ac8ed4e2e9bcbba    0.2s
    => [internal] load build context    0.0s
    => => transferring context: 2.74kB    0.0s
    => [base 2/8] WORKDIR /root/    0.0s
    => [base 3/8] RUN apk update    1.0s
    => [base 4/8] RUN apk --no-cache add apache-ant bash git openjdk11 supervisor    18.6s
    => [base 5/8] RUN echo 'export JAVA_HOME=/usr/lib/jvm/java-11-openjdk' >> $HOME/.bashrc    0.2s
    => [base 6/8] RUN git clone https://github.com/apache/nutch.git nutch_source &&  cd nutch_source &&

[jira] [Commented] (NUTCH-2883) Provide means to run server and webapp as persistent services in Docker container

2021-07-07 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17376989#comment-17376989
 ] 

ASF GitHub Bot commented on NUTCH-2883:
---

lewismc commented on pull request #691:
URL: https://github.com/apache/nutch/pull/691#issuecomment-876084540


   @sebastian-nagel here's what I tried. First prune entire Docker development 
cache
   ```
   docker system prune -a
   ...
   Total reclaimed space: 14.1GB
   ```
   The build
   ```
   docker % docker build -t apache/nutch . --build-arg BUILD_MODE=2 --build-arg SERVER_PORT=8081 --build-arg SERVER_HOST=0.0.0.0 --build-arg WEBAPP_PORT=8080
   ...
   [+] Building 743.6s (17/17) FINISHED
    => [internal] load build definition from Dockerfile    0.0s
    => => transferring dockerfile: 4.42kB    0.0s
    => [internal] load .dockerignore    0.0s
    => => transferring context: 2B    0.0s
    => [internal] load metadata for docker.io/library/alpine:3.13    1.4s
    => [base 1/8] FROM docker.io/library/alpine:3.13@sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc    0.6s
    => => resolve docker.io/library/alpine:3.13@sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc    0.0s
    => => sha256:f51ff2d96627690d62fee79e6eecd9fa87429a38142b5df8a3bfbb26061df7fc 1.64kB / 1.64kB    0.0s
    => => sha256:def822f9851ca422481ec6fee59a9966f12b351c62ccb9aca841526ffaa9f748 528B / 528B    0.0s
    => => sha256:6dbb9cc54074106d46d4ccb330f2a40a682d49dda5f4844962b7dce9fe44aaec 1.47kB / 1.47kB    0.0s
    => => sha256:540db60ca9383eac9e418f78490994d0af424aab7bf6d0e47ac8ed4e2e9bcbba 2.81MB / 2.81MB    0.4s
    => => extracting sha256:540db60ca9383eac9e418f78490994d0af424aab7bf6d0e47ac8ed4e2e9bcbba    0.2s
    => [internal] load build context    0.0s
    => => transferring context: 2.74kB    0.0s
    => [base 2/8] WORKDIR /root/    0.0s
    => [base 3/8] RUN apk update    1.0s
    => [base 4/8] RUN apk --no-cache add apache-ant bash git openjdk11 supervisor    18.6s
    => [base 5/8] RUN echo 'export JAVA_HOME=/usr/lib/jvm/java-11-openjdk' >> $HOME/.bashrc