Re: Why is Nutch not involved in Google Summer of Code - 2008?
I'm also looking forward to solr integration to nutch. On Mon, Mar 24, 2008 at 2:39 AM, All day coders [EMAIL PROTECTED] wrote: Well Susam I agree with you. I can dedicate some time to the POST based authentication(something i've been working on). Also, i've noticed there's no book about nutch, which makes things extremely hard if you want to dive in. Well, I know it takes time to do such a thing but maybe we can put our efforts to create something closer to it. So, here are the things I miss the most: - Supported Solr Integration - POST based authentication Regards, Yoanis On 3/22/08, Susam Pal [EMAIL PROTECTED] wrote: Hi, I was wondering why Nutch project is not involved in Google SoC: http://code.google.com/soc/2008/ Many Apache projects including Commons, Hadoop and Mahout have put up the ideas here: http://wiki.apache.org/general/SummerOfCode2008 Wouldn't it be great to have students helping the project out with some of the work which noone has found time for? For example, many people have requested for a POST based authentication support in Nutch. I personally wanted to do it after adding HTTP Authentication Schemes, but unfortunately I could never manage my time well to do it since it would require a good deal of effort. I am sure, there are many such ideas which have not been done because the contributors did not get time. IMHO, it would be great if students are given opportunity to contribute through GSoC 2008. The mentors can guide them through the work for a few hours every week and some valuable work can be done. What do you say? Regards, Susam Pal
Build failed in Hudson: Nutch-trunk #399
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/399/changes -- [...truncated 6092 lines...] init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter jar: init: init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter compile-test: compile: [echo] Compiling plugin: urlfilter-automaton compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-automaton/test jar: deps-test: init: init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter jar: deps-test: deploy: copy-generated-lib: deploy: copy-generated-lib: test: [echo] Testing plugin: urlfilter-automaton [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 3.528 sec init: init-plugin: deps-jar: init: [junit] Running org.apache.nutch.urlfilter.automaton.TestAutomatonURLFilter init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter jar: init: init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter compile-test: compile: [echo] Compiling plugin: urlfilter-regex compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-regex/test jar: deps-test: init: init-plugin: deps-jar: compile: [echo] Compiling plugin: lib-regex-filter jar: deps-test: deploy: copy-generated-lib: deploy: copy-generated-lib: test: [echo] Testing plugin: urlfilter-regex [junit] Running org.apache.nutch.urlfilter.regex.TestRegexURLFilter [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 16.231 sec init: init-plugin: deps-jar: compile: [echo] Compiling plugin: urlfilter-suffix compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-suffix/test jar: deps-test: deploy: copy-generated-lib: test: [echo] Testing plugin: urlfilter-suffix [junit] Running org.apache.nutch.urlfilter.suffix.TestSuffixURLFilter [junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0.131 sec init: init-plugin: deps-jar: compile: [echo] Compiling plugin: urlnormalizer-basic compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-basic/test jar: deps-test: deploy: copy-generated-lib: test: [echo] Testing plugin: urlnormalizer-basic [junit] Running org.apache.nutch.net.urlnormalizer.basic.TestBasicURLNormalizer [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.129 sec init: init-plugin: deps-jar: compile: [echo] Compiling plugin: urlnormalizer-pass compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-pass/test jar: deps-test: deploy: copy-generated-lib: test: [echo] Testing plugin: urlnormalizer-pass [junit] Running org.apache.nutch.net.urlnormalizer.pass.TestPassURLNormalizer [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.139 sec init: init-plugin: deps-jar: compile: [echo] Compiling plugin: urlnormalizer-regex compile-test: [javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-regex/test [javac] Note: http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/urlnormalizer-regex/src/test/org/apache/nutch/net/urlnormalizer/regex/TestRegexURLNormalizer.java uses unchecked or unsafe operations. [javac] Note: Recompile with -Xlint:unchecked for details. jar: deps-test: init: init-plugin: compile: jar: [jar] Warning: skipping jar archive http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/nutch-extensionpoints/nutch-extensionpoints.jar because no files were included. deps-test: deploy: copy-generated-lib: deploy: copy-generated-lib: test: [echo] Testing plugin: urlnormalizer-regex [junit] Running org.apache.nutch.net.urlnormalizer.regex.TestRegexURLNormalizer [junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 1.232 sec [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 29.879 sec BUILD FAILED http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build.xml :303: The following error occurred while executing this line: http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/build.xml :92: The following error occurred while executing this line: http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/build-plugin.xml :200: Tests failed! Total time: 34 minutes 12 seconds ERROR: No artifacts found that match the file pattern trunk/build/*.tar.gz. Configuration error? ERROR: 'trunk/build/*.tar.gz' doesn't match anything: 'trunk'
Re: Why is Nutch not involved in Google Summer of Code - 2008?
Sishen: I'm not very good at organizing things, but I'm looking forward to do it. Are you a student? Susam, would I be asking too much if I ask you to share your experiences about how to came up with the HTTP Authentication for Nutch? I spent a couple of days struggling with the code, but I didn't make much progress. I guess I'm missing the big picture (something that happens quite often when trying to extend Nutch, at least for me). On Mon, Mar 24, 2008 at 4:04 AM, sishen [EMAIL PROTECTED] wrote: I'm also looking forward to solr integration to nutch. On Mon, Mar 24, 2008 at 2:39 AM, All day coders [EMAIL PROTECTED] wrote: Well Susam I agree with you. I can dedicate some time to the POST based authentication(something i've been working on). Also, i've noticed there's no book about nutch, which makes things extremely hard if you want to dive in. Well, I know it takes time to do such a thing but maybe we can put our efforts to create something closer to it. So, here are the things I miss the most: - Supported Solr Integration - POST based authentication Regards, Yoanis On 3/22/08, Susam Pal [EMAIL PROTECTED] wrote: Hi, I was wondering why Nutch project is not involved in Google SoC: http://code.google.com/soc/2008/ Many Apache projects including Commons, Hadoop and Mahout have put up the ideas here: http://wiki.apache.org/general/SummerOfCode2008 Wouldn't it be great to have students helping the project out with some of the work which noone has found time for? For example, many people have requested for a POST based authentication support in Nutch. I personally wanted to do it after adding HTTP Authentication Schemes, but unfortunately I could never manage my time well to do it since it would require a good deal of effort. I am sure, there are many such ideas which have not been done because the contributors did not get time. IMHO, it would be great if students are given opportunity to contribute through GSoC 2008. The mentors can guide them through the work for a few hours every week and some valuable work can be done. What do you say? Regards, Susam Pal
Re: Why is Nutch not involved in Google Summer of Code - 2008?
Hi, rac.nosotros. I'm not a student. But i'm eager to do the work. Maybe I can work with some guys if there are to do that. I think it's very meaningful to integrate the solr into nutch. On Tue, Mar 25, 2008 at 4:26 AM, All day coders [EMAIL PROTECTED] wrote: Sishen: I'm not very good at organizing things, but I'm looking forward to do it. Are you a student? Susam, would I be asking too much if I ask you to share your experiences about how to came up with the HTTP Authentication for Nutch? I spent a couple of days struggling with the code, but I didn't make much progress. I guess I'm missing the big picture (something that happens quite often when trying to extend Nutch, at least for me). On Mon, Mar 24, 2008 at 4:04 AM, sishen [EMAIL PROTECTED] wrote: I'm also looking forward to solr integration to nutch. On Mon, Mar 24, 2008 at 2:39 AM, All day coders [EMAIL PROTECTED] wrote: Well Susam I agree with you. I can dedicate some time to the POST based authentication(something i've been working on). Also, i've noticed there's no book about nutch, which makes things extremely hard if you want to dive in. Well, I know it takes time to do such a thing but maybe we can put our efforts to create something closer to it. So, here are the things I miss the most: - Supported Solr Integration - POST based authentication Regards, Yoanis On 3/22/08, Susam Pal [EMAIL PROTECTED] wrote: Hi, I was wondering why Nutch project is not involved in Google SoC: http://code.google.com/soc/2008/ Many Apache projects including Commons, Hadoop and Mahout have put up the ideas here: http://wiki.apache.org/general/SummerOfCode2008 Wouldn't it be great to have students helping the project out with some of the work which noone has found time for? For example, many people have requested for a POST based authentication support in Nutch. I personally wanted to do it after adding HTTP Authentication Schemes, but unfortunately I could never manage my time well to do it since it would require a good deal of effort. I am sure, there are many such ideas which have not been done because the contributors did not get time. IMHO, it would be great if students are given opportunity to contribute through GSoC 2008. The mentors can guide them through the work for a few hours every week and some valuable work can be done. What do you say? Regards, Susam Pal