Re: Why is Nutch not involved in Google Summer of Code - 2008?

2008-03-24 Thread sishen
I'm also looking forward to Solr integration in Nutch.

On Mon, Mar 24, 2008 at 2:39 AM, All day coders [EMAIL PROTECTED]
wrote:

 Well, Susam, I agree with you. I can dedicate some time to the POST-based
 authentication (something I've been working on).

 Also, I've noticed there's no book about Nutch, which makes things
 extremely hard if you want to dive in. I know it takes time to
 do such a thing, but maybe we can put our efforts into creating something
 close to it.

 So, here are the things I miss the most:

 - Supported Solr integration
 - POST-based authentication

 Regards,
   Yoanis

 On 3/22/08, Susam Pal [EMAIL PROTECTED] wrote:
  Hi,

  I was wondering why the Nutch project is not involved in Google SoC:
  http://code.google.com/soc/2008/ Many Apache projects, including
  Commons, Hadoop and Mahout, have put up their ideas here:
  http://wiki.apache.org/general/SummerOfCode2008

  Wouldn't it be great to have students helping the project out with
  some of the work that no one has found time for? For example, many
  people have requested POST-based authentication support in
  Nutch. I personally wanted to do it after adding HTTP Authentication
  Schemes, but unfortunately I could never manage my time well enough to
  do it, since it would require a good deal of effort. I am sure there are
  many such ideas that have not been implemented because the contributors
  did not have time. IMHO, it would be great if students were given the
  opportunity to contribute through GSoC 2008. The mentors can guide
  them through the work for a few hours every week, and some valuable
  work can be done. What do you say?

  Regards,
  Susam Pal
 



Build failed in Hudson: Nutch-trunk #399

2008-03-24 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/399/changes

--
[...truncated 6092 lines...]

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

jar:

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

compile-test:

compile:
 [echo] Compiling plugin: urlfilter-automaton

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-automaton/test

jar:

deps-test:

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

jar:

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-automaton
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 3.528 sec

init:

init-plugin:

deps-jar:

init:
[junit] Running org.apache.nutch.urlfilter.automaton.TestAutomatonURLFilter

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

jar:

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

compile-test:

compile:
 [echo] Compiling plugin: urlfilter-regex

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-regex/test

jar:

deps-test:

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: lib-regex-filter

jar:

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-regex
[junit] Running org.apache.nutch.urlfilter.regex.TestRegexURLFilter
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 16.231 sec

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: urlfilter-suffix

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlfilter-suffix/test

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-suffix
[junit] Running org.apache.nutch.urlfilter.suffix.TestSuffixURLFilter
[junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0.131 sec

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: urlnormalizer-basic

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-basic/test

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-basic
[junit] Running org.apache.nutch.net.urlnormalizer.basic.TestBasicURLNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.129 sec

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: urlnormalizer-pass

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-pass/test

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-pass
[junit] Running org.apache.nutch.net.urlnormalizer.pass.TestPassURLNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.139 sec

init:

init-plugin:

deps-jar:

compile:
 [echo] Compiling plugin: urlnormalizer-regex

compile-test:
[javac] Compiling 1 source file to http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/urlnormalizer-regex/test
[javac] Note: http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/urlnormalizer-regex/src/test/org/apache/nutch/net/urlnormalizer/regex/TestRegexURLNormalizer.java uses unchecked or unsafe operations.
[javac] Note: Recompile with -Xlint:unchecked for details.

jar:

deps-test:

init:

init-plugin:

compile:

jar:
  [jar] Warning: skipping jar archive http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/nutch-extensionpoints/nutch-extensionpoints.jar because no files were included.

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-regex
[junit] Running org.apache.nutch.net.urlnormalizer.regex.TestRegexURLNormalizer
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 1.232 sec
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 29.879 sec

BUILD FAILED
http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build.xml :303: The following error occurred while executing this line:
http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/build.xml :92: The following error occurred while executing this line:
http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/src/plugin/build-plugin.xml :200: Tests failed!

Total time: 34 minutes 12 seconds
ERROR: No artifacts found that match the file pattern trunk/build/*.tar.gz. Configuration error?
ERROR: 'trunk/build/*.tar.gz' doesn't match anything: 'trunk' 

Re: Why is Nutch not involved in Google Summer of Code - 2008?

2008-03-24 Thread All day coders
Sishen:
I'm not very good at organizing things, but I'm looking forward to doing it.
Are you a student?

Susam, would I be asking too much if I asked you to share your experience
of how you came up with the HTTP Authentication for Nutch? I spent a
couple of days struggling with the code, but I didn't make much progress. I
guess I'm missing the big picture (something that happens quite often when
trying to extend Nutch, at least for me).



On Mon, Mar 24, 2008 at 4:04 AM, sishen [EMAIL PROTECTED] wrote:

 I'm also looking forward to Solr integration in Nutch.




Re: Why is Nutch not involved in Google Summer of Code - 2008?

2008-03-24 Thread sishen
Hi, rac.nosotros.

I'm not a student,
but I'm eager to do the work. Maybe I can work with others, if there are
any who want to do it.

I think it's very meaningful to integrate Solr into Nutch.


On Tue, Mar 25, 2008 at 4:26 AM, All day coders [EMAIL PROTECTED]
wrote:

 Sishen:
 I'm not very good at organizing things, but I'm looking forward to doing it.
 Are you a student?

 Susam, would I be asking too much if I asked you to share your experience
 of how you came up with the HTTP Authentication for Nutch? I spent a
 couple of days struggling with the code, but I didn't make much progress. I
 guess I'm missing the big picture (something that happens quite often when
 trying to extend Nutch, at least for me).


