Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-10-02 Thread Erlend Garåsen

On 28.09.12 13.31, Erlend Garåsen wrote:


OK, I will give you a stack trace in the beginning of next week.


Do you still need the stack trace? If you do, I need to adjust the log 
level and/or change the source code in order to print it out.


I'm still a little bit worried about how MCF deals with 500 server 
errors since the job I started last Friday is still running. It retries 
and retries the three documents I previously mentioned.


Is it really a normal behaviour that MCF retries the same document every 
fourth second after the last attempt and continues do do this (perhaps) 
thousand times? MCF has probably retried these documents in four days 
now. I doubt this is normal behaviour.


The job should end in the middle of the day on Saturday, and now it's 
Tuesday.


I will test the latest RC after these issues have been clarified.

Erlend

--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-10-02 Thread Erlend Garåsen


Karl, you wrote: I was able to reproduce the exception here using your 
URL.  It is indeed a bug in how it handles the 500 error.


OK, then I guess that the StringIndexOutOfBoundsException *was* related 
to the 500 server issue (It is not clear at all that it is related

to the 500 error you described before, but it could be.).

To clarify another thing: These three documents are fetched over and 
over again every fourth second (in four days). I was mentioning this in 
case we had another issue.


I'm just trying to clarify this before I deploy RC7 as I wrote.

Anyway, I will deploy RC7 now and start my job once more.

Erlend

On 02.10.12 11.03, Karl Wright wrote:

No stack trace needed.  If you read the rest of the mail, you will
note that I was able to reproduce the issue using the URL you had
provided.  There have been two RC's since; we are on RC7 now.

Karl


On Tue, Oct 2, 2012 at 4:38 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:

On 28.09.12 13.31, Erlend Garåsen wrote:


OK, I will give you a stack trace in the beginning of next week.



Do you still need the stack trace? If you do, I need to adjust the log level
and/or change the source code in order to print it out.

I'm still a little bit worried about how MCF deals with 500 server errors
since the job I started last Friday is still running. It retries and retries
the three documents I previously mentioned.

Is it really a normal behaviour that MCF retries the same document every
fourth second after the last attempt and continues do do this (perhaps)
thousand times? MCF has probably retried these documents in four days now. I
doubt this is normal behaviour.

The job should end in the middle of the day on Saturday, and now it's
Tuesday.

I will test the latest RC after these issues have been clarified.

Erlend


--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050



--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Erlend Garåsen


I'm trying to start a crawl before I have to run to the airport. I just 
discovered that MCF recrawls the same host over and over again when it 
returns result code 500:
09-28-2012 11:40:11.024 	fetch 
http://foreninger.uio.no/go/oslo_open_2012_no.php

500

It's just not this document, but several others returning the same HTTP 
result code.


Meanwhile, the following is filling up my log:
FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: 
String index out of range: -1

java.lang.StringIndexOutOfBoundsException: String index out of range: -1

I'm pretty sure they are related to each other.

I will end this job before I leave because I'm afraid that MCF will try 
to fetch these documents over and over again during this weekend.


Erlend

On 28.09.12 09.58, Karl Wright wrote:

Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
can be found at:

http://people.apache.org/~kwright/apache-manifoldcf-1.0

There is also an SVN tag at:

https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

Fixes since RC4:

CONNECTORS-545

Fixes since RC3:

CONNECTORS-544




--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
Hi Erlend,

This is expected behavior.  ManifoldCF is designed to retry on errors
that mean potential server problems for a period of time and then give
up.  The logic here is that 500 may well mean the server is down but
will be rebooted or whatever.  It retries every N minutes until M
minutes have elapsed.  For a 500 error, I believe it's every 5 minutes
for either 6 or 12 hours.

Karl


On Fri, Sep 28, 2012 at 5:49 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:

 I'm trying to start a crawl before I have to run to the airport. I just
 discovered that MCF recrawls the same host over and over again when it
 returns result code 500:
 09-28-2012 11:40:11.024 fetch
 http://foreninger.uio.no/go/oslo_open_2012_no.php
 500

 It's just not this document, but several others returning the same HTTP
 result code.

 Meanwhile, the following is filling up my log:
 FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String
 index out of range: -1
 java.lang.StringIndexOutOfBoundsException: String index out of range: -1

 I'm pretty sure they are related to each other.

 I will end this job before I leave because I'm afraid that MCF will try to
 fetch these documents over and over again during this weekend.

 Erlend


 On 28.09.12 09.58, Karl Wright wrote:

 Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
 can be found at:

 http://people.apache.org/~kwright/apache-manifoldcf-1.0

 There is also an SVN tag at:

 https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

 Fixes since RC4:

 CONNECTORS-545

 Fixes since RC3:

 CONNECTORS-544



 --
 Erlend Garåsen
 Center for Information Technology Services
 University of Oslo
 P.O. Box 1086 Blindern, N-0317 OSLO, Norway
 Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
Meanwhile, the following is filling up my log:
FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed:
String index out of range: -1
java.lang.StringIndexOutOfBoundsException: String index out of range: -1

This is indeed a problem I agree we should fix, but in order to do
that I need a stack trace.  It is not clear at all that it is related
to the 500 error you described before, but it could be.  I will create
a ticket for it though.
Karl

On Fri, Sep 28, 2012 at 5:49 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:

 I'm trying to start a crawl before I have to run to the airport. I just
 discovered that MCF recrawls the same host over and over again when it
 returns result code 500:
 09-28-2012 11:40:11.024 fetch
 http://foreninger.uio.no/go/oslo_open_2012_no.php
 500

 It's just not this document, but several others returning the same HTTP
 result code.

 Meanwhile, the following is filling up my log:
 FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String
 index out of range: -1
 java.lang.StringIndexOutOfBoundsException: String index out of range: -1

 I'm pretty sure they are related to each other.

 I will end this job before I leave because I'm afraid that MCF will try to
 fetch these documents over and over again during this weekend.

 Erlend


 On 28.09.12 09.58, Karl Wright wrote:

 Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
 can be found at:

 http://people.apache.org/~kwright/apache-manifoldcf-1.0

 There is also an SVN tag at:

 https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

 Fixes since RC4:

 CONNECTORS-545

 Fixes since RC3:

 CONNECTORS-544



 --
 Erlend Garåsen
 Center for Information Technology Services
 University of Oslo
 P.O. Box 1086 Blindern, N-0317 OSLO, Norway
 Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Erlend Garåsen


OK, I will give you a stack trace in the beginning of next week.

I will start the crawler once more and check the results when I'm back 
and change my vote then if it is ok.


Erlend

On 28.09.12 13.26, Karl Wright wrote:

Meanwhile, the following is filling up my log:
FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed:
String index out of range: -1
java.lang.StringIndexOutOfBoundsException: String index out of range: -1

This is indeed a problem I agree we should fix, but in order to do
that I need a stack trace.  It is not clear at all that it is related
to the 500 error you described before, but it could be.  I will create
a ticket for it though.
Karl

On Fri, Sep 28, 2012 at 5:49 AM, Erlend Garåsen e.f.gara...@usit.uio.no wrote:


I'm trying to start a crawl before I have to run to the airport. I just
discovered that MCF recrawls the same host over and over again when it
returns result code 500:
09-28-2012 11:40:11.024 fetch
http://foreninger.uio.no/go/oslo_open_2012_no.php
 500

It's just not this document, but several others returning the same HTTP
result code.

Meanwhile, the following is filling up my log:
FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String
index out of range: -1
java.lang.StringIndexOutOfBoundsException: String index out of range: -1

I'm pretty sure they are related to each other.

I will end this job before I leave because I'm afraid that MCF will try to
fetch these documents over and over again during this weekend.

Erlend


On 28.09.12 09.58, Karl Wright wrote:


Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
can be found at:

http://people.apache.org/~kwright/apache-manifoldcf-1.0

There is also an SVN tag at:

https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

Fixes since RC4:

CONNECTORS-545

Fixes since RC3:

CONNECTORS-544




--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050



--
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050


Re: [VOTE] Release Apache ManifoldCF 1.0, RC5

2012-09-28 Thread Karl Wright
CONNECTORS-547 (index out of bounds)
CONNECTORS-548 (cannot build with maven)

Karl

On Fri, Sep 28, 2012 at 7:26 AM, Karl Wright daddy...@gmail.com wrote:
 Meanwhile, the following is filling up my log:
 FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed:
 String index out of range: -1
 java.lang.StringIndexOutOfBoundsException: String index out of range: -1

 This is indeed a problem I agree we should fix, but in order to do
 that I need a stack trace.  It is not clear at all that it is related
 to the 500 error you described before, but it could be.  I will create
 a ticket for it though.
 Karl

 On Fri, Sep 28, 2012 at 5:49 AM, Erlend Garåsen e.f.gara...@usit.uio.no 
 wrote:

 I'm trying to start a crawl before I have to run to the airport. I just
 discovered that MCF recrawls the same host over and over again when it
 returns result code 500:
 09-28-2012 11:40:11.024 fetch
 http://foreninger.uio.no/go/oslo_open_2012_no.php
 500

 It's just not this document, but several others returning the same HTTP
 result code.

 Meanwhile, the following is filling up my log:
 FATAL 2012-09-28 11:42:32,112 (Worker thread '29') - Error tossed: String
 index out of range: -1
 java.lang.StringIndexOutOfBoundsException: String index out of range: -1

 I'm pretty sure they are related to each other.

 I will end this job before I leave because I'm afraid that MCF will try to
 fetch these documents over and over again during this weekend.

 Erlend


 On 28.09.12 09.58, Karl Wright wrote:

 Please vote +1 to release ManifoldCF 1.0, RC5.  The release artifact
 can be found at:

 http://people.apache.org/~kwright/apache-manifoldcf-1.0

 There is also an SVN tag at:

 https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.0-RC5

 Fixes since RC4:

 CONNECTORS-545

 Fixes since RC3:

 CONNECTORS-544



 --
 Erlend Garåsen
 Center for Information Technology Services
 University of Oslo
 P.O. Box 1086 Blindern, N-0317 OSLO, Norway
 Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050