Re: SharePoint 2007 Connector - (401)HTTP/1.1 401 Unauthorized

2012-12-06 Thread Karl Wright
Hi Luigi,

Others have also run into this exception, from one or more SharePoint
web services.  It is a server side catch-all exception which tells us
very little.

You may get more details by looking at the server's event logs.
SharePoint also has a log you can look at which may be even more
helpful.  In my experience, this is often the result of administrators
changing the system's permissions in ways that cause SharePoint's web
services to stop functioning correctly.  At MetaCarta we never would
see this on fresh SharePoint installations, but only on those where
SharePoint was first installed, and then afterwards people made
adjustments to the system permissions.

I hope you have access to a competent SharePoint system administrator,
because without that, it will be very hard to resolve this problem.

Thanks,
Karl

On Thu, Dec 6, 2012 at 5:12 AM, Luigi D'Addario
luigi.dadda...@googlemail.com wrote:
 Karl,

 I'm trying to put into Solr my SharPoint documents from Shared Documents.
 What do you think about this exception ?
 Permission problems again or ?

 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Getting
 version of '/Shared Documents//'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Checking
 whether to include library '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Library
 '/Shared Documents' exactly matched rule path '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Including
 library '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Processing:
 '/Shared Documents//'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Document
 identifier is a library: '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,515 (Worker thread '3') - Enter:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,000 (Worker thread '3') - Exit:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,031 (Worker thread '3') - Enter:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,406 (Worker thread '3') - Exit:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,421 (Worker thread '3') - SharePoint: Got an
 unknown remote exception getting child documents for site  guid
 {CC072748-E1EE-4F34-B120-FAF33273A616} - axis fault = Server.Dsp.Connect,
 detail = Cannot open the requested Sharepoint Site. - retrying
 AxisFault
  faultCode: {http://schemas.xmlsoap.org/soap/envelope/}Server.Dsp.Connect
  faultSubcode:
  faultString: Cannot open the requested Sharepoint Site.
  faultActor:
  faultNode:
  faultDetail:
 {http://schemas.microsoft.com/sharepoint/dsp}queryResponse:dsQueryResponse
 status=failure/

 Cannot open the requested Sharepoint Site.


 I send you manifoldcf.log.


 Thanks.

 Luigi


 2012/12/5 Luigi D'Addario luigi.dadda...@googlemail.com

 ..and I, finally, tomorrow will try to put into Solr my SharPoint
 documents  !


 2012/12/5 Karl Wright daddy...@gmail.com

 I'll have to figure out how to get this patched httpcomponents release
 into the field





Re: SharePoint 2007 Connector - (401)HTTP/1.1 401 Unauthorized

2012-12-06 Thread Karl Wright
If you have access to the SharePoint installation media itself, one
approach would be to try to install your own version of SharePoint on
a similar environment.  Prove to yourself (and others) that you can
actually crawl on that SharePoint.  Then, based on what the target
system's event logs and SharePoint logs tell you, you can start
modifying settings and module permissions to match the fresh
installation's, until it works.  You can also save yourself some time
by getting the actual request being done using http wire debugging in
ManifoldCF, and then trying that request over and over with curl until
you get it to not fail.

Thanks,
Karl

On Thu, Dec 6, 2012 at 6:29 AM, Karl Wright daddy...@gmail.com wrote:
 Hi Luigi,

 Others have also run into this exception, from one or more SharePoint
 web services.  It is a server side catch-all exception which tells us
 very little.

 You may get more details by looking at the server's event logs.
 SharePoint also has a log you can look at which may be even more
 helpful.  In my experience, this is often the result of administrators
 changing the system's permissions in ways that cause SharePoint's web
 services to stop functioning correctly.  At MetaCarta we never would
 see this on fresh SharePoint installations, but only on those where
 SharePoint was first installed, and then afterwards people made
 adjustments to the system permissions.

 I hope you have access to a competent SharePoint system administrator,
 because without that, it will be very hard to resolve this problem.

 Thanks,
 Karl

 On Thu, Dec 6, 2012 at 5:12 AM, Luigi D'Addario
 luigi.dadda...@googlemail.com wrote:
 Karl,

 I'm trying to put into Solr my SharPoint documents from Shared Documents.
 What do you think about this exception ?
 Permission problems again or ?

 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Getting
 version of '/Shared Documents//'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Checking
 whether to include library '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Library
 '/Shared Documents' exactly matched rule path '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Including
 library '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Processing:
 '/Shared Documents//'
 DEBUG 2012-12-06 11:02:09,500 (Worker thread '3') - SharePoint: Document
 identifier is a library: '/Shared Documents'
 DEBUG 2012-12-06 11:02:09,515 (Worker thread '3') - Enter:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,000 (Worker thread '3') - Exit:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,031 (Worker thread '3') - Enter:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,406 (Worker thread '3') - Exit:
 CommonsHTTPSender::invoke
 DEBUG 2012-12-06 11:02:10,421 (Worker thread '3') - SharePoint: Got an
 unknown remote exception getting child documents for site  guid
 {CC072748-E1EE-4F34-B120-FAF33273A616} - axis fault = Server.Dsp.Connect,
 detail = Cannot open the requested Sharepoint Site. - retrying
 AxisFault
  faultCode: {http://schemas.xmlsoap.org/soap/envelope/}Server.Dsp.Connect
  faultSubcode:
  faultString: Cannot open the requested Sharepoint Site.
  faultActor:
  faultNode:
  faultDetail:
 {http://schemas.microsoft.com/sharepoint/dsp}queryResponse:dsQueryResponse
 status=failure/

 Cannot open the requested Sharepoint Site.


 I send you manifoldcf.log.


 Thanks.

 Luigi


 2012/12/5 Luigi D'Addario luigi.dadda...@googlemail.com

 ..and I, finally, tomorrow will try to put into Solr my SharPoint
 documents  !


 2012/12/5 Karl Wright daddy...@gmail.com

 I'll have to figure out how to get this patched httpcomponents release
 into the field





RE: Cannot connect to SharePoint 2010 instance

2012-12-06 Thread Iannetti, Robert
Thanks Karl I will give it a try today

-Original Message-
From: Karl Wright [mailto:daddy...@gmail.com] 
Sent: Wednesday, December 05, 2012 6:04 PM
To: user@manifoldcf.apache.org
Subject: Re: Cannot connect to SharePoint 2010 instance

Hi Robert,

I've solved Luigi's problem - and now I want to know if it solves
yours.  Unfortunately, you WILL have to build ManifoldCF for this
step, since I cannot modify the build process easily to accommodate
the patched httpcomponents dependencies.

Can you do the following:

(1) Check out a trunk copy of manifoldcf sources, e.g svn co
https://svn.apache.org/repos/asf/manifoldcf/trunk; .
(2) Download the lib package from
http://people.apache.org/~kwright/apache-manifoldcf-1.1-dev, unpack
it, and install it in the lib directory as per the instructions in the
lib package.
(3) Run ant build to be sure you can actually build the project.  If
that works, download the two patched httpcomponents jars from
http://people.apache.org/~kwright , and use them to overwrite
lib/httpcore.jar and lib/httpclient.jar.
(4) Run ant build clean
(5) Start manifoldcf (it's under the dist directory), and see if you
can connect to your sharepoint instance.

Thanks!
Karl


On Thu, Nov 29, 2012 at 8:56 AM, Iannetti, Robert
robert.ianne...@novartis.com wrote:
 Hi Karl,

 I have been following your thread with Luigi I look forward to testing the 
 new release.

 Thanks
 Bob


 -Original Message-
 From: Karl Wright [mailto:daddy...@gmail.com]
 Sent: Thursday, November 29, 2012 3:28 AM
 To: user@manifoldcf.apache.org
 Subject: Re: Cannot connect to SharePoint 2010 instance

 Hi Robert,

 Luigi and I think we've discovered the issue, which we're going to see if we 
 can confirm today.  There is a ticket tracking it, which is CONNECTORS-572.  
 If correct, it appears that Windows may have changed what it considers to be 
 the name of the user at some recent time, and the httpcomponents and 
 commons-httpclient implementations of NTLM are not resilient to this change - 
 which isn't surprising since they are basically reverse-engineered.  If 
 correct, httpcomponents will likely need to release a patch, so the schedule 
 will be, in part, up to them.
  Alternatively, we can build and patch httpcomponents as part of the 
 ManifoldCF release process, but it would require us to have a new Maven 
 dependency for the make-core-deps part of our release.

 Karl

 On Wed, Nov 28, 2012 at 9:01 AM, Iannetti, Robert 
 robert.ianne...@novartis.com wrote:
 Karl,

 Here is my debug output

 DEBUG 2012-11-28 08:59:25,884 (Thread-479) -  POST
 /_vti_bin/lists.asmx HTTP/1
 .1
 DEBUG 2012-11-28 08:59:25,899 (Thread-479) -  Content-Type:
 text/xml; charset=
 utf-8
 DEBUG 2012-11-28 08:59:25,899 (Thread-479) -  SOAPAction:
 http://schemas.micr osoft.com/sharepoint/soap/GetListCollection
 DEBUG 2012-11-28 08:59:25,899 (Thread-479) -  User-Agent: Axis/1.4
 DEBUG 2012-11-28 08:59:25,899 (Thread-479) -  Content-Length: 335
 DEBUG 2012-11-28 08:59:25,899 (Thread-479) -  Host:
 searchpoc.testprojects.nib r.novartis.intra DEBUG 2012-11-28
 08:59:25,899 (Thread-479) -  Connection: Keep-Alive DEBUG 2012-11-28
 08:59:30,629 (Thread-479) -  HTTP/1.1 401 Unauthorized DEBUG
 2012-11-28 08:59:30,629 (Thread-479) -  Server: Microsoft-IIS/7.5
 DEBUG 2012-11-28 08:59:30,629 (Thread-479) -  SPRequestGuid:
 56647ed0-9bac-4a2
 e-b61a-2d2e76ae8db0
 DEBUG 2012-11-28 08:59:30,629 (Thread-479) -  WWW-Authenticate: NTLM
 DEBUG 2012-11-28 08:59:30,629 (Thread-479) -  X-Powered-By: ASP.NET
 DEBUG 2012-11-28 08:59:30,630 (Thread-479) -  
 MicrosoftSharePointTeamServices:
  14.0.0.6123
 DEBUG 2012-11-28 08:59:30,630 (Thread-479) -  X-MS-InvokeApp: 1;
 RequireReadOn ly DEBUG 2012-11-28 08:59:30,630 (Thread-479) -  Date:
 Wed, 28 Nov 2012 13:59:30 GMT DEBUG 2012-11-28 08:59:30,630
 (Thread-479) -  Content-Length: 0 DEBUG 2012-11-28 08:59:30,663
 (Thread-479) -  POST /_vti_bin/lists.asmx HTTP/1.1 DEBUG 2012-11-28
 08:59:30,663 (Thread-479) -  Content-Type: text/xml; charset=utf-8
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  SOAPAction: 
 http://schemas.microsoft.com/sharepoint/soap/GetListCollection;
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  User-Agent: Axis/1.4
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  Content-Length: 335
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  Host:
 searchpoc.testprojects.nibr.novartis.intra
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  Connection: Keep-Alive
 DEBUG 2012-11-28 08:59:30,663 (Thread-479) -  Authorization: NTLM
 TlRMTVNTUAABNQIIIAoACgBAIAAgACBJAFAALQAxADAALQAxADQANQAtAD
 MAMgAtADEAMgAxAE4AQQBOAEUAVAA= DEBUG 2012-11-28 08:59:30,680
 (Thread-479) -  HTTP/1.1 401 Unauthorized DEBUG 2012-11-28
 08:59:30,680 (Thread-479) -  Server: Microsoft-IIS/7.5 DEBUG
 2012-11-28 08:59:30,680 (Thread-479) -  SPRequestGuid:
 208f5c66-7d26-4761-b578-d01645f042ed
 DEBUG 2012-11-28 08:59:30,680 (Thread-479) -  WWW-Authenticate: NTLM
 

Re: Web crawl exited with an unexpected jobqueue status error under MySQL

2012-12-06 Thread Shigeki Kobayashi
Oh , I see...


I will upgrade MCF and will try again...


Thanks!


Regards,

Shigeki


2012/12/6 Karl Wright daddy...@gmail.com

 Actually, I just noticed this: I ran MCF0.6 .  MCF 0.6 runs MySQL in
 the wrong mode, so this was a problem.  It was fixed in ManifoldCF
 1.0.

 Can you upgrade to MCF 1.01 and see if this still happens for you?

 Karl

 On Wed, Dec 5, 2012 at 9:46 PM, Shigeki Kobayashi
 shigeki.kobayas...@g.softbank.co.jp wrote:
  Hello Karl.
 
  MySQL:  5.5.24
  Tomcat:  6.0.35
  CentOS: 6.3
 
 
  Regards,
 
  Shigeki
 
 
  2012/12/5 Karl Wright daddy...@gmail.com
 
  Yes, I believe it is related, in the sense that the fix for
  CONNECTORS-246 was a fix to the HSQLDB database.  This error makes it
  clear that MySQL has a similar problem with its MVCC model, and will
  also require a fix.  However, I do not have the same kinds of leverage
  in the MySQL community that I do with HSQLDB.
 
  Can you give some details about the version of MySQL you are running,
  and on what platform?  I will capture that and then maybe figure out
  how to open a MySQL ticket.
 
  Karl
 
  On Wed, Dec 5, 2012 at 6:57 AM, Shigeki Kobayashi
  shigeki.kobayas...@g.softbank.co.jp wrote:
  
   Hi.
  
   I ran MCF0.6 under MySQL5.5. I crawled WEB and the following error
   occurred,
   then MCF stopped the job:
  
   
   2012/12/04 18:50:07 ERROR (Worker thread '0') - Exception tossed:
   Unexpected
   jobqueue status - record id 1354608871138, expecting active status,
 saw
   3
   org.apache.manifoldcf.core.interfaces.ManifoldCFException: Unexpected
   jobqueue status - record id 1354608871138, expecting active status,
 saw
   3
   at
  
  
 org.apache.manifoldcf.crawler.jobs.JobQueue.updateCompletedRecord(JobQueue.java:711)
   at
  
  
 org.apache.manifoldcf.crawler.jobs.JobManager.markDocumentCompletedMultiple(JobManager.java:2435)
   at
  
  
 org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:745)
   
  
   There was a similar ticket A file crawl exited with an unexpected
   jobqueue
   status error under HSQLDB.
   https://issues.apache.org/jira/browse/CONNECTORS-246
  
  
   Wondering if this is related..
  
  
   Regards,
  
   Shigeki