Re: Unable to index Windows share repositories

2012-02-10 Thread Karl Wright
Nothing has changed as far as the connectors are concerned.  Is your
domain controller now upgraded to a different version of windows too?
If so you may need to play around with the fields that are used for
authorization, e.g. the form of the username and/or the domain name.

Windows is not an open platform and they change stuff all the time,
but to the best of my knowledge they have not introduced any new
authentication modes in Windows 7, so something should work.  If not
the guy to talk with is Michael Allen, who maintains the jcifs
library.

Karl

On Fri, Feb 10, 2012 at 2:17 AM, Swapna Vuppala
swapna.kollip...@gmail.com wrote:
 Hi,

 Till recently, I have been using ManifoldCF trunk code (before 0.4 was
 released) on Windows XP. I was able to index files from Windows Share
 repositories successfully into Solr. Now, I have started using ManifoldCF
 0.4 version on Windows 7.
 With the new setup, am able to index files from File system repository with
 no issue, but I have problems indexing data from Windows Share repository.

 The job starts and ends with Result Description : Authorization: Access is
 denied. in Simple History. The log file has the message JCIFS:
 Authorization exception reading document/directory smb://nhance29/TestMails/
 - skipping

 Can you please tell me what needs to be done to resolve this ?

 I tried enabling Debug from properties.xml and this is what I get in the log
 file.

 DEBUG 2012-02-10 12:34:37,869 (Startup thread) - Connecting to:
 smb://GLOBAL;stgserver:password@nhance29/
 DEBUG 2012-02-10 12:34:37,907 (Startup thread) - Seed =
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:39,781 (Worker thread '1') - JCIFS: getVersions():
 documentIdentifiers[0] is: smb://nhance29/TestMails/
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: In checkInclude
 for 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Matching
 startpoint 'smb://nhance29/TestMails/' against actual
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint found!
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint:
 always included
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Leaving
 checkInclude for 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS: Processing
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS:
 'smb://nhance29/TestMails/' is a directory
  WARN 2012-02-10 12:34:44,425 (Worker thread '1') - JCIFS: Possibly
 transient exception detected on attempt 1 while listing files: Access is
 denied.
 jcifs.smb.SmbAuthException: Access is denied.
     at jcifs.smb.SmbTransport.checkStatus(SmbTransport.java:546)
     at jcifs.smb.SmbTransport.send(SmbTransport.java:640)
     at jcifs.smb.SmbSession.send(SmbSession.java:238)
     at jcifs.smb.SmbTree.send(SmbTree.java:119)
     at jcifs.smb.SmbFile.send(SmbFile.java:775)
     at jcifs.smb.SmbFile.doFindFirstNext(SmbFile.java:1986)
     at jcifs.smb.SmbFile.doEnum(SmbFile.java:1738)
     at jcifs.smb.SmbFile.listFiles(SmbFile.java:1715)
     at jcifs.smb.SmbFile.listFiles(SmbFile.java:1704)
     at
 org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.fileListFiles(SharedDriveConnector.java:2224)
     at
 org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:701)
     at
 org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
     at
 org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:561)

 Thanks and Regards,
 Swapna.


Re: Unable to index Windows share repositories

2012-02-10 Thread Karl Wright
Good to hear.

The connector, by the way, is resigned to the fact that sometimes
various things fail when talking to Windows, which is why you see the
transient failure notification; it will retry on its own eventually
without killing the job, and only give up when things don't work for
an extended period of time.

Karl


On Fri, Feb 10, 2012 at 5:08 AM, Swapna Vuppala
swapna.kollip...@gmail.com wrote:
 Hi,

 Not sure why, but now am able to index data from Windows Share repositories
 into Solr. I don't get the Access denied messages any more, although I
 haven't changed anything.

 Sorry for the inconvenience caused. Will get back again if I see any issue.

 Thanks and Regards,
 Swapna.


 On Fri, Feb 10, 2012 at 12:47 PM, Swapna Vuppala
 swapna.kollip...@gmail.com wrote:

 Hi,

 Till recently, I have been using ManifoldCF trunk code (before 0.4 was
 released) on Windows XP. I was able to index files from Windows Share
 repositories successfully into Solr. Now, I have started using ManifoldCF
 0.4 version on Windows 7.
 With the new setup, am able to index files from File system repository
 with no issue, but I have problems indexing data from Windows Share
 repository.

 The job starts and ends with Result Description : Authorization: Access
 is denied. in Simple History. The log file has the message JCIFS:
 Authorization exception reading document/directory smb://nhance29/TestMails/
 - skipping

 Can you please tell me what needs to be done to resolve this ?

 I tried enabling Debug from properties.xml and this is what I get in the
 log file.

 DEBUG 2012-02-10 12:34:37,869 (Startup thread) - Connecting to:
 smb://GLOBAL;stgserver:password@nhance29/
 DEBUG 2012-02-10 12:34:37,907 (Startup thread) - Seed =
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:39,781 (Worker thread '1') - JCIFS: getVersions():
 documentIdentifiers[0] is: smb://nhance29/TestMails/
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: In checkInclude
 for 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Matching
 startpoint 'smb://nhance29/TestMails/' against actual
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint
 found!
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Startpoint:
 always included
 DEBUG 2012-02-10 12:34:44,417 (Worker thread '1') - JCIFS: Leaving
 checkInclude for 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS: Processing
 'smb://nhance29/TestMails/'
 DEBUG 2012-02-10 12:34:44,421 (Worker thread '1') - JCIFS:
 'smb://nhance29/TestMails/' is a directory
  WARN 2012-02-10 12:34:44,425 (Worker thread '1') - JCIFS: Possibly
 transient exception detected on attempt 1 while listing files: Access is
 denied.
 jcifs.smb.SmbAuthException: Access is denied.
     at jcifs.smb.SmbTransport.checkStatus(SmbTransport.java:546)
     at jcifs.smb.SmbTransport.send(SmbTransport.java:640)
     at jcifs.smb.SmbSession.send(SmbSession.java:238)
     at jcifs.smb.SmbTree.send(SmbTree.java:119)
     at jcifs.smb.SmbFile.send(SmbFile.java:775)
     at jcifs.smb.SmbFile.doFindFirstNext(SmbFile.java:1986)
     at jcifs.smb.SmbFile.doEnum(SmbFile.java:1738)
     at jcifs.smb.SmbFile.listFiles(SmbFile.java:1715)
     at jcifs.smb.SmbFile.listFiles(SmbFile.java:1704)
     at
 org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.fileListFiles(SharedDriveConnector.java:2224)
     at
 org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector.processDocuments(SharedDriveConnector.java:701)
     at
 org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
     at
 org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:561)

 Thanks and Regards,
 Swapna.