Good news again! "was there any part of CONNECTORS-772 which represented any legitimate issues you had to fix?"
Yes; path handling was not robust in two different ways: first, relative vs. absolute, and second, we didn't check for sanity very often, so bad stuff would happen on misconfiguration. ;-) "How about CONNECTORS-777?" This was the issue that Will referred to in his posts. This is fixed too. All of this has been pulled up to trunk now, and one MORE fix was added there: the way the URL for a list item was built was incorrect. This did not affect crawls but if you tried to use the URL to get the list item, it would give a 404. "Also, a minor thing but I noticed that I've crawled 12 documents total but the Status of Jobs page shows 37 processed - ? Is it possible some docs got counted more than once? I could not tell just by looking at the logs." The number reported is the number of job queue records. You get a job queue record for each node in the hierarchy with SharePoint, so there is a record for the root site, a record for the subsite, etc. Many of these (most in the test case) do not index documents. Also, FWIW, I am working now on CONNECTORS-778, and it would be very helpful if I could continue to crawl against your test server for this purpose. Hope this is OK. Thanks, Karl On Wed, Sep 18, 2013 at 11:03 PM, Dmitry Goldenberg <[email protected]>wrote: > Karl and Will, thanks so much for your help on this. I'm still verifying > but I believe there may have been a snag in our 'alternative access > mappings' where a mapping had a wrong server name. Well, at least, once > I've changed it, I was able to: a) add content to subsites via the > SharePoint GUI and b) more importantly, am now fully able to do a complete > wildcarded crawl! > > Karl, just to understand this a bit better, was there any part of > CONNECTORS-772 which represented any legitimate issues you had to fix? I > thought there were still some snags with paths which you fixed. How about > CONNECTORS-777? I'm just trying to see what was a legitimate issue vs. the > fallout of our test env misconfiguration. > > Also, a minor thing but I noticed that I've crawled 12 documents total but > the Status of Jobs page shows 37 processed - ? Is it possible some docs got > counted more than once? I could not tell just by looking at the logs. > > Thanks again! > - Dmitry > > > >
