[ 
https://issues.apache.org/jira/browse/CONNECTORS-470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13274809#comment-13274809
 ] 

Karl Wright commented on CONNECTORS-470:
----------------------------------------

My colleague says the following:

"Yes, we have a customer that has updated to SharePoint 2010 so we've had to 
update the connector. I went into this knowing nothing about SharePoint and 
found ... I hate SharePoint. Here is what I know at this point:

- I never could get the DspSts.asmx service to work with 2010. Found a couple 
forum posts from MS employees that say that the DspSts interfaces are 
deprecated in 2010 and should not be used. Use the lists.asmx service to get the 
same information. In the crawler, I wrote another proxy class that inherits 
from the original and uses the lists.asmx service to get the same information 
for version 4.0 (2010).
- You still need the MCPermissions.asmx service as there still exists no way 
through the web services to get permissions info at the file level.
- MS has added a feature in 2010 called the “List View Threshold” to help with 
performance. It prevents users from getting the items in a list if the list 
has more than a “threshold” number of items, the default being 5000. This means 
that a crawling user cannot crawl a list having more than 5000 items without 
getting a “List View Threshold” exception from the SharePoint server (even 
using the lists.asmx web services). Unfortunately our clients have lists with 
WAY more than this number (closer to a million items). Some ways around this:
-- Have the SharePoint admins turn off the list view threshold for the lists in 
question, which most are NOT willing to do.
-- Crawl with an admin user (their threshold defaults to 20,000).
-- Have the SharePoint admins increase the Threshold of the crawling user 
(again, they’re not crazy about that).
-- I have implemented a couple more custom web service methods that the crawler 
uses to get list items that use the .NET SharePoint Server Object Model API. 
There are ways to circumvent this threshold using this API, then page the 
results back to the crawler. This allows us to crawl lists with an unlimited 
number of items. We are still in a testing phase with a customer for this.
-- Here is a link that describes this: 
http://blogs.technet.com/b/speschka/archive/2009/10/27/working-with-large-lists-in-sharepoint-2010-list-throttling.aspx
"
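
The "page the results back to the crawler" idea in the last workaround can be
sketched as a simple crawler-side loop. This is only an illustration under
stated assumptions, not the connector's or the custom web service's actual
code: `fetch_page`, `make_fake_list_service`, `crawl_list`, and
`ListViewThresholdError` are hypothetical names, and the stand-in server simply
rejects any single request for more items than the threshold, which is a
simplification of how SharePoint really throttles list queries.

```python
from typing import Callable, Iterator, List, Optional, Tuple

# Default threshold for ordinary users, as quoted in the comment above.
LIST_VIEW_THRESHOLD = 5000


class ListViewThresholdError(Exception):
    """Hypothetical stand-in for the server's "List View Threshold" error."""


# A page fetcher takes (position, page_size) and returns (items, next_position).
# next_position is None when there is nothing left to fetch.
PageFetcher = Callable[[Optional[int], int], Tuple[List[str], Optional[int]]]


def make_fake_list_service(total_items: int,
                           threshold: int = LIST_VIEW_THRESHOLD) -> PageFetcher:
    """Build an in-memory stand-in for a paging list service (assumption)."""
    items = [f"item-{i}" for i in range(total_items)]

    def fetch_page(position: Optional[int],
                   page_size: int) -> Tuple[List[str], Optional[int]]:
        # Simplified throttling: refuse any single request above the threshold.
        if page_size > threshold:
            raise ListViewThresholdError(
                f"requested {page_size} items, threshold is {threshold}")
        start = position or 0
        end = start + page_size
        next_position = end if end < len(items) else None
        return items[start:end], next_position

    return fetch_page


def crawl_list(fetch_page: PageFetcher, page_size: int) -> Iterator[str]:
    """Yield every item in the list, one bounded page at a time."""
    position: Optional[int] = None
    while True:
        page, position = fetch_page(position, page_size)
        yield from page
        if position is None:
            break


if __name__ == "__main__":
    service = make_fake_list_service(total_items=12000)
    crawled = list(crawl_list(service, page_size=2000))
    print(len(crawled))
```

Because each request stays below the threshold, the crawler never needs the
whole list in one call, and the total list size does not matter to the loop.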
                
> SharePoint 2010 support
> -----------------------
>
>                 Key: CONNECTORS-470
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-470
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: SharePoint connector
>    Affects Versions: ManifoldCF 0.5, ManifoldCF 0.6
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 0.6
>
>
> See here for list of the SharePoint web services for SharePoint 2010:
> http://msdn.microsoft.com/en-us/library/ee705814.aspx

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira