[ 
https://issues.apache.org/jira/browse/CONNECTORS-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17349809#comment-17349809
 ] 

Shashank Dwivedi commented on CONNECTORS-917:
---------------------------------------------

Hi, do you have any update for this I have like 2000+ sites in sharepoint and i 
cannot manually add them plus i only need contents of specific folder inside a 
site. How can I achieve this.

> SharePoint connector would benefit from site discovery
> ------------------------------------------------------
>
>                 Key: CONNECTORS-917
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-917
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: SharePoint connector
>    Affects Versions: ManifoldCF 1.7
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>            Priority: Major
>             Fix For: ManifoldCF next
>
>
> The current SharePoint connector only can crawl a single SharePoint site.  
> But SharePoint can support multiple sites.  Indeed, in some cases there are 
> hundreds of such sites.  Setting up a connection and jobs for each one would 
> be a difficult task.
> The SharePoint admin site allows you to discover the sites that exist.  Using 
> this feature as part of the crawl would allow for a much more automated way 
> of handling large SharePoint installations.
> Some notes:
>    - Not yet clear how "one site" vs. "many sites" should coexist in one 
> connector
>      - Form of document identifier must change
>      - Each document identifier must include the site path first
>      - Since subsite path can be just "/", also needs to be resilient against 
> that
>      - Something like: <site_path>//<current_subsite_doc_list_item_etc_path>. 
>  But "//" will collide with old-style.
>      - If old-style document identifier always must start with a "/", then we 
> can simply start it with (say) a "+", to signal that it is a new-style 
> identifier
>      - Not clear yet if there's a new form that would allow us to know if a 
> doc identifier was old form or not
>    - Native authority also right now needs to know what site it is working 
> with
>      - Site discovery therefore must also be run in the authority, and tokens 
> for each discovered site must be returned
>      - Native tokens must therefore be qualified with a site ID



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to