David Morana created CONNECTORS-688: ---------------------------------------
Summary: Can we exclude the Recycle Bin from being crawled in the Livelink Connector? Key: CONNECTORS-688 URL: https://issues.apache.org/jira/browse/CONNECTORS-688 Project: ManifoldCF Issue Type: Improvement Affects Versions: ManifoldCF 1.1.1, ManifoldCF 1.2, ManifoldCF 1.3 Environment: running solr 4.0 final with manifoldcf v1.1.1 on RHEL 6 64 bit on tomcat v7.0.34 Reporter: David Morana Priority: Minor Fix For: ManifoldCF 1.1.1 When a file in Livelink (Content Server 10 update 6) gets moved to the Recycle Bin (RC v10.0.0; this module is NOT a part of the basic content server install) the file is still crawled, indexed and it appears in search results (although the link will be inaccessible to users) the recycle bin is a special folder on the content server; it holds documents to be purged at a later date. LAPI still shows that they are not deleted. Can we add a filter to the UI and Livelink connector to exclude certain ownerID's (i.e. the ID of the recycle bin) from the crawl? In LivelinkConnectors.java you check to see if the version has been deleted and an additional check would need to be added to see if it was sent to the recycle bin (for example, the recycle bin's object id is 426023) Here's an example: after this call {code} int status = LLDocs.GetVersionInfo(vol,id,revNumber,versioninfo); {code} Just check the OWNER in the versioninfo object like so: {code} int ownerID = versioninfo.toInteger("OWNER"); {code} If owner is the NEGATIVE value of the recycle bin ID (i.e -426023) then it's marked for deletion and should be excluded from the index. Thanks, -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira