Hi Karl, I am terribly sorry for bothering you while you are busy. For this issue, I've tried to add a property for disabling hopcount logging to the database only for jobs with its hopcount mode set to "keep unreachable documents, forever" in the "Hop Filters" tab. I would appreciate it if you could give me your opinion or advice.
<property name="org.apache.manifoldcf.crawler.jobs.storehopcount" value="false"/> https://issues.apache.org/jira/browse/CONNECTORS-1747 Kind regards, Mingchun 2023年5月11日(木) 15:28 Mingchun Zhao <mingchun.zha...@gmail.com>: > > Hi Karl, > > Thank you for taking time out of your busy schedule to reply. > > > There is an option on the "hopcount" tab of your job to disable hopcount > > You mean setting "Hop count mode" to "keep unreachable documents, > forever" in the "Hop Filters" tab? > Yes, I did it, however, it seems that the records were still inserted > into the "intrinsiclink" and "hopcount" tables. Is there a way to tell > MCF not to insert data into those tables because operations on it can > become a performance bottleneck when the tables bloat? > > Regards, > Mingchun > > 2023年5月10日(水) 19:53 Karl Wright <daddy...@gmail.com>: > > > > There is an option on the "hopcount" tab of your job to disable hopcount > > tracking entirely. > > Karl > > > > On Tue, May 9, 2023 at 11:49 PM Mingchun Zhao <mingchun.zha...@gmail.com> > > wrote: > > > > > Hi Karl, > > > > > > Could you please advise me on tracking hopcount. > > > I'm using ManifoldCF 2.24 with PostgreSQL 12.14 as the database for now. > > > In my case, I don't need to use the 'Hop Filters' feature so I'd like > > > to disable tracking hopcount and reduce the insert/update/delete load > > > on the 'intrinsiclink' and 'hopcount' tables. So I have two questions > > > about this. > > > First, is there an option to disable tracking hopcount? > > > Second, if I disable tracking hopcount , can it affect other crawling > > > processes? > > > > > > Thank you in advance. > > > Kind regards, > > > Mingchun > > >