The jcifs connector does not include a lot of information in the version string for a file - basically, the length, and the modified date. So I would not expect there to be lot of actual work involved if there are no changes to a document.
The activity "access" does imply that the system believes that the document does need to be reindexed. It clearly reads the document properly. I would check to be sure it actually indexes the document. I suspect that your job may be reading the file but determining it is not suitable for indexing and then repeating that every day. You can see this by looking for the document in the activity log to see what ManifoldCF decided to do with it. Karl On Thu, May 25, 2023 at 6:03 AM Bisonti Mario <mario.biso...@vimar.com> wrote: > Hi, > > I would like to understand how recrawl works > > > > My job scan, using “Connection Type” “Windows shares” works for near 18 > hours. > > My document numebr a little bit of 1 million. > > > > If I check the documents scan from MifoldCF I see, for example: > > > > It seems that re work on the document every day even if it hadn’t been > modified. > > So, is it right or I chose a wrong job to crawl the documents? > > > > Thanks a lot > > Mario > > > > >