Re: beagle indexes and cache
does beagle cleans up its indexes and textcaches now and then? I mean it looks like old documents (now removed) still come up when searching... is there a way to clean indexes and textcache, or should I simply remove these directories in the .beagle folder now and then? It should, but I have had the indexes get stale, especially if I worked for awhile with the beagle service turned off. That is right. If beagle service is running, it will monitor new files that are created and existing files that are removed almost immediately. If it is turned off, and then turned on again, then the changes while it was turned off are handled later: - new files are reported only during the next re-crawl - deleted files are not reported but they are removed from the beagle database much later and only when such a file comes up a search result In short, deleted files should not come up as a search result (I dont remember, but there was a bug in 0.3.7 or 0.3.8 which can cause something like that). Removing textcache and indexes manually should not be necessary, assuming there is no bug (and even less reason if beagle is running continuously). Note that the indexes are separated into backends and can be easier to remove but the textcache is not, so removing the textcache directory will wipe out all of textcache. - dBera ___ Dashboard-hackers mailing list Dashboard-hackers@gnome.org http://mail.gnome.org/mailman/listinfo/dashboard-hackers
Re: beagle indexes and cache
D Bera wrote: does beagle cleans up its indexes and textcaches now and then? I mean it looks like old documents (now removed) still come up when searching... is there a way to clean indexes and textcache, or should I simply remove these directories in the .beagle folder now and then? It should, but I have had the indexes get stale, especially if I worked for awhile with the beagle service turned off. That is right. If beagle service is running, it will monitor new files that are created and existing files that are removed almost immediately. If it is turned off, and then turned on again, then the changes while it was turned off are handled later: - new files are reported only during the next re-crawl - deleted files are not reported but they are removed from the beagle database much later and only when such a file comes up a search result In short, deleted files should not come up as a search result (I dont remember, but there was a bug in 0.3.7 or 0.3.8 which can cause something like that). Removing textcache and indexes manually should not be necessary, assuming there is no bug (and even less reason if beagle is running continuously). Note that the indexes are separated into backends and can be easier to remove but the textcache is not, so removing the textcache directory will wipe out all of textcache. thanks for your answers! I was worried, though, about files being indexed that are in a directory which is later marked as ignored with beagle-settings: will those files well be removed from the beagle database? thanks again Lorenzo -- Lorenzo Bettini, PhD in Computer Science, DI, Univ. Torino ICQ# lbetto, 16080134 (GNU/Linux User # 158233) HOME: http://www.lorenzobettini.it MUSIC: http://www.purplesucker.com http://www.myspace.com/supertrouperabba BLOGS: http://tronprog.blogspot.com http://longlivemusic.blogspot.com http://www.gnu.org/software/src-highlite http://www.gnu.org/software/gengetopt http://www.gnu.org/software/gengen http://doublecpp.sourceforge.net ___ Dashboard-hackers mailing list Dashboard-hackers@gnome.org http://mail.gnome.org/mailman/listinfo/dashboard-hackers
Re: beagle indexes and cache
does beagle cleans up its indexes and textcaches now and then? I mean it looks like old documents (now removed) still come up when searching... is there a way to clean indexes and textcache, or should I simply remove these directories in the .beagle folder now and then? ... I was worried, though, about files being indexed that are in a directory which is later marked as ignored with beagle-settings: will those files well be removed from the beagle database? They would definitely not show up in search results - the search results are filtered based on current availability and beagle-settings ignore list. As far as actually removing them from the database is concerned, I don't remember off the top of my head how it is dealt with. Ideally, the entries should be removed since the index (and esp. the textcache) might retain information that the user doesnt want it to contain ... but ignoring a directory involves traversing all files and directories beneath it which soon creates a nightmare. I remember seeing some code on dropping the contents of the ignored directories but I am can't remember if it is complete or what exactly it does. - dBera -- - Debajyoti Bera @ http://dtecht.blogspot.com beagle / KDE fan Mandriva / Inspiron-1100 user ___ Dashboard-hackers mailing list Dashboard-hackers@gnome.org http://mail.gnome.org/mailman/listinfo/dashboard-hackers
Re: beagle indexes and cache
does beagle cleans up its indexes and textcaches now and then? I mean it looks like old documents (now removed) still come up when searching... is there a way to clean indexes and textcache, or should I simply remove these directories in the .beagle folder now and then? It should, but I have had the indexes get stale, especially if I worked for awhile with the beagle service turned off. It helps to occasionally stop the Beagle service, open a shell, export BEAGLE_EXERCISE_THE_DOG=1, run beagle-search (yes - from the shell!), then start the service. This will make the service index aggressively as fast as it can; bringing the index up to date. I do this on my laptop when I go to bed - then in the morning stop the service, close the shell, run beagle-search from the GUI (GNOME-Do!) and restart the service [now running in an environment without BEAGLE_EXERCISE_THE_DOG set] so it is back to running as a low-priority background service. You can also stop the service, rm -R ~/.beagle, export BEAGLE_EXERCISE_THE_DOG=1, beagle-search, start the service - to rebuild all your indexes from scratch as fast as it can. -- openSUSE http://www.opensuse.org/en/ Linux for human beings who need to get things done. signature.asc Description: This is a digitally signed message part ___ Dashboard-hackers mailing list Dashboard-hackers@gnome.org http://mail.gnome.org/mailman/listinfo/dashboard-hackers