Re: beagle indexes and cache

2009-10-27 Thread D Bera
 does beagle cleans up its indexes and textcaches now and then?
 I mean it looks like old documents (now removed) still come up when
 searching...
 is there a way to clean indexes and textcache, or should I simply remove
 these directories in the .beagle folder now and then?

 It should, but I have had the indexes get stale, especially if I worked
 for awhile with the beagle service turned off.

That is right. If beagle service is running, it will monitor new files
that are created and existing files that are removed almost
immediately. If it is turned off, and then turned on again, then the
changes while it was turned off are handled later:
- new files are reported only during the next re-crawl
- deleted files are not reported but they are removed from the beagle
database much later and only when such a file comes up a search result

In short, deleted files should not come up as a search result (I dont
remember, but there was a bug in 0.3.7 or 0.3.8 which can cause
something like that).

Removing textcache and indexes manually should not be necessary,
assuming there is no bug (and even less reason if beagle is running
continuously). Note that the indexes are separated into backends and
can be easier to remove but the textcache is not, so removing the
textcache directory will wipe out all of textcache.

- dBera
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: beagle indexes and cache

2009-10-27 Thread Lorenzo Bettini

D Bera wrote:

does beagle cleans up its indexes and textcaches now and then?
I mean it looks like old documents (now removed) still come up when
searching...
is there a way to clean indexes and textcache, or should I simply remove
these directories in the .beagle folder now and then?

It should, but I have had the indexes get stale, especially if I worked
for awhile with the beagle service turned off.


That is right. If beagle service is running, it will monitor new files
that are created and existing files that are removed almost
immediately. If it is turned off, and then turned on again, then the
changes while it was turned off are handled later:
- new files are reported only during the next re-crawl
- deleted files are not reported but they are removed from the beagle
database much later and only when such a file comes up a search result

In short, deleted files should not come up as a search result (I dont
remember, but there was a bug in 0.3.7 or 0.3.8 which can cause
something like that).

Removing textcache and indexes manually should not be necessary,
assuming there is no bug (and even less reason if beagle is running
continuously). Note that the indexes are separated into backends and
can be easier to remove but the textcache is not, so removing the
textcache directory will wipe out all of textcache.



thanks for your answers!

I was worried, though, about files being indexed that are in a directory 
which is later marked as ignored with beagle-settings: will those files 
well be removed from the beagle database?


thanks again
Lorenzo

--
Lorenzo Bettini, PhD in Computer Science, DI, Univ. Torino
ICQ# lbetto, 16080134 (GNU/Linux User # 158233)
HOME: http://www.lorenzobettini.it MUSIC: http://www.purplesucker.com
http://www.myspace.com/supertrouperabba
BLOGS: http://tronprog.blogspot.com  http://longlivemusic.blogspot.com
http://www.gnu.org/software/src-highlite
http://www.gnu.org/software/gengetopt
http://www.gnu.org/software/gengen http://doublecpp.sourceforge.net

___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: beagle indexes and cache

2009-10-27 Thread D Bera
 does beagle cleans up its indexes and textcaches now and then?
 I mean it looks like old documents (now removed) still come up when
 searching...
 is there a way to clean indexes and textcache, or should I simply remove
 these directories in the .beagle folder now and then?
...

 I was worried, though, about files being indexed that are in a directory
 which is later marked as ignored with beagle-settings: will those files well
 be removed from the beagle database?

They would definitely not show up in search results - the search
results are filtered based on current availability and beagle-settings
ignore list.
As far as actually removing them from the database is concerned, I
don't remember off the top of my head how it is dealt with. Ideally,
the entries should be removed since the index (and esp. the textcache)
might retain information that the user doesnt want it to contain ...
but ignoring a directory involves traversing all files and directories
beneath it which soon creates a nightmare. I remember seeing some code
on dropping the contents of the ignored directories but I am can't
remember if it is complete or what exactly it does.

- dBera

-- 
-
Debajyoti Bera @ http://dtecht.blogspot.com
beagle / KDE fan
Mandriva / Inspiron-1100 user
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: beagle indexes and cache

2009-10-23 Thread Adam Tauno Williams
 does beagle cleans up its indexes and textcaches now and then?
 I mean it looks like old documents (now removed) still come up when 
 searching...
 is there a way to clean indexes and textcache, or should I simply remove 
 these directories in the .beagle folder now and then?

It should, but I have had the indexes get stale, especially if I worked
for awhile with the beagle service turned off.

It helps to occasionally stop the Beagle service, open a shell,  export
BEAGLE_EXERCISE_THE_DOG=1, run beagle-search (yes - from the shell!),
then start the service.  This will make the service index aggressively
as fast as it can;  bringing the index up to date.  I do this on my
laptop when I go to bed - then in the morning stop the service, close
the shell, run beagle-search from the GUI (GNOME-Do!) and restart the
service [now running in an environment without BEAGLE_EXERCISE_THE_DOG
set] so it is back to running as a low-priority background service.

You can also stop the service, rm -R ~/.beagle, export
BEAGLE_EXERCISE_THE_DOG=1, beagle-search, start the service - to
rebuild all your indexes from scratch as fast as it can.
-- 
openSUSE http://www.opensuse.org/en/
Linux for human beings who need to get things done.


signature.asc
Description: This is a digitally signed message part
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers