Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Mark Gaiser
On Thu, Oct 16, 2014 at 1:20 PM, Vishesh Handa wrote: > Hey guys > > While Baloo performs better than Nepomuk. It does have its share of problems > - mostly large text files, and high IO usage. Additionally, users on linux > often seem to have the craziest files. Currently, we do not index plain t

Re: Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Martin Gräßlin
On Friday 17 October 2014 18:24:48 Vishesh Handa wrote: > On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin wrote: > > the txt being genome data doesn't surprise me[1], but I find it sad that > > now > > txt is disabled by default (I use them quite a lot for blog posts). As > > genome > > data is re

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Rex Dieter
Vishesh Handa wrote: > ... Instead, we could only index - > > * $HOME - Not including any subfolders. > * Desktop, Documents, Videos, Pictures and Music. All of these are xdg > user directories. +1 Yes, please! -- Rex ___ Plasma-devel mailing list Pl

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Eike Hein
On 17.10.2014 18:24, Vishesh Handa wrote: About gibberish. It's hard to figure out what gibberish is. I think I'll add some code that we only index the first 20 characters of each word. That should help to a certain extent. Define "word" - Chinese and Japanese (unless mostly kana) often don't

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Vishesh Handa
On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin wrote: > the txt being genome data doesn't surprise me[1], but I find it sad that > now > txt is disabled by default (I use them quite a lot for blog posts). As > genome > data is really huge wouldn't it make sense to go rather for file size or > ab

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Vishesh Handa
On Fri, Oct 17, 2014 at 2:11 AM, Ömer Fadıl USTA wrote: > Couldnt we add a .baloo file to specify for attributes for current > directory, file or subdirectories about not indexing ? > For example of a .baloo file > skip_all > skip_if_greater 1m > skip_if_smaller 50k > skip_ext txt jpg > With --su

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Todd Rme
On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin wrote: > On Thursday 16 October 2014 13:20:57 Vishesh Handa wrote: >> Hey guys >> >> While Baloo performs better than Nepomuk. It does have its share of >> problems - mostly large text files, and high IO usage. Additionally, users >> on linux often s

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Ömer Fadıl USTA
Couldnt we add a .baloo file to specify for attributes for current directory, file or subdirectories about not indexing ? For example of a .baloo file skip_all skip_if_greater 1m skip_if_smaller 50k skip_ext txt jpg With --subdirs flag like Skip_ext --subdirs txt jpg Skip_all --subdirs On Oct 16,

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Weng Xuetian
As for text file, in linux world people don't usually use .txt extension, especially when writing something like vimwiki or something similar. I guess cap the size is some what better solution ( 1-5MB is good enough). And as for folder limitation, that doesn't sound good, people usually organze f

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 14:20:06 schrieb Luca Beltrame: > In data giovedì 16 ottobre 2014 14:15:15, Martin Gräßlin ha scritto: > > genome data is really huge wouldn't it make sense to go rather for file > > size or abort the indexing if it's obvious random gibberish? > > As the person who

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Luca Beltrame
In data giovedì 16 ottobre 2014 14:15:15, Martin Gräßlin ha scritto: > genome data is really huge wouldn't it make sense to go rather for file > size or abort the indexing if it's obvious random gibberish? As the person who mentioned this first (hey, I'm famous ;), I'm guessing that limiting on

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Gräßlin
On Thursday 16 October 2014 13:20:57 Vishesh Handa wrote: > Hey guys > > While Baloo performs better than Nepomuk. It does have its share of > problems - mostly large text files, and high IO usage. Additionally, users > on linux often seem to have the craziest files. Currently, we do not index > p

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Eike Hein
On 16.10.2014 13:20, Vishesh Handa wrote: Comments? I understand the pragmatic motivation behind it, but it seems like a strange step to me. The idea behind indexing is that you can find things regardless of location, so you don't need to be aware of where things are. By making the index sele

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Kai Uwe Broulik
Hi, > * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user > directories. The only reason I actually index my kf5 folder with all the git clones in it, is because Dolphin doesn't properly fallback when searching non-indexed locations and then it won't find anything (in fa

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 13:27:02 schrieb Marco Martin: > On Thursday 16 October 2014, Vishesh Handa wrote: > > * $HOME - Not including any subfolders. > > * Desktop, Documents, Videos, Pictures and Music. All of these are xdg > > user > > directories. > > > > Gnome Tracker actually does s

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 13:20:57 schrieb Vishesh Handa: > Hey guys Hi Vishesh, > While Baloo performs better than Nepomuk. It does have its share of > problems - mostly large text files, and high IO usage. Additionally, users > on linux often seem to have the craziest files. Currently, w

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread David Edmundson
Seems a shame to do this now that things are working so well. Since the .txt only change Baloo hasn't bothered me at all. David ___ Plasma-devel mailing list Plasma-devel@kde.org https://mail.kde.org/mailman/listinfo/plasma-devel

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Marco Martin
On Thursday 16 October 2014, Vishesh Handa wrote: > * $HOME - Not including any subfolders. > * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user > directories. > > Gnome Tracker actually does something quite similar. > > Comments? +1 i tend to prefer a whitelist on what