Re: GSoC Weekly Report

2007-10-02 Thread Daniel Naber
On Tuesday 02 October 2007 19:13, you wrote:

 Thinking quickly, one way to do this would be to add an option to
 query to specify the language.

That's a nice option, but the default should be to search all languages I 
think. People are used to just type a word without setting another option.

Regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: Find identical files

2007-02-12 Thread Daniel Naber
On Monday 12 February 2007 21:41, Joe Shaw wrote:

 There's no way to do this right now, but it's a good idea.  We don't do
 anything with md5sums right now because that's an expensive operation
 that can take quite a while on large files.

Java Lucene actually has a similarity search in its contrib section. It 
doesn't only find duplicates but also similar files which might also be 
useful. I don't know if this has been ported to .NET.

Regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: Beagle Wiki Update

2006-11-08 Thread Daniel Naber
On Wednesday 08 November 2006 12:59, Carl van Tonder wrote:

  it should really be
 'Venice' not 'venice' - I know Beagle it self is case-insensitive but I
 think it makes it look less professional if you have an 'error' like
 that on the front page.

The case-insensivity is a feature, so I see no reason to use proper 
uppercase spelling here.

Regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: RoadMap for next Beagle releases

2006-09-21 Thread Daniel Naber
On Thursday 21 September 2006 19:40, Adam T. Gautier wrote:

 I thought it was because it needed to be transfered across the network?
  I also thought that Lucene queried the index in the DB why would it
 need one disk access per document?

Because the documents, i.e. the stored fields like e.g. title are 
usually scattered all over the disk. If you only need the IDs, those disk 
accesses are not required. But as you usually want to display something 
and the IDs are not stable (they can change when documents are added etc) 
the ID is usually not enough. But that is the Lucene point-of-view, I'm 
not familiar with with the layers added by Beagle.

Regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: Matching partial strings?

2006-06-27 Thread Daniel Naber
On Dienstag 27 Juni 2006 04:24, Adrian Sai-wah Tam wrote:

 Actually, I agree that having partial string match is difficult (in
 terms of computational complexity) and not possible due to performance
 reasons.

Lucene can do these kind of queries, using the WildcardQuery class. Just 
note that at least the Java version of Lucene doesn't accept * as the 
first character when using QueryParser, but creating a WildcardQuery like 
*foo* using the API works. However, this is slow. Faster solutions have 
been discussed on the Lucene mailing list, using a special way to rotate 
terms.

regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers


Re: Is Beagle indexing my email?

2005-05-18 Thread Daniel Naber
On Wednesday 18 May 2005 04:58, Joe Shaw wrote:

 A big problem with handling Maildir files outside of Evo is that I'm not
 aware of apps that know how to open the mails from a file URI. (Evo
 doesn't.)

KMail can do it like this:
kmail --view url
Where url is the path to the maildir file.

Regards
 Daniel

-- 
http://www.danielnaber.de
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dashboard-hackers