Re: searching for parts of words

2005-11-15 Thread Colin Marquardt
Richard Boulton <[EMAIL PROTECTED]> writes: > Automatic language guessing is not actually that tricky to implement - > there tend to be characteristic words which occur frequently only in a > certain language, or infrequently in a certain language. FWIW, here are two links that have lists of co

Re: searching for parts of words

2005-11-15 Thread Kevin Kubasik
The other issue seems to be that we keep focusing on what google can do with multi-language support, however, they only offer comprehensive desktop searching in one language at a time (near as I can tell) We forget that Google has millions of servers worldwide to handle the processing, beagle has t

Re: searching for parts of words

2005-11-15 Thread D Bera
> HTML documents may carry a lang tag: Yes. Thats good. The bad part is that this works only for HTML files. Its easy for web-search engines because all (mostly) they index are html files. Things are a little unclear in the desktop search area. ___ Dashb

Re: searching for parts of words

2005-11-15 Thread Richard Boulton
On Tue, Nov 15, 2005 at 06:50:33PM +0100, [EMAIL PROTECTED] wrote: > Do you know if search engines such as google has different code to > support different languages? I don't know about Google specifically, but certainly some engines process text in different languages differently. Specifically,

Re: searching for parts of words

2005-11-15 Thread Sean Carlos
Joe Shaw wrote: [cut] We need a strategy for how to determine the language of a document. I don't think that just going with LANG is the right thing, because the number of English documents a user will encounter on the Internet is significant, and we'd really like those to be indexed correctly

Re: searching for parts of words

2005-11-15 Thread [EMAIL PROTECTED]
Joe Shaw wrote: > On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote: > > the search algorithm is either biased for the English language or > > something else is wrong: > > Beagle assumes English at present. > > This is actually kind of a hard problem to fix generally. Maybe we > could ass

Re: searching for parts of words

2005-11-15 Thread Tomasz Torcz
On Tue, Nov 15, 2005 at 11:39:25AM -0500, Joe Shaw wrote: > Hi, > > On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote: > > > > the search algorithm is either biased for the english language or > > something else is wrong: > > Beagle assumes English at present. Isn't Beagle using stemmin

Re: searching for parts of words

2005-11-15 Thread Joe Shaw
Hi, On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote: > > the search algorithm is either biased for the english language or > something else is wrong: Beagle assumes English at present. This is actually kind of a hard problem to fix generally. Maybe we could assume a language based on

Re: searching for parts of words

2005-11-15 Thread Bernhard Kleine
the search algorithm is either biased for the english language or something else is wrong: looking up "gene" shows 131 results and the following words in red since they were found: gene, Gene, genes (genes as the plural of gene, !!) however, looking up "gen" (the german word for "gene") s

Re: searching for parts of words?

2005-11-14 Thread Bernhard Kleine
Am Montag, den 14.11.2005, 11:20 +0100 schrieb [EMAIL PROTECTED]: > Bernhard Kleine wrote: > > when searching for "retinoblastom" 4 results, for "retinoblastoma" 72 > > results, bug or feature?? > > > > I would propose, this is a bug! > > Google does just the same. Would you submit a bug report t

Re: searching for parts of words?

2005-11-14 Thread [EMAIL PROTECTED]
Bernhard Kleine wrote: > when searching for "retinoblastom" 4 results, for "retinoblastoma" 72 > results, bug or feature?? > > I would propose, this is a bug! Google does just the same. Would you submit a bug report to google? Prueba el Nuevo Correo Terra; Seguro, RĂ¡pido, Fiable.

searching for parts of words?

2005-11-13 Thread Bernhard Kleine
Hallo, when searching for "retinoblastom" 4 results, for "retinoblastoma" 72 results, bug or feature?? I would propose, this is a bug! Bernhard ___ Dashboard-hackers mailing list Dashboard-hackers@gnome.org http://mail.gnome.org/mailman/listinfo/dash