Richard Boulton <[EMAIL PROTECTED]> writes:
> Automatic language guessing is not actually that tricky to implement -
> there tend to be characteristic words which occur frequently only in a
> certain language, or infrequently in a certain language.
FWIW, here are two links that have lists of co
The other issue seems to be that we keep focusing on what google can
do with multi-language support, however, they only offer comprehensive
desktop searching in one language at a time (near as I can tell) We
forget that Google has millions of servers worldwide to handle the
processing, beagle has t
> HTML documents may carry a lang tag:
Yes. Thats good. The bad part is that this works only for HTML files.
Its easy for web-search engines because all (mostly) they index are
html files. Things are a little unclear in the desktop search area.
___
Dashb
On Tue, Nov 15, 2005 at 06:50:33PM +0100, [EMAIL PROTECTED] wrote:
> Do you know if search engines such as google has different code to
> support different languages?
I don't know about Google specifically, but certainly some engines process
text in different languages differently. Specifically,
Joe Shaw wrote:
[cut]
We need a strategy for how to determine the language of a document. I
don't think that just going with LANG is the right thing, because the
number of English documents a user will encounter on the Internet is
significant, and we'd really like those to be indexed correctly
Joe Shaw wrote:
> On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote:
> > the search algorithm is either biased for the English language or
> > something else is wrong:
>
> Beagle assumes English at present.
>
> This is actually kind of a hard problem to fix generally. Maybe we
> could ass
On Tue, Nov 15, 2005 at 11:39:25AM -0500, Joe Shaw wrote:
> Hi,
>
> On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote:
> >
> > the search algorithm is either biased for the english language or
> > something else is wrong:
>
> Beagle assumes English at present.
Isn't Beagle using stemmin
Hi,
On Tue, 2005-11-15 at 10:09 +0100, Bernhard Kleine wrote:
>
> the search algorithm is either biased for the english language or
> something else is wrong:
Beagle assumes English at present.
This is actually kind of a hard problem to fix generally. Maybe we
could assume a language based on
the search algorithm is either biased for the english language or
something else is wrong:
looking up "gene" shows
131 results and the following words in red since they were found:
gene, Gene, genes
(genes as the plural of gene, !!)
however, looking up "gen" (the german word for "gene") s
Am Montag, den 14.11.2005, 11:20 +0100 schrieb [EMAIL PROTECTED]:
> Bernhard Kleine wrote:
> > when searching for "retinoblastom" 4 results, for "retinoblastoma" 72
> > results, bug or feature??
> >
> > I would propose, this is a bug!
>
> Google does just the same. Would you submit a bug report t
Bernhard Kleine wrote:
> when searching for "retinoblastom" 4 results, for "retinoblastoma" 72
> results, bug or feature??
>
> I would propose, this is a bug!
Google does just the same. Would you submit a bug report to google?
Prueba el Nuevo Correo Terra; Seguro, RĂ¡pido, Fiable.
Hallo,
when searching for "retinoblastom" 4 results, for "retinoblastoma" 72
results, bug or feature??
I would propose, this is a bug!
Bernhard
___
Dashboard-hackers mailing list
Dashboard-hackers@gnome.org
http://mail.gnome.org/mailman/listinfo/dash
12 matches
Mail list logo