It should work, but if you want to handle multiple languages in ONE index
you end up with a lot of filters and fields handled with different analyzers
in a SINGLE configuration.

On Wed, Oct 15, 2008 at 3:03 PM, sunnyfr <[EMAIL PROTECTED]> wrote:

>
> But about stopwords and stemming, is it a real issue if on one core I've
> several stemming and stopwords(with a different name), it should work?
>
>
>
> Hannes Carl Meyer-2 wrote:
> >
> > Hi,
> >
> > yes, if you don't handle (stopwords, stemming etc.) a specific language
> > you
> > should create a general core.
> >
> > In my project I'm supporting 10 languages and if I get unsupported
> > languages
> > it is going to be logged and discarded right away!
> >
> > Boosting on multiple cores is indeed a problem. An idea would be to merge
> > the result sets from core0 and core1 and sort by scoring?
> >
> > Regards
> >
> > Hannes
> >
> > On Wed, Oct 15, 2008 at 1:50 PM, sunnyfr <[EMAIL PROTECTED]> wrote:
> >
> >>
> >>
> >> ok MultiCore is handy indeed to don't have this big index wich manage
> >> every
> >> language,
> >> but when you have one modification to do you have to do it on all of
> >> them.
> >>
> >> And the point as well is it's complicate too boost more one language
> than
> >> another one,
> >> ie with an Italian search video, if we don't have that much video then
> it
> >> might be more interesting to bring back english one.
> >>
> >> And if there is some language like Slovakia which are not managed by the
> >> website but people can come from there ... so the video will be stored
> in
> >> core0 which will be all language which are not english, spanish, germany
> >> ..
> >> french.
> >> so this kind of garbage core for every language which are not managed
> ...
> >> and I think it might be hard to manage.
> >>
> >> What do you think?
> >>
> >>
> >>
> >> Hannes Carl Meyer-2 wrote:
> >> >
> >> > I attached an example for you.
> >> >
> >> > The challenge with MultiCore is on the client's search logic. It would
> >> > help
> >> > if you know which language the person wants to search through. If not
> >> you
> >> > would have to perform multiple requests to the multiple cores.
> Ordinary
> >> > logic would be:
> >> >
> >> > 1. search "chien" in core0 (english)
> >> > 2. if #1 returned zero results search for "chien" in core1 (french)
> >> >
> >> > ---
> >> >
> >> > In your client you could even parallelize the requests to minimize
> >> waiting
> >> > time.
> >> >
> >> > *One feature I didn't try yet is the DistributedSearch (and how it
> will
> >> > help
> >> > with multiple cores)*, find it here:
> >> > http://wiki.apache.org/solr/DistributedSearch
> >> >
> >> > Regards,
> >> >
> >> > Hannes
> >> >
> >> > On Tue, Oct 14, 2008 at 4:26 PM, sunnyfr <[EMAIL PROTECTED]>
> wrote:
> >> >
> >> >>
> >> >> Thanks for this explanation, but just to get it properly :
> >> >>
> >> >> One core per language, so with the same field and schema just the
> >> >> language
> >> >> part and management which is different?
> >> >> and one core which consider every language which are not managed by
> >> solr
> >> >> like russian or ???
> >> >> so different request to the dabase....
> >> >> ok
> >> >>
> >> >> Just don't get really when you look for the word 'chien' on the
> >> english
> >> >> website I want get back result from french video because chien is
> >> french
> >> >> so
> >> >> if it doesn't find any english video with chien I need my french
> video
> >> >> then.
> >> >>
> >> >> Exactly the same for user's core, if somebody look for 'chien' and
> >> there
> >> >> is
> >> >> one user with exactly the same username I would like to show it up.
> >> >>
> >> >> thanks for your time, really,
> >> >>
> >> >>
> >> >>
> >> >> John E. McBride wrote:
> >> >> >
> >> >> > Fairly nebulous requirements, but I recently was involved in a
> >> >> > multilingual search platform.
> >> >> >
> >> >> > The approach, translated to solr 1.3 would be to use multicore -
> one
> >> >> > core per geography.  Then a schema.xml per core, each with a
> >> different
> >> >> > language in the porter algorithm, stopwords etc - taken from
> >> snowball.
> >> >> >
> >> >> > Then on the german front end you make requests to the de core, on
> >> the
> >> >> > english front end make requests to the english core.
> >> >> >
> >> >> > This is much simpler than sorting every language in the one index,
> >> for
> >> >> > example german queries will need to be run through the german query
> >> >> > filters etc.  If you have all languages in one schema, then you
> will
> >> >> > have to do some front end logic to map the query to the correct
> >> field.
> >> >> >
> >> >> > You have failed to consider internationalisation of the query side
> >> of
> >> >> > the process - your field type merely have analysis filters.
> >> >> >
> >> >> > Additionally, if the data source for each different geography is
> >> >> > different it makes sense to separate the indexes and subsequently
> >> the
> >> >> > ingestion mechanisms and schedules.
> >> >> >
> >> >> > Just a few thoughts.
> >> >> >
> >> >> > John
> >> >> >
> >> >> > sunnyfr wrote:
> >> >> >> Hi,
> >> >> >>
> >> >> >> I would like to manage properly multi language search motor,
> >> >> >> I would like your advice about what have I done.
> >> >> >>
> >> >> >> Solr1.3
> >> >> >> tomcat55
> >> >> >>
> >> >> >> http://www.nabble.com/file/p19954805/schema.xml schema.xml
> >> >> >>
> >> >> >> Thanks a lot,
> >> >> >>
> >> >> >>
> >> >> >
> >> >> >
> >> >> >
> >> >>
> >> >> --
> >> >> View this message in context:
> >> >>
> >>
> http://www.nabble.com/Multi-language-solr1.3-what-would-you-reckon--tp19954805p19974618.html
> >> >> Sent from the Solr - User mailing list archive at Nabble.com.
> >> >>
> >> >>
> >> >
> >> > Solr1.3 MultiCore Scenario
> >> >
> >> > core0 (french)                core1 (english)         ...
> >> core8 (russian)
> >> > |schema.xml                   schema.xml
> >>      schema.xml
> >> > |- analyzers          |- analyzers                            |-
> >> analyzers
> >> > |-- FrenchAnalyzer    |-- EnglishAnalyzer                     |--
> >> RussianAnalyzer
> >> > |-- FrenchStops               |-- EnglishStops
> >> |--
> >> RussianStops
> >> > |- fields                     |- fields
> >>     |- fields
> >> > |-- title                     |-- title
> >>     |-- title
> >> > |-- description               |-- description
> >> |--
> >> description
> >> > |-- id                                |-- id
> >>              |-- id
> >> >
> >>
> >> --
> >> View this message in context:
> >>
> http://www.nabble.com/Multi-language-solr1.3-what-would-you-reckon--tp19954805p19991949.html
> >> Sent from the Solr - User mailing list archive at Nabble.com.
> >>
> >>
> >
> >
>
> --
> View this message in context:
> http://www.nabble.com/Multi-language-solr1.3-what-would-you-reckon--tp19954805p19993036.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>

Reply via email to