Hi,

I found stop word lists on the Snowball website. I will merge them into my
repository and can send a pull request.

http://snowball.tartarus.org/algorithms/
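
The stop word files there appear to use a simple text layout: words separated
by whitespace, with `|` starting a comment. A rough sketch of reading such a
list in Python and applying it together with a stemmer could look like the
following. The names parse_snowball_stop_words and index_terms and the SAMPLE
text are hypothetical, made up for illustration; this is not the code in the
branch, and the sample is not the real Danish list.

```python
def parse_snowball_stop_words(text):
    """Return the set of words in a Snowball-style stop word list.

    Assumed layout: words separated by whitespace, '|' starts a comment
    that runs to the end of the line.
    """
    words = set()
    for line in text.splitlines():
        # Drop the comment part, if any, then split what is left.
        content = line.split("|", 1)[0]
        words.update(word.lower() for word in content.split())
    return words


def index_terms(tokens, stop_words, stem=lambda word: word):
    """Lowercase tokens, skip stop words, and stem the remaining words."""
    terms = []
    for token in tokens:
        word = token.lower()
        if word in stop_words:
            continue
        terms.append(stem(word))
    return terms


# Small made-up sample in the assumed layout (not the real list).
SAMPLE = """
 | A tiny illustrative stop word sample
og           | and
i            | in
jeg          | I
"""

stop_words = parse_snowball_stop_words(SAMPLE)
print(sorted(stop_words))
print(index_terms(["Jeg", "spiser", "og", "sover"], stop_words))
```

The stem argument defaults to doing nothing so the sketch runs without extra
packages. With the snowballstemmer package from PyPI installed, passing
stem=snowballstemmer.stemmer('danish').stemWord should plug in a real stemmer.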

I am still waiting for feedback on the support for these languages, though :)
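
For anyone willing to try it, the option goes into the project's conf.py. A
minimal sketch, assuming a Danish project built with the add_stemmer branch;
'da' is only an example and can be swapped for any of the codes listed in the
quoted message:

```python
# conf.py (excerpt)
# Language used when building the HTML search index.
# 'da' (Danish) is only an example value.
html_search_language = 'da'
```

After rebuilding the HTML output, the search index should be generated with
the stemmer and stop words for that language.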

thanks,


2014/1/20 SHIBUKAWA Yoshiki <[email protected]>

> Hi all,
>
> I am adding support for more search languages to Sphinx. I took the stemming
> algorithms from Snowball, generated Python and JavaScript versions, and
> merged them into the following branch. However, I don't use these languages
> myself. If you are a native speaker of one of these languages or know it very
> well, please help by trying this feature and adding stop words for it.
>
> https://bitbucket.org/shibu/sphinx/branch/add_stemmer
>
> *What I need:*
>
> * Try the search language option
>    see:
> http://sphinx-doc.org/config.html?highlight=search#confval-html_search_language
> * Add stop words (pull request or send me the word list).
>
> *Added Languages:*
>
> * Danish(da)
> * Dutch(nl)
> * Finnish(fi)
> * French(fr)
> * German(de)
> * Hungarian(hu)
> * Italian(it)
> * Norwegian(no)
> * Portuguese(pt)
> * Romanian(ro)
> * Russian(ru)
> * Spanish(es)
> * Swedish(sv)
> * Turkish(tr)
>
> *References:*
>
> Stemming:
>     Stemming reduces inflected words to a common stem, so that users can
> find the documents they need more easily.
>     http://en.wikipedia.org/wiki/Word_stem
>
> Stop words:
>     These words are left out of the index. This reduces index size and noise
> and improves speed.
>     Sphinx specifies the following words as English stop words.
>
> https://bitbucket.org/birkenfeld/sphinx/src/5bf9b44bcd0903b9db510c234dd24d62792570e3/sphinx/search/en.py?at=default#cl-23
>
> Snowball:
>     Stemming algorithm collection. http://snowball.tartarus.org/index.php
>     Python Version: https://pypi.python.org/pypi/snowballstemmer
>     JS(JSX) Version: https://npmjs.org/package/snowball-stemmer.jsx
>
> thanks,
>
> --
> #! /usr/bin/python2
> def shibu(shibukawa, yoshiki):
>     web = "http://www.shibu.jp";
>     mail = "[email protected]"
>     twitter = "@shibukawa"
>     return "smile!"
>


