Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Ya'aqov Ziso
Till, Bess, Ralph, >> assuming the algorithm for enriching the spellings of a word (from PinYin to Chinese) exists, will the result include both forms, PinYin AND Non-PinYin Chinese transliteration and BOTH forms will be indexed? >> the principle of indexing different forms for spelling a vertain

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Till Kinstler
Bess Sadler schrieb: So, thoughts? Anyone know more about this than I do and want to speak up? I'd second Demian's and Jonathan's statements: Do that in Solr by using a Filter (either at indexing or search time). You want to solve that using an algorithm that translates american transcriptio

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Jonathan Rochkind
I wonder if you could do it in Solr itself, instead of SolrMarc? As an input filter (I forget the exact technical language for this, but there's something you can set to do this task in Solr); which you don't need to get it built into the Solr distro to do, Solr allows you to have your own cus

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Demian Katz
> What's the right place for such a piece of code? Solrmarc seems the > obvious place to me. As it has been described to me so far, this > doesn't seem like an issue affecting people outside the library realm, > which makes it seem too niche and community-specific to get it built > into the lucene

[CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Bess Sadler
By sending this here I hope I'm going to hit everyone on the blacklight, vufind, and solrmarc mailing lists, and maybe some other interested parties. Our East Asian Languages Librarian has approached me with a problem he wants to see solved. According to him, the typical North American li