[MarkLogic Dev General] lib-parser

2008-10-08 Thread Shannon
Hi, does anyone know whether lib-parser supports language-aware tokenization, specifically in lp:get-cts-query? Thanks, __ Shannon Scott Shiflett, programmer/analyst with ROTUNDA, The University of Virginia Press, Charlottesville, VA USA

Re: [MarkLogic Dev General] lib-parser

2008-10-08 Thread Michael Blakeley
Today, lib-parser calls cts:tokenize() without the language argument, so it always uses the database default language. So the tokenization is language-aware, but there's no per-query control over which language it uses. If per-query control over language awareness would be useful, how would y
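
For readers following along, a minimal sketch of the two call forms of cts:tokenize() that this message contrasts, assuming a MarkLogic server and the 1.0-ml XQuery dialect. The sample string and the "fr" language code are illustrative assumptions and are not taken from lib-parser.

```xquery
xquery version "1.0-ml";

(: Illustrative input only; not taken from lib-parser. :)
let $text := "l'histoire de la Virginie"
return (
  (: One-argument form: tokenizes with the database default language. :)
  <default-language>{ count(cts:tokenize($text)) }</default-language>,
  (: Two-argument form: the caller names the language explicitly. :)
  <explicit-french>{ count(cts:tokenize($text, "fr")) }</explicit-french>
)
```

Whether the two token counts differ depends on the database default language and on which language licenses are installed, so no expected output is given here.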

Re: [MarkLogic Dev General] lib-parser

2008-10-09 Thread Shannon
Thank you, Mike--that's so very agreeable--yes, per-query control over language awareness would be most useful! Given a form that accepts a query string input and a language selector that includes an "all" option, the desired behavior is language-specific tokenization, in this case, for English

Re: [MarkLogic Dev General] lib-parser

2008-10-09 Thread Michael Blakeley
Shannon, Hmm... I think we may be talking at cross-purposes. As I mentioned yesterday, I'm a little concerned about maintaining a distinction between cts:query term-level language, vs the language passed to cts:tokenize() in lp:get-cts-query-element(). When I mentioned the idea of adding ano
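
The distinction drawn in this message can be sketched as follows, assuming the 1.0-ml dialect. This does not reproduce what lp:get-cts-query-element() does internally; the input word and the "fr" code are hypothetical, chosen only to show the two separate places a language can be specified.

```xquery
xquery version "1.0-ml";

(: Hypothetical user input. :)
let $input := "histoire"
(: Term-level language: carried by the query itself via the "lang=" option. :)
let $query := cts:word-query($input, "lang=fr")
(: Tokenization language: affects only how the input string is split. :)
let $tokens := cts:tokenize($input, "fr")
return ($query, $tokens)
```

The two settings are independent: a parser could tokenize in one language and still build term queries tagged with another, which is the kind of mismatch the message raises as a concern.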

Re: [MarkLogic Dev General] lib-parser

2008-10-10 Thread Shannon
Good morning, Mike, thanks for the free consulting :) I agree, in the case of English and French, I don't think I need to be concerned with the tokenizing. I may end up submitting an RFE on this (something similar to the thesaurus expansion API sounds like a good approach to me) since fu