Ted, It's a good idea to prepare a new release including use locale.
I'll check my Perl version and also the documentation. Maybe, I'll find a solution to work with Catalan corpus. Thanks for all your suggestions, Mercè > Thanks very much, I'm glad to hear that use locale has helped with > most of your problems. This entire episode has convinced me that > rather than waiting for the next release of NSP, I am going to go > ahead and do a release that simply includes use locale. I'll do that > this week, which I know doesn't help you but will hopefully help > others avoid a similar problem. > > Regarding your issue with "la geminada", that might be a more general > Perl locale issue. You might want to check and make sure you are > running a fairly current version of Perl, and scout around some of the > Perl documentation to see if this has come up in a more general > context. > > Cordially, > Ted > > On Thu, Feb 14, 2008 at 3:26 AM, mercevg <[EMAIL PROTECTED]> wrote: > > > > > > > > > > > > > > Patrick, Ted, > > > > I added "use locale;" in line 83 but this can't improve my results: > > words containing the character "l·l" (like "intel·ligència")are not > > included in the results list. > > > > But it is important to say that I add as a tokens all accents, > > diaeresis and apostrophes that are used in Catalan corpus and I have > > had a good results. I think it's the solution for this kind of > > characters, except for the "l·l" ("l geminada"). > > > > Best regards, > > Mercè > > > > > > > > Greetings all, > > > > > > Thanks for the very interesting discussion. This is quite helpful. > > > > > > Just a short note to confirm that we have not yet added the > > > > > > add locale; > > > > > > directive to NSP - we haven't had a release in some time, but this > > will surely > > > be included when we do. I am thinking it might not be a bad idea to > > have a > > > release simply to take care of this. Thanks to Patrick for pointing > > this out > > > in the first place, and then reminding us of that earlier discussion. > > > > > > I would be very interested to know if this resolves the problems > > with Catalan, > > > French, Spanish, btw. Please do update us and the rest of the list, as > > > I suspect > > > this is a fairly common problem. > > > > > > Cordially, > > > Ted > > > > > > On Feb 13, 2008 11:07 AM, mercevg <mercevg@> wrote: > > > > > > > > > > > > > > > > Patrick, > > > > > > > > I have checked the latest version of NSP (v.1.03) and count.pl > > doesn't > > > > contain "use locale;". I'll try to add "use locale;" in line 83, > > maybe > > > > your suggestion it's my solution. > > > > > > > > More or less we have the same problems with accents and other kind of > > > > characters working with French and Catalan or Spanish. > > > > > > > > > > > > Thank you very much! > > > > > > > > Mercè > > > > > > > > > > > > > > Mercè, > > > > > > > > > > I have not checked the latest version of NSP to see if count.pl > > and the > > > > > other files contain "use locale;" as I suggested some time ago. The > > > > > simple inclusion of such a statement at the beginning of the Perl > > > > > scripts fixed the problems I had for French. You can have a look at > > > > this > > > > > for more information : > > > > > > > > > > http://tech.groups.yahoo.com/group/ngram/message/159 > > > > > > > > > > Hope this helps... > > > > > > > > > > Regards, > > > > > Patrick > > > > > > > > > > > > > > > > > > -- > > > Ted Pedersen > > > http://www.d.umn.edu/~tpederse > > > > > > > > > > > -- > Ted Pedersen > http://www.d.umn.edu/~tpederse >