Re: Verity and Asian languages
Hi All, I've got a verity collection that I load periodically. Some of the data contains Unicode (English) characters. When they're retrieved and displayed, I get ? and other odd characters. Would using language=unicode on the cfindex tag take care of this? ~| Order the Adobe Coldfusion Anthology now! http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:341986 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm
RE: Verity and Asian languages
created using japanese? well that's the first problem. if your encoding is unicode then i think you need to use uni as the language when you build the collections. and yes question marks I see. I assumed the cfindex language attribute referred to the language being indexed, but apparently it refers to the character set, or at least the implied character set (i.e. Japanese implies Shift_JIS). Thanks for the advice. what do you mean by special intelligence to languages like Japanese? I assumed the stem and suggestions functionality in Verity would require an awareness of the language in question, which I assume the Unicode language wouldn't be able to provide. I realise these issues I've been having probably stem from Verity being a third party application, but I think the CF docs could be a bit clearer about what exactly a language is in Verity. ~| Find out how CFTicket can increase your company's customer support efficiency by 100% http://www.houseoffusion.com/banners/view.cfm?bannerid=49 Message: http://www.houseoffusion.com/lists.cfm/link=i:4:221158 Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4 Subscription: http://www.houseoffusion.com/lists.cfm/link=s:4 Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4 Donations Support: http://www.houseoffusion.com/tiny.cfm/54
Re: Verity and Asian languages
Matthew Walker wrote: wrong). So I've installed the Asian pack, created a collection and attempted to search it. The results returned are all question marks, created using japanese? well that's the first problem. if your encoding is unicode then i think you need to use uni as the language when you build the collections. and yes question marks are garbaged data due to encoding problems. boxes indicate rendering problems (text data is ok but the browser doesn't have a font to render that char). apply any special intelligence to languages like Japanese or does that only apply with the Japanese locale and the Shift JIS character set? The advantages of using Verity at all for Japanese are hard for me to see. what do you mean by special intelligence to languages like Japanese? ~| Logware (www.logware.us): a new and convenient web-based time tracking application. Start tracking and documenting hours spent on a project or with a client with Logware today. Try it for free with a 15 day trial account. http://www.houseoffusion.com/banners/view.cfm?bannerid=67 Message: http://www.houseoffusion.com/lists.cfm/link=i:4:220991 Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4 Subscription: http://www.houseoffusion.com/lists.cfm/link=s:4 Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4 Donations Support: http://www.houseoffusion.com/tiny.cfm/54
Verity and Asian languages
I've been having issues within CF7 with getting Verity to index and search Japanese. I note that you can download language packs from Macromedia (www.macromedia.com/go/verity http://www.macromedia.com/go/verity ) (the instructions for these btw specify the wrong file location, also the file sizes are completely wrong). So I've installed the Asian pack, created a collection and attempted to search it. The results returned are all question marks, indicating a character encoding issue to me. If I try cfcollection action=LIST... it looks like Verity is using the Shift_JIS character set. I never asked for that -- I'm using Unicode. I think the multilanguage pack includes support for the Unicode language -- leaving aside that Unicode isn't a language. I'll have to download that from home later as we pay through the nose for office bandwidth, so I haven't tested this out yet. Can anybody confirm that this is the way to search Unicode data with Verity? It seems to me that the Verity CF docs are in an appalling state with regard to languages. My second question then is, does Verity not apply any special intelligence to languages like Japanese or does that only apply with the Japanese locale and the Shift JIS character set? The advantages of using Verity at all for Japanese are hard for me to see. Advice anybody? ~| Logware (www.logware.us): a new and convenient web-based time tracking application. Start tracking and documenting hours spent on a project or with a client with Logware today. Try it for free with a 15 day trial account. http://www.houseoffusion.com/banners/view.cfm?bannerid=67 Message: http://www.houseoffusion.com/lists.cfm/link=i:4:220861 Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4 Subscription: http://www.houseoffusion.com/lists.cfm/link=s:4 Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4 Donations Support: http://www.houseoffusion.com/tiny.cfm/54