Aloha, Joe:

I'm not quite sure how your suggestion relates to the problem of endlines.

The unicode.txt file I have reads fine in Pages on the Mac.
It also loads just fine in Rev, with the exception of the line breaks.

I'm not sure where uniencode/unidecode could be used to solve the line break issue.

Sivakatirswami




Joe F. wrote:
The trick is to use uniencode/unidecode for everything.
Three separate examples:

 ask file "Name new file:" with "NewFile.xml"
 put "binfile:" & it into theNewFileName
 get the unicodetext of cd fld 1
 put unidecode(it,"utf8") into url (theNewFileName)
------------------------------------------
 set the unicodetext of cd fld 1 to uniencode(tMyUnicode,"utf8")
------------------------------------------
 put URL (theFTPRequest) into theDownLoadedText
 put uniencode(theDownLoadedText,"utf8") into theDownLoadedText
 put unidecode(theDownLoadedText,"ANSI") into cd fld 1 of cd id 4630



On Sep 4, 2009, at 1:39 AM, Sivakatirswami wrote:

I have some UTF-16 unicode raw text. If I import this into Pages, the font and the line breaks between paragraphs both display correctly.

But if I use this handler:

on mouseUp
answer file "Choose a unicode file to read in."
if it is empty then exit mouseUp
put "binfile:" & it into urlName
replace numtochar(13) with numtochar(10) in urlName
set the unicodeText of fld "display" to url urlName
end mouseUp

the line breaks are not appearing in the field in revolution.
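
One thing worth checking in the handler above: the replace statement operates on urlName, which holds the file path, so the file's contents reach the field unchanged. A minimal sketch of normalizing the data itself follows. It assumes the file is UTF-16 in the host byte order and that the line breaks are CR or CRLF. The variable names tRaw and tUTF8 are made up for illustration. It round-trips through UTF-8 because there a CR is always the single byte 13, so a byte-wise replace cannot split a two-byte character.

```livecode
on mouseUp
   answer file "Choose a unicode file to read in."
   if it is empty then exit mouseUp
   put "binfile:" & it into urlName
   -- read the raw UTF-16 bytes (assumed: host byte order)
   put url urlName into tRaw
   -- convert to UTF-8 so that a CR is the single byte 13
   put unidecode(tRaw, "utf8") into tUTF8
   -- normalize CRLF and bare CR to the linefeed that Rev uses internally
   replace crlf with lf in tUTF8
   replace numtochar(13) with lf in tUTF8
   -- convert back to UTF-16 for the field
   set the unicodeText of fld "display" to uniencode(tUTF8, "utf8")
end mouseUp
```

If the file marks paragraphs with something other than CR or CRLF (for instance the Unicode line separator U+2028), this sketch would leave those characters untouched.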

Also if I try to analyze what chars are there, where the line break should be, by selecting across a missing line break and then use this test:

on mouseUp
 set the useUnicode to true
 if the selection is empty then
    answer "No Selection" with "ok"
 end if
 put the selection into tUnicode
 put tUnicode
 repeat for each char x in tUnicode
    put chartonum(x) & cr after tOutput
 end repeat
 put tOutput # returns empty
end mouseUp

I get nothing in the msg box. If I switch from Tamil Inaimathi (Mac unicode font) to Anjal Chittu unicode, the Tamil displays correctly and now I can clearly see a two-byte block, space+square-box-symbol (somewhat transparent), where the line breaks should be. But the test still returns empty when I try to determine what the bytes are....

How do we correctly import line breaks in unicode text in Revolution?

I plan to create an editing environment as a revlet for online work... and unicode will be imported and exported freely for later use in InDesign. Obviously CRLF (or whatever it is in Unicode) needs to be preserved.

Thanks!
Sivakatirswami
"completely Unicode newbie"




_______________________________________________
use-revolution mailing list
use-revolution@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution
