Re: [XeTeX] traditional to simplified Chinese character conversion utility or data base
2011/10/17 Daniel Greenhoe dgreen...@gmail.com: I know that this is not really the right mailing list for this question, but I have so far not found the answer by any other means ... I would like to find or write some a utility that would take an unicode encoded file and map Chinese traditional characters to simplified, while leaving all other code points (such as those in the Latin and IPA code spaces) untouched. For example, the traditional character for horse (馬) is at unicode U+99AC, the simplified one (马) is at unicode U+9A6C, and the Latin character for A is at U+0041. So I want a utility that would change the 99AC to 9A6C, but leave the 0041 unchanged. If it is really that simple 1:1 mapping, you can just use tr, it does exactly that if you supply the map. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. Does anyone know of such a utility? Does anyone know of any data base with a traditional to simplified character mapping such that I could maybe write the utility myself? Many thanks in advance, Dan -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Zdeněk Wagner http://hroch486.icpf.cas.cz/wagner/ http://icebearsoft.euweb.cz -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
[XeTeX] How to manually create the xelatex.fmt?
Hi all; I need to generate the xelatex.fmt file. Apparently Fedora doesn't create these files. It is not a new issue, I have had issues with the latex.fmt files not created in the past. Is there any way to manually create this file? Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] Always bold math strange unicode-math behaviour
Am Mon, 17 Oct 2011 17:33:05 +0200 schrieb Tobias Schoel: Is there a workaround or do I need to type \mathbf{} at the beginning of each \(\) myself? You shouldn't use \mathbf to make large part of an equation bold. \mathbf is meant for single symbols and only for things like numbers and characters. In unicode math it works by mapping them to other unicode position. E.g. a \mathbf{0} is the MATHEMATICAL BOLD DIGIT ZERO character at position U+1D7CE. OK. I wasn't used to this when using normal latex. In normal latex the situation is similar: \mathbf prints its argument in a specific (bold) font and normally can/should be used only for characters and numbers. If you want a bold math font you should at best use a font which has a bold face. Is there any good unicode-math font with a bold face (Asana Math and XITS Math don't work)? I do have a bold version of XITS and of STIX. But both font doesn't work flawlessly on my machine here - neither with xelatex, nor with lualatex (I don't know if it is a problem of the fonts or of my version of unicode-math/fontspec). \documentclass{article} \usepackage{unicode-math} % \setmathfont{STIXGeneral} \setmathfont[version=bold]{STIXGeneral Bold} \setmathfont[version=xitsnormal]{XITS} \setmathfont[version=xitsbold]{XITS Bold} \setmathfont[version=Asana]{Asana Math} \begin{document} STIX $\uppi\sqrt{\frac{\uppi r^2}{4}}$ \mathversion{bold} $\uppi\sqrt{\frac{\uppi r^2}{4}}$ \bigskip xitsnormal \mathversion{xitsnormal} $\uppi\sqrt{\frac{\uppi r^2}{4}}$ \mathversion{xitsbold} $\uppi\sqrt{\frac{\uppi r^2}{4}}$ \bigskip Asana \mathversion{Asana} $\uppi\sqrt{\frac{\uppi r^2}{4}}$ \end{document} -- Ulrike Fischer -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
2011/10/18 Chris Travers chris.trav...@gmail.com: Hi all; I need to generate the xelatex.fmt file. Apparently Fedora doesn't create these files. It is not a new issue, I have had issues with the latex.fmt files not created in the past. Is there any way to manually create this file? Certainly there is but it would rather be a question for Fedora forum. Although I use Fedora myself, I do not use its TeX but I install TeX Live. That's why I do not know how it is packaged in Fedora. Does Fedora contain fmtutil and fmtutil-sys? Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Zdeněk Wagner http://hroch486.icpf.cas.cz/wagner/ http://icebearsoft.euweb.cz -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Am 18.10.2011 um 10:30 schrieb Chris Travers: Is there any way to manually create this file? TeX Live comes with a utility, fmtutil-sys. It allows to create FMT files for the *system*. Private FMT files can be created with fmtutil. The former is kind of a wrapper for the latter, so the only man page exists for the latter. The script also gives some hints when invoked with --help. It's also possible to create a local patch file for the fmtutil configure file, /usr/local/texlive/texmf-local/web2c/fmtutil-local.cnf. To apply that patch you'll need to run tlmgr and make it regenerate the configure files. This procedure is described in some top level TeX Live documentation, maybe doc.html. -- Greetings Pete No man was ever taken to hell by a woman unless he already had a ticket in his pocket, or at least had been fooling around with timetables. – Archie Goodwin -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
There are two ways to create xelatex.fmt: 1) xetex -ini -jobname=xelatex -progname=xelatex -etex xelatex.ini 2) fmtutil --byfmt xelatex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Vafa Khalighi wrote: There are two ways to create xelatex.fmt: 1) xetex -ini -jobname=xelatex -progname=xelatex -etex xelatex.ini What does this accomplish that xetex -ini -etex xelatex.ini does not ? Philip Taylor -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
This has all been very helpful. At least I have things narrowed down a bit here: # fmtutil-sys --byfmt xelatex Defining UNIX/DOS style filename parser. catcodes, registers, compatibility for TeX 2, parameters, !! ! You are attempting to make a LaTeX format from a source file ! That is more than five years old. ! ! If you enter return to scroll past this message then the format ! will be built, but please consider obtaining newer source files ! before continuing to build LaTeX. !! ! LaTeX source files more than 5 years old!. l.545 ...aTeX source files more than 5 years old!} ? ! Emergency stop. l.545 ...aTeX source files more than 5 years old!} No pages of output. Transcript written on xelatex.log. Error: `xetex -ini -jobname=xelatex -progname=xelatex -etex xelatex.ini' failed ### fmtutil: Error! Not all formats have been built successfully. Visit the log files in directory /var/lib/texmf/web2c for details. ### This is a summary of all `failed' messages and warnings: `xetex -ini -jobname=xelatex -progname=xelatex -etex xelatex.ini' failed Any idea of what I do about this? Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Quoting Chris Travers (chris.trav...@gmail.com): ! LaTeX source files more than 5 years old!. Any idea of what I do about this? I did not follow the thread closely. Are you the administrator of the system? If so, I'd advise to de-install fedora's TeX-suite and install texlive instead. That at least is what I did with my openSUSE boxes. Hope that helps, Susan -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
On Tue, Oct 18, 2011 at 6:07 AM, Susan Dittmar susan.ditt...@gmx.de wrote: Quoting Chris Travers (chris.trav...@gmail.com): ! LaTeX source files more than 5 years old!. Any idea of what I do about this? I did not follow the thread closely. Are you the administrator of the system? If so, I'd advise to de-install fedora's TeX-suite and install texlive instead. That at least is what I did with my openSUSE boxes. I am the administrator on this system but the software is that uses this (an open source accounting program) is to be shipped out ideally in .rpm and .deb format. It would really help if I don't require most users who want to generate PDF invoices in multiple languages to go download large external dependencies. This current version is supposed to support, for example RHEL 5 and higher, among other distros. Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Am Tue, 18 Oct 2011 05:43:57 -0700 schrieb Chris Travers: This has all been very helpful. At least I have things narrowed down a bit here: # fmtutil-sys --byfmt xelatex ! LaTeX source files more than 5 years old!. l.545 ...aTeX source files more than 5 years old!} Any idea of what I do about this? The best is to get and install a new TeXLive 2011 with newer latex sources. You can also try to fool latex by changing your pc date. -- Ulrike Fischer -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Hi. I appear to have solved this by running xelatex -ini manually and then copying the ..fmt file to the appropriate directory. Thanks for everyone's help. As a note, I am really restricted to supporting TexLive versions that ship on stable long-term-support (and supported) distros We can argue about whether these distros are too shy about upgrades, but users don't like to hear that their shiny rpm or .deb requires that they also track down large dependencies from external sources not in any repository for their distro. Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
On Tue, Oct 18, 2011 at 10:24 AM, Chris Travers chris.trav...@gmail.com wrote: Hi. I appear to have solved this by running xelatex -ini manually and then copying the ..fmt file to the appropriate directory. Thanks for everyone's help. As a note, I am really restricted to supporting TexLive versions that ship on stable long-term-support (and supported) distros We can argue about whether these distros are too shy about upgrades, but users don't like to hear that their shiny rpm or .deb requires that they also track down large dependencies from external sources not in any repository for their distro. Users also don't like to discover that the publishers' LaTeX format they need won't work with the distro TeX, or that a document that formats correctly on a co-author's Mac or Windows system won't format on their linux system. The TeX ecosystem needs some reasonable limits on how long old versions should be supported. If users can't get adequate support from their distro there are better supported alternatives. -- George N. White III aa...@chebucto.ns.ca Head of St. Margarets Bay, Nova Scotia -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Users also don't like to discover that the publishers' LaTeX format they need won't work with the distro TeX, or that a document that formats correctly on a co-author's Mac or Windows system won't format on their linux system. The TeX ecosystem needs some reasonable limits on how long old versions should be supported. If users can't get adequate support from their distro there are better supported alternatives. Given that server software that I work with usually has at least a five year support cycle, what are those reasonable limits? Or is XeTeX not intended to be used in these environments? Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Am Tue, 18 Oct 2011 06:43:59 -0700 schrieb Chris Travers: Users also don't like to discover that the publishers' LaTeX format they need won't work with the distro TeX, or that a document that formats correctly on a co-author's Mac or Windows system won't format on their linux system. The TeX ecosystem needs some reasonable limits on how long old versions should be supported. If users can't get adequate support from their distro there are better supported alternatives. Given that server software that I work with usually has at least a five year support cycle, what are those reasonable limits? Well you error message said ! LaTeX source files more than 5 years old!. So the limit is five years (but only for the latex kernel). The version date of my (current) latex.ltx ist \edef\fmtversion{2011/06/27} Or is XeTeX not intended to be used in these environments? I would say that if your latex is more than five years old, your xetex binaries and packages aren't up-to-date either. And as xetex is rather young this can be quite a problem. Regardless if you want to ship out only xetex documents or xetex documents + binaries: You should be aware that other people can have up-to-date systems and so you should make tests on such systems too (and just in case you don't know: you can't use a fmt generated by one xetex version with another xetex version). -- Ulrike Fischer -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
On Tue, Oct 18, 2011 at 10:53 AM, msk...@ansuz.sooke.bc.ca wrote: On Tue, 18 Oct 2011, George N. White III wrote: Users also don't like to discover that the publishers' LaTeX format they need won't work with the distro TeX, or that a document that formats correctly on a co-author's Mac or Windows system won't format on their linux system. That seems to me to be a reason to *continue* support for older versions, not a reason to *end* it. I don't understand how you got from the above to the next thing you wrote: No -- newer versions have to deal with changes to external interfaces (fonts, image formats, library versions, etc.) so end up adding extra code to check for old versions and work around limitations, or do without some desirable features that can't be implemented on the older version. It takes real work to support older versions and the options to make improvements are constrained. Compromises are needed to live within the hardware limitations of baseline systems at the time the design is fixed. This thinking would still have TeX configurations that could run in 16-bit memory address limits. Even if you think that would be useful, the people who do the heavy lifting tend to use current or even leading edge hardware and are going to be more interested in the new capabilities they can get by taking advantage of the latest hardware developments than minimizing memory footprints. Ultimately, the decisions are made by the people who write the code. The TeX ecosystem needs some reasonable limits on how long old versions should be supported. If users can't get adequate support from their -- Matthew Skala msk...@ansuz.sooke.bc.ca People before principles. http://ansuz.sooke.bc.ca/ -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- George N. White III aa...@chebucto.ns.ca Head of St. Margarets Bay, Nova Scotia -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
On Tue, Oct 18, 2011 at 10:43 AM, Chris Travers chris.trav...@gmail.com wrote: Users also don't like to discover that the publishers' LaTeX format they need won't work with the distro TeX, or that a document that formats correctly on a co-author's Mac or Windows system won't format on their linux system. The TeX ecosystem needs some reasonable limits on how long old versions should be supported. If users can't get adequate support from their distro there are better supported alternatives. Given that server software that I work with usually has at least a five year support cycle, what are those reasonable limits? Five years seems to be a common support period. If you sell a distro with 5-year support then you should know that some packages you provide will expire before the 5 years is up and be prepared to provide updates. Or is XeTeX not intended to be used in these environments? The 5 year limit is in LaTeX, not xetex. -- George N. White III aa...@chebucto.ns.ca Head of St. Margarets Bay, Nova Scotia -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
On Tue, Oct 18, 2011 at 7:09 AM, Ulrike Fischer ne...@nililand.de wrote: So the limit is five years (but only for the latex kernel). The version date of my (current) latex.ltx ist \edef\fmtversion{2011/06/27} Or is XeTeX not intended to be used in these environments? I would say that if your latex is more than five years old, your xetex binaries and packages aren't up-to-date either. And as xetex is rather young this can be quite a problem. Regardless if you want to ship out only xetex documents or xetex documents + binaries: You should be aware that other people can have up-to-date systems and so you should make tests on such systems too (and just in case you don't know: you can't use a fmt generated by one xetex version with another xetex version). Of course. I don't expect .fmt files to be portable. What is helpful is to know how to resolve the issue so I can put a faq entry in and direct people to it when they ask on the mailing list. (And if they can't get it, charge for support.) I believe I have gotten that, so I am satisfied with the resolution. However, so that there are no misunderstandings The issue here is being forced to choose between supporting XeTeX on many platforms and being able to support the platform's package manager. I don't see anyone here suggesting a way around that. For developers distributing software, that's kind of an issue. Here's a breakdown of OS support for TexLive versions for anyone interested: Debian Lenny: TexLive 2007 Debian Squeeze: TexLive 2009 Debian Sid: TexLive 2009 Ubuntu 10.04 LTS: TexLive 2009 Red Hat Enterprise 6: TexLive 2007 That means that the most recent versions of CentOS and Scientific Linux also use 2007. To be clear, I am not saying every issue has to be resolved. And I did get enough info to solve my problem off this list (and for that I am grateful). I am however saying that some comments on this list seem more aimed at end users than folks trying to build tools which integrate with a TeX environment. However, the software project has contributors on both TexLive 2007 and 2009, and so our coverage in terms of testing is pretty good there. I also understand George's point that ensuring backwards compatibility isn't always either desirable or possible. I am not asking for that either. Quite frankly if I can get documentation about what needs to change for documents to render I can supply alternate copies of templates, avoid trouble spots, etc.. I also understand that these things are usually most unstable for young software. My own package, LedgerSMB, is going through similar issues. I don't know anyone that ensures absolute backwards compatibility. That way madness lies-- it locks you into bad decisions that you make before the full scope of the problem becomes known. I am not asking for any of that. However, this argument is going on because I was told that upgrading was the solution, I said it wasn't a possibility for me and outlined why, and folks decided to push the issue. What I don't understand is what is to be gained by pushing the issue. Best Wishes, Chris Travers -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Am 18.10.2011 um 15:43 schrieb Chris Travers: Given that server software that I work with usually has at least a five year support cycle, what are those reasonable limits? For TeX I'd think in decades. And support in the way TeX understands this term is constant development and constant updating. TeX is alive. Or is XeTeX not intended to be used in these environments? XeTeX is likely to stop in development, maybe before reaching version 1.0, because of the lack of active developers (and because of improving alternatives like LuaTeX). So soon you'll have a software that will last longer than your hardware and won't need updating. BTW, XeTeX from five years ago was a bit away from perfection and quite a few bugs richer. Particularly the xdvipdfmx convertor was heavily improved. -- Greetings Pete The best way to accelerate a PC is 9.8 m/s² -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] How to manually create the xelatex.fmt?
Am 18.10.2011 um 16:39 schrieb Chris Travers: Here's a breakdown of OS support for TexLive versions for anyone interested: Debian Lenny: TexLive 2007 Debian Squeeze: TexLive 2009 Debian Sid: TexLive 2009 Ubuntu 10.04 LTS: TexLive 2009 Red Hat Enterprise 6: TexLive 2007 That means that the most recent versions of CentOS and Scientific Linux also use 2007. Forget these RPM or DEB based re-packings! (The support from their distributors/repackagers can be a bit less than optimal.) Install TeX Live 2005, 2006, 2007, 2008, 2009, 2010, 2011! Then every user will have a choice. Because setting PATH and MANPATH makes any of these installations active and working. And you can use the TeX Live Manager, tlmgr, of each of these installations to support (setup, changes, updates) any of these. And when you start lacking some disk space you can use utilities like dupmerge to hard-link files from the stable distributions (all but that from this year) that many invariant files become unique on disk. I think it should even be possible to have one external disk (NAS or such) that also carries the 32-bit and 64-bit binaries for all the different Linux clients. (Because the TeX binaries are statically linked and therefore do not depend on subtle variations in systems' shared libraries.) -- Greetings Pete No man was ever taken to hell by a woman unless he already had a ticket in his pocket, or at least had been fooling around with timetables. – Archie Goodwin -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] traditional to simplified Chinese character conversion utility or data base
Hi Zdenek, Thank you for your suggestions. On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: you can just use tr, ... if you supply the map. I don't know what tr is, but this comes back to one of my original problems; and that is, I don't have a map. Does anyone know of a publicly available map? Such a map very likely exists. For example, Google Translate can translate from traditional to simplified. But even if they use a map for this service, that map may be proprietary. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. I like these solutions. However, again, I would still need a map. SIL has a collection of maps available here: http://scripts.sil.org/cms/scripts/page.php?site_id=nrsicat_id=ConversionMaps But I didn't see a Chinese traditional--simplified character map. Dan On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: 2011/10/17 Daniel Greenhoe dgreen...@gmail.com: I know that this is not really the right mailing list for this question, but I have so far not found the answer by any other means ... I would like to find or write some a utility that would take an unicode encoded file and map Chinese traditional characters to simplified, while leaving all other code points (such as those in the Latin and IPA code spaces) untouched. For example, the traditional character for horse (馬) is at unicode U+99AC, the simplified one (马) is at unicode U+9A6C, and the Latin character for A is at U+0041. So I want a utility that would change the 99AC to 9A6C, but leave the 0041 unchanged. If it is really that simple 1:1 mapping, you can just use tr, it does exactly that if you supply the map. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. Does anyone know of such a utility? Does anyone know of any data base with a traditional to simplified character mapping such that I could maybe write the utility myself? Many thanks in advance, Dan -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Zdeněk Wagner http://hroch486.icpf.cas.cz/wagner/ http://icebearsoft.euweb.cz -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] traditional to simplified Chinese character conversion utility or data base
You can try digging in the source for Tong Wen Tang (a Firefox extension). Or email its developers. They should have a map and additional notes on the conversion. On Tue, Oct 18, 2011 at 18:50, Daniel Greenhoe dgreen...@gmail.com wrote: Hi Zdenek, Thank you for your suggestions. On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: you can just use tr, ... if you supply the map. I don't know what tr is, but this comes back to one of my original problems; and that is, I don't have a map. Does anyone know of a publicly available map? Such a map very likely exists. For example, Google Translate can translate from traditional to simplified. But even if they use a map for this service, that map may be proprietary. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. I like these solutions. However, again, I would still need a map. SIL has a collection of maps available here: http://scripts.sil.org/cms/scripts/page.php?site_id=nrsicat_id=ConversionMaps But I didn't see a Chinese traditional--simplified character map. Dan On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: 2011/10/17 Daniel Greenhoe dgreen...@gmail.com: I know that this is not really the right mailing list for this question, but I have so far not found the answer by any other means ... I would like to find or write some a utility that would take an unicode encoded file and map Chinese traditional characters to simplified, while leaving all other code points (such as those in the Latin and IPA code spaces) untouched. For example, the traditional character for horse (馬) is at unicode U+99AC, the simplified one (马) is at unicode U+9A6C, and the Latin character for A is at U+0041. So I want a utility that would change the 99AC to 9A6C, but leave the 0041 unchanged. If it is really that simple 1:1 mapping, you can just use tr, it does exactly that if you supply the map. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. Does anyone know of such a utility? Does anyone know of any data base with a traditional to simplified character mapping such that I could maybe write the utility myself? Many thanks in advance, Dan -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Zdeněk Wagner http://hroch486.icpf.cas.cz/wagner/ http://icebearsoft.euweb.cz -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex
Re: [XeTeX] traditional to simplified Chinese character conversion utility or data base
On Wed, Oct 19, 2011 at 10:05 AM, Andy Lin kir...@gmail.com wrote: You can try digging in the source for Tong Wen Tang ... Or email its developers. That's a great idea --- thanks! Dan On Wed, Oct 19, 2011 at 10:05 AM, Andy Lin kir...@gmail.com wrote: You can try digging in the source for Tong Wen Tang (a Firefox extension). Or email its developers. They should have a map and additional notes on the conversion. On Tue, Oct 18, 2011 at 18:50, Daniel Greenhoe dgreen...@gmail.com wrote: Hi Zdenek, Thank you for your suggestions. On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: you can just use tr, ... if you supply the map. I don't know what tr is, but this comes back to one of my original problems; and that is, I don't have a map. Does anyone know of a publicly available map? Such a map very likely exists. For example, Google Translate can translate from traditional to simplified. But even if they use a map for this service, that map may be proprietary. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. I like these solutions. However, again, I would still need a map. SIL has a collection of maps available here: http://scripts.sil.org/cms/scripts/page.php?site_id=nrsicat_id=ConversionMaps But I didn't see a Chinese traditional--simplified character map. Dan On Tue, Oct 18, 2011 at 2:53 PM, Zdenek Wagner zdenek.wag...@gmail.com wrote: 2011/10/17 Daniel Greenhoe dgreen...@gmail.com: I know that this is not really the right mailing list for this question, but I have so far not found the answer by any other means ... I would like to find or write some a utility that would take an unicode encoded file and map Chinese traditional characters to simplified, while leaving all other code points (such as those in the Latin and IPA code spaces) untouched. For example, the traditional character for horse (馬) is at unicode U+99AC, the simplified one (马) is at unicode U+9A6C, and the Latin character for A is at U+0041. So I want a utility that would change the 99AC to 9A6C, but leave the 0041 unchanged. If it is really that simple 1:1 mapping, you can just use tr, it does exactly that if you supply the map. If you wish to do it on the fly in XeTeX, you can write a TECkit map. Having the TECkit map you can also run txtconv from the command line. Does anyone know of such a utility? Does anyone know of any data base with a traditional to simplified character mapping such that I could maybe write the utility myself? Many thanks in advance, Dan -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Zdeněk Wagner http://hroch486.icpf.cas.cz/wagner/ http://icebearsoft.euweb.cz -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex -- Subscriptions, Archive, and List information, etc.: http://tug.org/mailman/listinfo/xetex