On 26/07/2022 11:26, K K wrote:
On 2022-07-26 Tue. 09:23 +0700,Max Nikulin wrote:

> However the suggestion was namely to use U+200B ZERO WIDTH SPACE and
> it
> is actually implemented since `org-emphasis-regexp-components'
> currently
> contains [:space:].
> ...
> K, could you, please, clarify what is your particular use case?

My bad, I misunderstood the "feature" mentioned in the old post.

My use case is to emphasize chinese characters without spaces being inserted, even those zero-width spaces. For example "中文*测*试" should be enough to emphasize "测".

I am using zero-width spaces right now, and it works fine in org-mode buffers, but if exported to latex-pdf files, the U+200B ZERO WIDTH SPACE character will not be zero-width for certain fonts. So I hope not to use that character.

I have not tested it, but I expect you can use
- export filter that removes zero-width spaces at the last export stage. I assume that your documents do not contain them besides markup workaround
- #+latex_header: \DeclareUnicodeCharacter{200B}{}
- custom link

   #+begin_src elisp :results none :exports both
     (org-link-set-parameters
      "sep"
      :export (lambda (path desc backend)
               (if (org-export-derived-backend-p backend 'org)
                   (org-link-make-string (concat "sep:" path) desc)
                 (or desc ""))))
   #+end_src
   "中文[[sep:][*测*]]试"

  https://list.orgmode.org/ssp8e7$ah2$1...@ciao.gmane.io/
Max Nikulin Re: [RFC] Creole-style / Support for **emphasis**__within__**a word** Tue, 25 Jan 2022 23:27:50 +0700

In other thread we are discussing advantages and problems of switching from PdfLaTeX to LuaLaTeX for non-latin scripts. The latter is a Unicode engine. I am curious what is your opinion from standpoint of Chinese language, namely amount of required customization in both cases. I think, it is better to either start a dedicated thread, or find the part of discussion related to fonts and babel (LaTeX package) setup.


Reply via email to