Re: [XeTeX] Ligatures and searching in PDFs

2010-06-10 Thread David J. Perry
Gareth, Everything that Khaled said in his message is correct, particularly about PDFs relying on glyph names and about not using the Unicode presentation forms. My comments about ligatures not having PUA assignments were written under the assumption that they were all correctly named (e.g.,

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-10 Thread Khaled Hosny
On Thu, Jun 10, 2010 at 06:26:12PM +0100, Gareth Hughes wrote: > David J. Perry wrote: > > I am curious; are you using standard Unicode Syriac fonts? In such > > fonts, there is no need for, nor should there be, PUA assignments for > > the joined shapes. (And any font whose maker puts joined shap

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-10 Thread Gareth Hughes
David J. Perry wrote: > I am curious; are you using standard Unicode Syriac fonts? In such > fonts, there is no need for, nor should there be, PUA assignments for > the joined shapes. (And any font whose maker puts joined shapes > "somewhere that's going to spare" needs to go back to Unicode 101

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-09 Thread David J. Perry
Scripsit Gareth: What is more, I do a lot of work with Syriac, a cursive script for which most joined shapes are encoded in the PUA or somewhere that's going spare. This means that my XeTeX PDFs aren't searchable or copyable in Syriac. Only one or two Syriac letters per word can be searched or c

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-09 Thread Gareth Hughes
Ross Moore wrote: > However, PDF has two separate mechanisms to overcome this. > > 1. a CMap resource for the font > 2. the /ActualText tagging construction > > Concerning method 1. CMap resources: > > I don't know where that CMap resource is being constructed. > Presumably it is by xdvip

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-08 Thread Philip Taylor (Webmaster, Ret'd)
Argh, this was not "off-list". Please accept my apologies, and also keep the information in this confidential. Philip Taylor Philip Taylor (Webmaster, Ret'd) wrote: [Off-list] Have I told you that Thanh has created a Win-32 version of the modified PdfTeX [...]

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-08 Thread Philip Taylor (Webmaster, Ret'd)
[Off-list] Have I told you that Thanh has created a Win-32 version of the modified PdfTeX, and after some (many !) trials and tribulations, I finally persuaded it to work within a TeXlive framework ? I am not yet certain that it is correct -- it seem to see rather more debugging output from proce

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-07 Thread Ross Moore
Hi Andy, On 08/06/2010, at 11:39 AM, Andy Lin wrote: It seems I misunderstood what exactly the TECkit mapping does. All it does is change the input as instructed. All other "features" -- copy/paste and search compatibility -- I'd assumed was attributed to TECkit is actually that of the PDF read

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-07 Thread David J. Perry
June 07, 2010 9:39 PM Subject: Re: [XeTeX] Ligatures and searching in PDFs It seems I misunderstood what exactly the TECkit mapping does. All it does is change the input as instructed. All other "features" -- copy/paste and search compatibility -- I'd assumed was attributed to TEC

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-07 Thread Andy Lin
It seems I misunderstood what exactly the TECkit mapping does. All it does is change the input as instructed. All other "features" -- copy/paste and search compatibility -- I'd assumed was attributed to TECkit is actually that of the PDF reader (in my case, Adobe Reader). So, when Adobe Reader enc

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-06 Thread Gareth Hughes
Andy Lin wrote: > In order to make the common f/ff ligatures searchable in PDFs, add the > following lines and compile the map file with teckit_compile (should > be in the bin folder): > U+0066 U+0066 <> U+FB00 ; ff -> ff ligature > U+0066 U+0069 <> U+FB01 ; fi -> fi ligature > U+0066 U

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-01 Thread Andy Lin
> One ting I'm wondering about: not all of the fonts I use always have > all those ligatures. From what I understand from you, can't check > right now, glyphs will be replaced usgin the mapping regardless of > glyph availability, which would lead to missing glyphs in the docuemnt > if not available

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-01 Thread Jonathan Kew
On 1 Jun 2010, at 19:12, Diederick C. Niehorster wrote: > Would it therefore make more sense to put these mappings in a separate > file and load that mapping as well when required? Can multiple > mappings be loaded? No. (But you can of course choose different mappings for different fonts, accord

Re: [XeTeX] Ligatures and searching in PDFs

2010-06-01 Thread Diederick C. Niehorster
Hi Andy, Thanks a lot for your post, this is very useful! One ting I'm wondering about: not all of the fonts I use always have all those ligatures. From what I understand from you, can't check right now, glyphs will be replaced usgin the mapping regardless of glyph availability, which would lead

[XeTeX] Ligatures and searching in PDFs

2010-06-01 Thread Andy Lin
Sorry to revive this topic, but I think I've found a solution. The original post described a problem when using the rare ligatures (e.g. "fty") in the Junicode font, in that the strings could not be found by their decomposed characters. At the time, it was suggested the /ActualText PDF feature wou

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-16 Thread Joel C. Salomon
On 05/10/2010 03:36 AM, Janusz S. Bień wrote: > On Mon, 10 May 2010 Paul Foley wrote: >> Try the following: >> >> \documentclass{article} >> \usepackage{xltxtra} >> \setmainfont[Mapping=tex-text,Numbers=OldStyle,Ligatures={Required,Common,Rare}]{Junicode} >> >> \begin{document} >> Fifty afflicted

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-16 Thread Peter Baker
Meho R. wrote: So, what we have concluded: that we have accsupp and Junicode. But can anyone show how to solve this problem using the example in OP's post? \documentclass{article} \usepackage{xltxtra} \setmainfont[Mapping=tex-text,Numbers=OldStyle,Ligatures={Required,Common,Rare}]{Junicode} \b

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-16 Thread Meho R.
ument} Fifty afflicted fjords. \end{document} From: Janusz S. Bień To: Khaled Hosny Cc: Unicode-based TeX for Mac OS X and other platforms Sent: Tue, May 11, 2010 5:04:58 AM Subject: Re: [XeTeX] Ligatures and searching in PDFs On Tue, 11 May 2010 Khaled Hosny

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-11 Thread Peter Baker
Peter Dyballa wrote: Am 11.05.2010 um 14:13 schrieb Peter Baker: the version I posted the other day This seems to have exactly the same version number as the font files which came with TeX Live 2009... I didn't change the number because it wasn't an official release. I probably should ha

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-11 Thread Peter Dyballa
Am 11.05.2010 um 14:13 schrieb Peter Baker: the version I posted the other day This seems to have exactly the same version number as the font files which came with TeX Live 2009... -- Greetings Pete To drink without thirst and to make love all the time, madam, it is only these which

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-11 Thread Peter Baker
On 5/10/10 11:04 PM, Janusz S. Bień wrote: You are right: http://www.tex.ac.uk/tex-archive/macros/latex/contrib/oberdiek/accsupp.pdf I was not aware of it. Best regards Janusz I've been very concerned about the searchability of PDFs and have begun to build Junicode in such a way that l

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread Janusz S. Bień
On Tue, 11 May 2010 Khaled Hosny wrote: [...] > IIRC, there are already latex packages that adds higher level support > for ActualText tags (low level support is already in the engines). You are right: http://www.tex.ac.uk/tex-archive/macros/latex/contrib/oberdiek/accsupp.pdf I was not awar

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread Khaled Hosny
On Tue, May 11, 2010 at 04:26:11AM +0200, Janusz S. Bień wrote: > On Mon, 10 May 2010 "David J. Perry" wrote: > > >> > >> The proper solution would be to use /ActualText feature of the PDF > >> specification. > > > > I am very interested in this issue of searching PDFs. A google search for > >

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread Janusz S. Bień
On Mon, 10 May 2010 "David J. Perry" wrote: >> >> The proper solution would be to use /ActualText feature of the PDF >> specification. > > I am very interested in this issue of searching PDFs. A google search for > "PDF Actual Text" turned up nothing. I then downloaded the actual PDF spec >

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread David J. Perry
The proper solution would be to use /ActualText feature of the PDF specification. I am very interested in this issue of searching PDFs. A google search for "PDF Actual Text" turned up nothing. I then downloaded the actual PDF spec from the Adobe web site and found the reference, and got the

Re: [XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread Janusz S. Bień
On Mon, 10 May 2010 Paul Foley wrote: > 1. (*) text/plain ( ) text/html > > Try the following: > > \documentclass{article} > \usepackage{xltxtra} > \setmainfont[Mapping=tex-text,Numbers=OldStyle,Ligatures={Required,Common,Rare}]{Junicode} > > \begin{document} > Fifty afflict

[XeTeX] Ligatures and searching in PDFs

2010-05-10 Thread Paul Foley
Try the following: \documentclass{article} \usepackage{xltxtra} \setmainfont[Mapping=tex-text,Numbers=OldStyle,Ligatures={Required,Common,Rare}]{Junicode} \begin{document} Fifty afflicted fjords. \end{document} Load the PDF, and search for any of the words. The "fty", "ct" and "fj" ligatures ar