On Mon, Apr 27, 2026 at 02:48:23PM +0300, Eli Zaretskii wrote:
> > From: Gavin Smith <[email protected]>
> > Date: Sun, 26 Apr 2026 22:01:37 +0100
> > Cc: [email protected]
> > 
> > I remember that the convention of merging index entries with the
> > same sort key was very old (from when I looked at this before, several
> > years ago), but I thought we could reconsider this, as I do not actually
> > see any advantage of merging the entries.
> > 
> > Does anyone have an opinion on this?
> 
> Isn't this what causes the Index to have stuff like
> 
>   Foo bar.......................................42, 142, 442
> 
> rather than
> 
>   Foo bar.......................................42
>   Foo bar.......................................142
>   Foo bar.......................................442
> 
> That is, if the same subject is mentioned in several places, have on
> cumulative entry for it in the index with all the pages?  If so, I
> see a clear advantage to merging the entries, at least for
> non-punctuation characters.

I propose that they only be merged if the index entry text is identical.

Thus, the following two index entries should be merged:

@cindex Foo bar
@cindex Foo bar

However, the following index entries should be distinct:

@cindex Foo bar
@cindex Foo @code{bar}
@cindex Foo @command{bar}
@cindex Föö bar

I notice in the NEWS file for Texinfo, in the section for 6.0, there
is:

* texindex:
  . completely new implementation as a literate program using Texinfo
    and (portable) awk (called TexiWeb Jr.), thanks to Arnold Robbins.
    (Requires gawk 4.0+ if .twjr source is modified.)
  . the -o (--output) is not supported, unless we hear of someone using it.
  . duplicated sort keys with different display texts result in one
    merged index entry, using the first display text.
  . better sorting and parsing in unusual cases; most notably, { and }
    characters can appear as initials.

Bullet point 3 is what I am talking about here.

Reply via email to