First Last Prev Next    No search results available      Search page      Enter new bug
Bug#: 121502
Alias:
Product:
Component:
Status: RESOLVED
Resolution: FIXED
Assigned To: Gentoo's Team for Core System packages <base-system@gentoo.org>
Hardware:
OS:
Version:
Priority:
Severity:
Reporter: throw_away_2002@yahoo.com
Add CC:
CC:
Remove selected CCs
URL:
Summary:
Status Whiteboard:
Keywords:

Filename Description Type Creator Created Size Actions
groff-man-UTF-8.diff groff-man-UTF-8.diff-modified patch Matthias Schwarzott 2006-09-05 01:55 0000 777 bytes Details | Diff
groff-man-UTF-8.diff groff-man-UTF-8.diff-second-try patch Matthias Schwarzott 2006-09-05 02:01 0000 778 bytes Details | Diff
Create a New Attachment (proposed patch, testcase, etc.) View All

Bug 121502 depends on: Show dependency tree
Show dependency graph
Bug 121502 blocks: 146315
Votes: 0    Show votes for this bug    Vote for this bug

Additional Comments: (this is where you put emerge --info)







View Bug Activity   |   Format For Printing   |   XML   |   Clone This Bug


Description:   Opened: 2006-02-03 21:38 0000
One of the strangest things I have ever seen.

ONLY a problem on amd64 (not my x86 machines).

ONLY a problem with LANG set to ANY utf-8 locale (for example, en_US.UTF-8,
but NOT simply en_US or POSIX).

(So far) ONLY a problem for ANY man page in the openssh package
(about a dozen files). Other packages (newly emerged) do not
have this problem, and I just re-emerged openssh because of the
security update. No change.

The problem: using either less or man to view any of these man pages,
the search function ("/") will not find the dash character ("-") in the
file (even with many of them obviously visible).

Hope somebody can duplicate this, but if not, happy to do whatever
testing I can.

$ emerge -p info
Portage 2.0.54 (default-linux/amd64/2005.1, gcc-3.4.4, glibc-2.3.5-r2,
2.6.15-gentoo-r1 x86_64)
=================================================================
System uname: 2.6.15-gentoo-r1 x86_64 AMD Athlon(tm) 64 Processor 3200+
Gentoo Base System version 1.6.14
dev-lang/python:     2.4.2
sys-apps/sandbox:    1.2.12
sys-devel/autoconf:  2.13, 2.59-r6
sys-devel/automake:  1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils:  2.16.1
sys-devel/libtool:   1.5.22
virtual/os-headers:  2.6.11-r2
ACCEPT_KEYWORDS="amd64"
AUTOCLEAN="yes"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=k8 -O3 -pipe -msse2 -mfpmath=sse"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3.4/env
/usr/kde/3.4/share/config /usr/kde/3.4/shutdown /usr/kde/3/share/config
/usr/lib/X11/xkb /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-march=k8 -O3 -pipe -msse2 -mfpmath=sse"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks sandbox sfperms strict"
GENTOO_MIRRORS="http://distfiles.gentoo.org
http://distro.ibiblio.org/pub/linux/distributions/gentoo"
LANG="en_NZ.UTF-8"
LINGUAS="en ru"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="amd64 X aac aalib acpi alsa apache2 arts audiofile avi berkdb bitmap-fonts
bzip2 caps cdparanoia cdr cjk crypt css cups dga directfb divx4linux dvd dvdr
emboss encode exif expat faad fam fbcon ffmpeg flac freetype gd ggi gif gmp
gphoto2 gpm gstreamer gtk2 idea idn imagemagick imap imlib ipv6 javascript
jikes joystick jpeg kde lcms libcaca libwww live lm_sensors lzw lzw-tiff mad
matroska mbox memlimit mng motif mp3 mpeg mpi mysql nas ncurses network nls
nptl nptlonly ogg opengl pcre pdflib perl png ppds qt quicktime readline real
rtc samba scanner sdl silc speex spell ssl tcpd theora tiff truetype
truetype-fonts type1-fonts udev unicode usb userlocales utf8 vcd vorbis wifi
xinerama xml2 xmms xpm xv xvid zlib linguas_en linguas_ru userland_GNU
kernel_linux elibc_glibc"
Unset:  ASFLAGS, CTARGET, LC_ALL, LDFLAGS, MAKEOPTS

------- Comment #1 From Harald van Dijk 2006-02-22 11:54:43 0000 -------
Could you please verify that what you're seeing is the ASCII minus sign, rather
than a non-ASCII Unicode symbol which looks exactly the same? One way to find
out is by viewing one of these manpages, copying the character with the mouse,
and typing

echo - | cat -v

in a shell, except that instead of typing -, you paste it. I'm guessing you'll
see  "M-bM-^HM-^R" instead of "-". If this is the case, could you please make
sure your /etc/man.conf is the same on all your machines, and if not, if you
can reproduce this on other systems by making it the same?

------- Comment #2 From throw_away_2002@yahoo.com 2006-02-23 12:20:50 0000 -------
Oops. :(

It seems like you have it right:

$ echo &#8722; | cat -v
M-bM-^HM-^R

And yes, I do have a difference in man.conf (the -Tascii option).

I guess that solves the problem.

Now I need to figure out if I even want to use utf-8 for man pages
(searching on what looks like an ascii "-" seems obvious to me, and
I do it all the time to find the description of an option).

Why on earth would the openssh people make those non-ascii characters
(in the middle of pure ascii text) when a far more obvious (at least
to me) alternative exists?

------- Comment #3 From Harald van Dijk 2006-02-23 14:09:46 0000 -------
> Why on earth would the openssh people make those non-ascii characters
> (in the middle of pure ascii text) when a far more obvious (at least
> to me) alternative exists?

It's not their decision. The manpage contains macros that tell nroff "format
'1' as an option", but it doesn't tell nroff how to do that. Other manpages
would contain "format '-1' in bold" instead, which is why it happens to work
with them, but I actually think openssh is doing the right thing here. (If you
want to be sure, you can check `gzip -dc /usr/share/man/man1/scp.1.gz`, and
look for the .Fl macros. Its meaning is described in the groff_mdoc manpage.) I
do think this may be a groff bug though, since &#8722;1 isn't a valid scp
option, only -1 is. base-system, as responsible for groff, added to CC for
additional input. Does this description sound about right, and if so, should
groff maybe be changed to force ASCII - for command-line options?

------- Comment #4 From Matthias Schwarzott 2006-09-04 04:12:06 0000 -------
Does this error still exists?

Can you please tell us what versions of "man" and "groff" you have installed.

------- Comment #5 From throw_away_2002@yahoo.com 2006-09-04 10:50:20 0000 -------
(In reply to comment #4)
> Does this error still exists?

No. My (very recent - as in two minutes ago :) ) update of man from 1.6-r1 to
1.6d appears to have fixed the problem.

------- Comment #6 From Matthias Schwarzott 2006-09-04 11:51:00 0000 -------
(In reply to comment #5)
> (In reply to comment #4)
> > Does this error still exists?
> 
> No. My (very recent - as in two minutes ago :) ) update of man from 1.6-r1 to
> 1.6d appears to have fixed the problem.
> 
1. Please also give us your version of groff.
2. With which man-page did you check the error?

For me on x86 it produces the error with "man scp" with man-1.6d and all
available versions of groff (1.18.1.1, 1.19.1-r2 and 1.19.2-r1).

------- Comment #7 From Jakub Moc 2006-09-04 16:14:04 0000 -------
Still broken here (x86 and amd64): sys-apps/man-1.6d, sys-apps/groff-1.19.2-r1

------- Comment #8 From Matthias Schwarzott 2006-09-05 01:53:56 0000 -------
This bug can be solved by adding the hack now positioned in
/usr/share/groff/site-tmac/man.local also to
/usr/share/groff/site-tmac/mdoc.local.

See attached (modified) groff-man-UTF-8.diff.

------- Comment #9 From Matthias Schwarzott 2006-09-05 01:55:34 0000 -------
Created an attachment (id=96041) [edit]
groff-man-UTF-8.diff-modified

------- Comment #10 From Matthias Schwarzott 2006-09-05 02:01:20 0000 -------
Created an attachment (id=96043) [edit]
groff-man-UTF-8.diff-second-try

------- Comment #11 From Jakub Moc 2007-04-04 07:28:43 0000 -------
*** Bug 173165 has been marked as a duplicate of this bug. ***

------- Comment #12 From spiritus 2007-04-04 19:19:10 0000 -------
I have seen groff and man in FC and Debian Etch are patched for compatibility
with UTF8 and autorecoding non-UTF8 mans(in KOI8-R, etc) to UTF. Patches are
inside their source packages. For examples:
http://mirrors.dotsrc.org/fedora/6/source/SRPMS/man-1.6d-1.1.src.rpm and
http://mirrors.dotsrc.org/fedora/6/source/SRPMS/groff-1.18.1.1-11.1.src.rpm.

------- Comment #13 From Ɓukasz Damentko 2007-04-28 15:12:30 0000 -------
*** Bug 176363 has been marked as a duplicate of this bug. ***

------- Comment #14 From Jakub Moc 2007-09-06 15:58:45 0000 -------
*** Bug 191488 has been marked as a duplicate of this bug. ***

------- Comment #15 From SpanKY 2008-02-24 18:11:12 0000 -------

*** This bug has been marked as a duplicate of bug 126361 ***

------- Comment #16 From SpanKY 2008-02-24 18:42:46 0000 -------
blah, goddamn mess of dupes

this bug is about the dash issue with unicode / non-unicode

it is not about anything else

------- Comment #17 From SpanKY 2008-02-24 18:57:43 0000 -------
looks like this was half way fixed (man.local) but the important part
(mdoc.local) was left out

groff-1.19.2-r2 includes mdoc.local as well

http://sources.gentoo.org/sys-apps/groff/files/groff-1.19.2-man-unicode-dashes.patch?rev=1.1

First Last Prev Next    No search results available      Search page      Enter new bug