Bug 128151 - Can't get unicode to work with Links
Summary: Can't get unicode to work with Links
Product: Gentoo Linux
Component: Current packages
Hardware: All Linux
Assignee: Gentoo Linux bug wranglers
Reported: 2006-03-30 11:25 UTC by ta2002
Modified: 2012-01-28 12:21 UTC

Description ta2002 2006-03-30 11:25:06 UTC
I probably missed something here, but I
sure can't figure out what.

I have this problem on two different
machines: an x86 running links-2.1_pre20
and an amd64 running links-2.1_pre19.

Yes, I followed the instruction at I have
the UTF-8 I/O option checked. Yes, I did
compile it with the unicode "USE" flag.

I mostly run KDE with konsole, and I have
unicode working fine on other apps (vim
and less, though less does leave garbage
on the screen), but I can't get links to
display anything other than plain ASCII.

$ emerge -p info
Portage 2.0.54 (default-linux/amd64/2006.0, gcc-3.4.5, glibc-2.3.5-r2, 2.6.15-gentoo-r7 x86_64)
System uname: 2.6.15-gentoo-r7 x86_64 AMD Athlon(tm) 64 Processor 3200+
Gentoo Base System version 1.6.14
dev-lang/python:     2.4.2
sys-apps/sandbox:    1.2.12
sys-devel/autoconf:  2.13, 2.59-r7
sys-devel/automake:  1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils:  2.16.1
sys-devel/libtool:   1.5.22
virtual/os-headers:  2.6.11-r2
CFLAGS="-march=k8 -O3 -pipe"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3.4/env /usr/kde/3.4/share/config /usr/kde/3.4/shutdown /usr/kde/3/share/config /usr/lib/X11/xkb /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-march=k8 -O3 -pipe"
FEATURES="autoconfig distlocks sandbox sfperms strict"
LINGUAS="en ru"
USE="amd64 X aac aalib acpi alsa arts audiofile berkdb bitmap-fonts bzip2 caps cdparanoia cdr cjk crypt css cups dga directfb dri dvd dvdr encode exif expat fam fbcon ffmpeg flac ggi gif glibc-omitfp gmp gphoto2 gpm gstreamer gtk2 idea idn imagemagick imap imlib ipv6 isdnlog javascript joystick jpeg kde lcms libcaca libwww live lm_sensors mad matroska mbox memlimit mng motif mp3 mpeg mysql nas ncurses network nls nptl nptlonly ogg opengl pcre pdflib perl png ppds pppd qt quicktime readline real rtc samba scanner sdl silc speex spell ssl tcpd theora tiff truetype truetype-fonts type1-fonts udev unicode usb userlocales utf8 vcd vorbis wifi xinerama xml xml2 xmms xv xvid zlib linguas_en linguas_ru userland_GNU kernel_linux elibc_glibc"
Comment 1 Alexander Simonov 2006-04-08 09:45:55 UTC
Can you post the following information, please:

the output of
$ locale

a link to an example UTF-8 HTML page, and what your screen shows when that page is displayed incorrectly
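
For illustration, the reason for asking about the locale can be sketched in Python: a terminal program normally decides whether to emit UTF-8 from the LC_ALL, LC_CTYPE and LANG environment variables, in that order of precedence. The function names below are hypothetical and this is only a rough check, not how Links itself detects the charset.

```python
import os

def effective_ctype(env):
    """Return the locale name that governs character encoding.

    POSIX precedence: LC_ALL overrides LC_CTYPE, which overrides LANG.
    Empty values count as unset.
    """
    for var in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(var)
        if value:
            return value
    return "C"

def is_utf8_locale(env):
    """True if the effective locale names a UTF-8 codeset, e.g. en_US.UTF-8."""
    name = effective_ctype(env)
    # The codeset follows the dot; a modifier such as @euro may follow it.
    codeset = name.partition(".")[2].partition("@")[0]
    return codeset.lower().replace("-", "") == "utf8"

if __name__ == "__main__":
    # Report the locale of the current session.
    print(effective_ctype(os.environ), is_utf8_locale(os.environ))
```

If this reports a non-UTF-8 locale in the same Konsole session where links is started, the locale settings are worth fixing first; if it reports UTF-8, the problem is more likely in the application.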
Comment 2 Alexander Simonov 2006-04-08 09:52:38 UTC
Did you know that links has full UTF-8 support if you use graphical mode with the -g command-line option?

I compiled links and ran it both in a terminal and with the -g option: in the terminal it shows only ASCII, in graphical mode it shows Unicode.

I tested on my blog -
Comment 3 ta2002 2006-04-13 05:22:08 UTC
$ locale

A simple one:

I don't know how you want me to show what I see, but
copying it (using Konsole):

   |                    Description                     |   This is what your browser shows...   |   ...and this is what you should see   |
   | Czech and Slovak characters                        |  s  t  z  l  c  e d n r u l            |                                        |
   |                                                    |  S T Z L C E D N R U L                 |                                        |
   | Polish characters                                  |  l a z e c n s z                       |                                        |
   |                                                    |  L A Z E C N S Z                       |                                        |
   | Romanian characters                                |  A a S s T t                           |                                        |
   | Croatian and Slovenian characters                  |  s c z c d/                            |                                        |
   |                                                    |  S C Z C D/                            |                                        |
   | Hungarian characters                               |  O" o" U" u"                           |                                        |
   | German characters                                  |  A:, a:, O:, o:, U:, u:, ss            |                                        |
   |                                                    | abvgdeyozhzijklmno                     |                                        |
   |                                                    | prstufhcchshch'y~e'yuya                |                                        |
   | Russian alphabet                                   | ABVGDEYOZHZIJ                          |                                        |
   |                                                    | KLMNOPRSTUF                            |                                        |
   |                                                    | HCCHSHSHCH'Y~E'YUYA                    |                                        |
   | Special Byelorussian and Ukrainian characters      |  V% v% IE ie G3 g3                     |                                        |

That matches exactly what I see. :)

Just for completeness:

$ equery uses links
[ Searching for packages matching links... ]
[ Colour Code : set unset ]
[ Legend : Left column  (U) - USE flags from make.conf              ]
[        : Right column (I) - USE flags packages was installed with ]
[ Found these USE variables for www-client/links-2.1_pre19 ]
 U I
 + + X          : Adds support for X11
 + + directfb   : Adds support for DirectFB layer (library for FB devices)
 + + fbcon      : Adds framebuffer support for the console, via the kernel
 + + gpm        : Adds support for sys-libs/gpm (Console-based mouse driver)
 + + javascript : Enables javascript support
 + + jpeg       : Adds JPEG image support
 - - livecd     : !!internal use only!! DO NOT SET THIS FLAG YOURSELF!, used during livecd building.
 + + png        : Adds support for libpng (PNG images)
 + + ssl        : Adds support for Secure Socket Layer connections
 - - svga       : Adds support for SVGAlib (graphics library)
 + + tiff       : Adds support for the tiff image format
 + + unicode    : Adds support for Unicode

OK, it does show unicode in graphical mode (well, on that page I gave
before, it has a couple of characters wrong, but close enough).

I guess I just assumed it would work in text mode (which I frequently
use to save bandwidth). Is it supposed to work in text mode (I don't
know why it shouldn't)?
Comment 4 Alexander Simonov 2006-04-13 13:36:12 UTC
> Is it supposed to work in text mode (I don't
> know why it shouldn't)?

I don't think so, because in text mode I can't find any configuration option for using UTF-8 charsets.

Maybe try w3m? It supports Unicode very well in a text console :)
Comment 5 Petr Pisar 2007-09-25 20:43:03 UTC
Links doesn't support full UTF-8 output on a terminal (there are only fixes for drawing lines on such a terminal). Therefore it transliterates "ä" into "a:" and strips the diacritics from other characters, e.g. "á" -> "a".

This is a holdover from the old days, when the program was developed for a Czech audience using Linux virtual terminals that could not display all ISO-8859-2 characters.

I agree it's curious today, but nobody has been motivated enough to fix it up to the present day.
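
For illustration, the fallback described above can be sketched in Python. This is not Links' actual code (Links is written in C and ships its own translation tables); it is a minimal approximation that assumes Unicode NFKD decomposition is a fair stand-in for "removing diacritics". The override table uses the German mappings visible in the output pasted in comment 3, and the "?" placeholder is an assumption of this sketch.

```python
import unicodedata

# Overrides taken from the German row in comment 3 ("A:, a:, O:, o:, U:, u:, ss").
DIGRAPHS = {
    "ä": "a:", "ö": "o:", "ü": "u:",
    "Ä": "A:", "Ö": "O:", "Ü": "U:",
    "ß": "ss",
}

def ascii_fallback(text: str) -> str:
    """Approximate text in plain ASCII, roughly as described in comment 5."""
    out = []
    for ch in text:
        if ch in DIGRAPHS:
            out.append(DIGRAPHS[ch])
            continue
        # NFKD splits "á" into "a" plus a combining accent; drop the accent.
        decomposed = unicodedata.normalize("NFKD", ch)
        stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
        # Keep the result only if it is ASCII; "?" is this sketch's placeholder.
        out.append(stripped if stripped.isascii() else "?")
    return "".join(out)

print(ascii_fallback("ä á ž č"))  # prints: a: a z c
```

Characters without a Unicode decomposition, such as the Polish "ł", come out as "?" here, whereas the output in comment 3 shows Links mapping it to "l"; a hand-written table would be needed to reproduce that, and Cyrillic would need a transliteration table of its own.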
Comment 6 Azamat H. Hackimov 2011-03-05 07:45:23 UTC
Unicode support in www-client/links-2.3_pre1-r1 is broken again. USE="unicode" does nothing.
Comment 7 Nikolaj Šujskij 2011-11-25 05:23:59 UTC
Yay, I could read both Cyrillic and extended Latin in www-client/links-2.4-r1 now!

(Setup > Character set > Unicode UTF-8)
Comment 8 Petr Pisar 2011-11-25 08:32:51 UTC
(In reply to comment #7)
> Yay, I could read both Cyrillic and extended Latin in www-client/links-2.4-r1
> now!
> (Setup > Character set > Unicode UTF-8)

Yes, this is a new feature from upstream.
Comment 9 Nikolaj Šujskij 2011-11-25 19:12:39 UTC
(In reply to comment #8)
> (In reply to comment #7)
> > Yay, I could read both Cyrillic and extended Latin in www-client/links-2.4-r1
> > now!
> > 
> > (Setup > Character set > Unicode UTF-8)
> Yes, this new feature from upstream.

I suppose so. But the bug is fixed and can be closed at last, can't it?