I probably missed something here, but I can't figure out what. I have this problem on two different machines (an x86 running links-2.1_pre20 and an amd64 running links-2.1_pre19). Yes, I followed the instructions at http://gentoo.org/doc/en/utf-8.xml. I have the UTF-8 I/O option checked. Yes, I did compile it with the unicode USE flag. I mostly run KDE with konsole, and I have unicode working fine in other apps (vim and less, though less does leave garbage on the screen), but I can't get links to display anything other than plain ASCII.

$ emerge -p info
Portage 2.0.54 (default-linux/amd64/2006.0, gcc-3.4.5, glibc-2.3.5-r2, 2.6.15-gentoo-r7 x86_64)
=================================================================
System uname: 2.6.15-gentoo-r7 x86_64 AMD Athlon(tm) 64 Processor 3200+
Gentoo Base System version 1.6.14
dev-lang/python: 2.4.2
sys-apps/sandbox: 1.2.12
sys-devel/autoconf: 2.13, 2.59-r7
sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils: 2.16.1
sys-devel/libtool: 1.5.22
virtual/os-headers: 2.6.11-r2
ACCEPT_KEYWORDS="amd64"
AUTOCLEAN="yes"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=k8 -O3 -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3.4/env /usr/kde/3.4/share/config /usr/kde/3.4/shutdown /usr/kde/3/share/config /usr/lib/X11/xkb /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-march=k8 -O3 -pipe"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks sandbox sfperms strict"
GENTOO_MIRRORS="http://distfiles.gentoo.org http://distro.ibiblio.org/pub/linux/distributions/gentoo"
LANG="en_NZ.UTF-8"
LINGUAS="en ru"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="amd64 X aac aalib acpi alsa arts audiofile berkdb bitmap-fonts bzip2 caps cdparanoia cdr cjk crypt css cups dga directfb dri dvd dvdr encode exif expat fam fbcon ffmpeg flac ggi gif glibc-omitfp gmp gphoto2 gpm gstreamer gtk2 idea idn imagemagick imap imlib ipv6 isdnlog javascript joystick jpeg kde lcms libcaca libwww live lm_sensors mad matroska mbox memlimit mng motif mp3 mpeg mysql nas ncurses network nls nptl nptlonly ogg opengl pcre pdflib perl png ppds pppd qt quicktime readline real rtc samba scanner sdl silc speex spell ssl tcpd theora tiff truetype truetype-fonts type1-fonts udev unicode usb userlocales utf8 vcd vorbis wifi xinerama xml xml2 xmms xv xvid zlib linguas_en linguas_ru userland_GNU kernel_linux elibc_glibc"
Unset: ASFLAGS, CTARGET, LC_ALL, LDFLAGS, MAKEOPTS
Could you please post the following: the output of "$ locale", a link to an example UTF-8 HTML page, and what your screen looks like when that page is displayed incorrectly?
Did you know that links has full UTF-8 support if you use graphical mode with the -g command-line option? I compiled links and ran it both in a terminal and with the -g option: in the terminal it shows only ASCII, in graphical mode it shows unicode. I tested on my blog: http://blog.devil.mk.ua
$ locale
LANG=en_US.UTF-8
LC_CTYPE="en_US.UTF-8"
LC_NUMERIC="en_US.UTF-8"
LC_TIME="en_US.UTF-8"
LC_COLLATE=POSIX
LC_MONETARY="en_US.UTF-8"
LC_MESSAGES="en_US.UTF-8"
LC_PAPER="en_US.UTF-8"
LC_NAME="en_US.UTF-8"
LC_ADDRESS="en_US.UTF-8"
LC_TELEPHONE="en_US.UTF-8"
LC_MEASUREMENT="en_US.UTF-8"
LC_IDENTIFICATION="en_US.UTF-8"
LC_ALL=

A simple one: http://slovo.info/testuni.htm

I don't know how you want me to show what I see, but copying it (using Konsole):

+-------------------------------------------------------------------------------------------------------------------------+
| Description                                   | This is what your browser shows... | ...and this is what you should see |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Czech and Slovak characters                   | s t z l c e d n r u l              |                                    |
|                                               | S T Z L C E D N R U L              |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Polish characters                             | l a z e c n s z                    |                                    |
|                                               | L A Z E C N S Z                    |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Romanian characters                           | A a S s T t                        |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Croatian and Slovenian characters             | s c z c d/                         |                                    |
|                                               | S C Z C D/                         |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Hungarian characters                          | O" o" U" u"                        |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| German characters                             | A:, a:, O:, o:, U:, u:, ss         |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
|                                               | abvgdeyozhzijklmno                 |                                    |
|                                               | prstufhcchshch'y~e'yuya            |                                    |
| Russian alphabet                              | ABVGDEYOZHZIJ                      |                                    |
|                                               | KLMNOPRSTUF                        |                                    |
|                                               | HCCHSHSHCH'Y~E'YUYA                |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|
| Special Byelorussian and Ukrainian characters | V% v% IE ie G3 g3                  |                                    |
|-----------------------------------------------+------------------------------------+------------------------------------|

That matches exactly what I see. :)

Just for completeness:

$ equery uses links
[ Searching for packages matching links... ]
[ Colour Code : set unset ]
[ Legend : Left column (U) - USE flags from make.conf ]
[ : Right column (I) - USE flags packages was installed with ]
[ Found these USE variables for www-client/links-2.1_pre19 ]
U I
+ + X : Adds support for X11
+ + directfb : Adds support for DirectFB layer (library for FB devices)
+ + fbcon : Adds framebuffer support for the console, via the kernel
+ + gpm : Adds support for sys-libs/gpm (Console-based mouse driver)
+ + javascript : Enables javascript support
+ + jpeg : Adds JPEG image support
- - livecd : !!internal use only!! DO NOT SET THIS FLAG YOURSELF!, used during livecd building.
+ + png : Adds support for libpng (PNG images)
+ + ssl : Adds support for Secure Socket Layer connections
- - svga : Adds support for SVGAlib (graphics library)
+ + tiff : Adds support for the tiff image format
+ + unicode : Adds support for Unicode

OK, it does show unicode in graphical mode (well, on that page I gave before, it has a couple of characters wrong, but close enough). I guess I just assumed it would work in text mode (which I frequently use to save bandwidth). Is it supposed to work in text mode (I don't know why it shouldn't)?
> Is it supposed to work in text mode (I don't know why it shouldn't)?

I don't think so, because in text mode I can't find anywhere in the configuration to select a UTF-8 charset. Maybe try w3m? It supports unicode very well in a text console. :)
Links doesn't support full UTF-8 output on a terminal (there are only fixes for drawing lines on such terminals). Therefore it translates "ä" into "a:", or removes the diacritics from other characters, e.g. "á" -> "a". This is a relic of the old days, when the program was developed for a Czech audience using Linux virtual terminals that could not show all ISO-8859-2 characters. I agree it's curious today, but so far nobody has been motivated enough to fix it.
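For readers unfamiliar with this kind of fallback, here is a rough Python sketch of the behaviour described above. It is not Links' actual code (Links is written in C and uses its own translation tables); the function name to_ascii_fallback and the SPECIAL table are made up for illustration, and the table only covers the German cases seen in the pasted output earlier in this thread.

```python
import unicodedata

# Hypothetical table of multi-character replacements, modelled on the
# "A:, a:, O:, o:, U:, u:, ss" row in the pasted output. Not taken from Links.
SPECIAL = {
    "ä": "a:", "ö": "o:", "ü": "u:",
    "Ä": "A:", "Ö": "O:", "Ü": "U:",
    "ß": "ss",
}


def to_ascii_fallback(text):
    """Approximate non-ASCII text with plain ASCII, one character at a time."""
    out = []
    for ch in text:
        if ch in SPECIAL:
            out.append(SPECIAL[ch])
            continue
        # NFKD splits e.g. "á" into "a" followed by a combining acute accent.
        decomposed = unicodedata.normalize("NFKD", ch)
        # Drop the combining marks, keeping only the base character(s).
        base = "".join(c for c in decomposed if not unicodedata.combining(c))
        # Anything still outside ASCII (Cyrillic, "ł", ...) becomes "?" here.
        out.append(base if base.isascii() else "?")
    return "".join(out)


if __name__ == "__main__":
    print(to_ascii_fallback("ä á č ß"))
```

Note that this sketch is cruder than what the pasted table shows: characters with no Unicode decomposition, such as the Polish "ł" or Cyrillic letters, simply become "?" here, whereas Links evidently has explicit entries for them ("l", "zh", "shch", and so on).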
Hello. Unicode support is broken again in www-client/links-2.3_pre1-r1. USE="unicode" does nothing.
Yay, I can now read both Cyrillic and extended Latin in www-client/links-2.4-r1! (Setup > Character set > Unicode UTF-8)
(In reply to comment #7)
> Yay, I could read both Cyrillic and extended Latin in www-client/links-2.4-r1
> now!
>
> (Setup > Character set > Unicode UTF-8)

Yes, this is a new feature from upstream.
(In reply to comment #8)
> (In reply to comment #7)
> > Yay, I could read both Cyrillic and extended Latin in www-client/links-2.4-r1
> > now!
> >
> > (Setup > Character set > Unicode UTF-8)
>
> Yes, this new feature from upstream.

I suppose so. But then the bug is fixed and can finally be closed, can't it?