this session shows luit doesn't work: imi@boborjan ~ $ echo $LANG hu_HU.UTF-8 imi@boborjan ~ $ LANG=hu_HU luit bash imi@boborjan ~ $ echo $LANG hu_HU imi@boborjan ~ $ echo é >/tmp/teszt1.txt imi@boborjan ~ $ echo ő >>/tmp/teszt1.txt imi@boborjan ~ $ exit imi@boborjan ~ $ echo $LANG hu_HU.UTF-8 imi@boborjan ~ $ hexdump -C </tmp/teszt1.txt 00000000 c3 a9 0a c5 91 0a |......| 00000006 imi@boborjan ~ $ cat /tmp/teszt1.txt é ő imi@boborjan ~ $ if luit have done its work, then at the end two glitched characters would show up, but now two correctly encoded UTF8 characters are present. Reproducible: Always Steps to Reproduce:
You are overriding your locales incorrectly, LANG is overriden by respective LC_* settings; you need to use `LC_CTYPE=hu_HU luit bash` instead.
okay, copypasting another session, the error is still the same: imi@boborjan ~ $ echo $LC_CTYPE imi@boborjan ~ $ echo $LANG hu_HU.UTF-8 imi@boborjan ~ $ LC_CTYPE=hu_HU bash imi@boborjan ~ $ echo $LC_CTYPE hu_HU imi@boborjan ~ $ echo $LANG hu_HU.UTF-8 imi@boborjan ~ $ echo é >/tmp/teszt1.txt imi@boborjan ~ $ echo ő >>/tmp/teszt1.txt imi@boborjan ~ $ exit exit imi@boborjan ~ $ echo $LC_CTYPE imi@boborjan ~ $ echo $LANG hu_HU.UTF-8 imi@boborjan ~ $ hexdump -C </tmp/teszt1.txt 00000000 c3 a9 0a c5 91 0a |......| 00000006 imi@boborjan ~ $ cat /tmp/teszt1.txt é ő imi@boborjan ~ $ cat /etc/env.d/02locale LC_ALL="hu_HU.UTF-8" LANG="hu_HU.UTF-8" imi@boborjan ~ $ locale LANG=hu_HU.UTF-8 LC_CTYPE="hu_HU.UTF-8" LC_NUMERIC="hu_HU.UTF-8" LC_TIME="hu_HU.UTF-8" LC_COLLATE="hu_HU.UTF-8" LC_MONETARY="hu_HU.UTF-8" LC_MESSAGES="hu_HU.UTF-8" LC_PAPER="hu_HU.UTF-8" LC_NAME="hu_HU.UTF-8" LC_ADDRESS="hu_HU.UTF-8" LC_TELEPHONE="hu_HU.UTF-8" LC_MEASUREMENT="hu_HU.UTF-8" LC_IDENTIFICATION="hu_HU.UTF-8" LC_ALL=hu_HU.UTF-8 imi@boborjan ~ $
Please move this out of bugzilla... It's clearly working just as expected when used correctly. http://www.gentoo.org/main/en/support.xml