Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 99725 - utf-8.xml: minor fixes
Summary: utf-8.xml: minor fixes
Status: RESOLVED FIXED
Alias: None
Product: [OLD] Docs-user
Classification: Unclassified
Component: Other (show other bugs)
Hardware: All Linux
: High normal (vote)
Assignee: Sven Vermeulen (RETIRED)
URL: http://www.gentoo.org/doc/en/utf-8.xml
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2005-07-20 14:46 UTC by Jan Kundrát (RETIRED)
Modified: 2005-08-01 05:54 UTC (History)
2 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Jan Kundrát (RETIRED) gentoo-dev 2005-07-20 14:46:53 UTC
$URL contains some minor errors:  
  
a) You don't have to use LC_ALL for UTF-8 usage; LC_CTYPE can do the trick as  
well.  
b) Section about kernel configuration could speak about possible usage of  
modules as well.  
c) I don't have to set LESSCHARSET on my system and LESS displays UTF-8  
encoded files correctly even with only LC_CTYPE=cs_CZ.UTF-8 .
Comment 1 Shyam Mani (RETIRED) gentoo-dev 2005-07-20 20:02:29 UTC
(In reply to comment #0)
> a) You don't have to use LC_ALL for UTF-8 usage; LC_CTYPE can do the trick as  
> well.  

I think there was an issue with this as some packages weren't picking up the
locale correctly? IIRC, SwifT was the one who addressed this in the doc, but I'm
not able to find this in the cvs logs.

> b) Section about kernel configuration could speak about possible usage of  
> modules as well.  

Modules for?

> c) I don't have to set LESSCHARSET on my system and LESS displays UTF-8  
> encoded files correctly even with only LC_CTYPE=cs_CZ.UTF-8 .

Look up man less. If it works without, fine. LESSCHARSET just forces less to use
the specified charset and we really don't lose 10GB of space by setting it you
know ;) If you can send me a few utf-8 files, I'd be glad to check. As of now I
guess LESSCHARSET stays...
Comment 2 Jan Kundrát (RETIRED) gentoo-dev 2005-07-21 00:01:37 UTC
(In reply to comment #1) 
> I think there was an issue with this as some packages weren't picking up the 
> locale correctly? IIRC, SwifT was the one who addressed this in the doc, but 
I'm 
> not able to find this in the cvs logs. 
 
I'll talk to him. 
 
> Modules for? 
 
For selecting NLS support in kernel, code listing 3.1. 
 
> Look up man less. If it works without, fine. LESSCHARSET just forces less to 
use 
> the specified charset and we really don't lose 10GB of space by setting it 
you 
> know ;) If you can send me a few utf-8 files, I'd be glad to check. As of now 
I 
> guess LESSCHARSET stays... 
 
Check files under gentoo/xml/htdocs/doc/cs/ :-). 
 
From `man less`: 
 
If neither LESSCHARSET nor LESSCHARDEF is set, but the string "UTF-8" is found 
in the LC_ALL, LC_TYPE or LANG environment  variables, then the default 
character set is utf-8. 
 
If  that  string  is  not found, but your system supports the setlocale 
interface, less will use setlocale to determine the character set.  setlocale 
is controlled by setting the LANG or LC_CTYPE environment variables. 
Comment 3 Shyam Mani (RETIRED) gentoo-dev 2005-07-21 00:16:50 UTC
(In reply to comment #2)

> For selecting NLS support in kernel, code listing 3.1. 

Doesn't make any major difference IMHO :)
  
> Check files under gentoo/xml/htdocs/doc/cs/ :-). 

Will do.
  
> If neither LESSCHARSET nor LESSCHARDEF is set, but the string "UTF-8" is found 
> in the LC_ALL, LC_TYPE or LANG environment  variables, then the default 
> character set is utf-8. 

Bug #91777
Comment 4 Jan Kundrát (RETIRED) gentoo-dev 2005-07-21 00:22:44 UTC
As fox2mike suggested, I've read bug 91777, but I think that the proposed fix 
is redundant. Original thread poster 
(http://forums.gentoo.org/viewtopic-p-2385415.html) used mallformed locale 
"de_DE.utf8" instead of correct one "de_DE.UTF-8". I've just checked on my 
system with cs_CZ locale and *both* utf-8 and UTF-8 are supported while utf8 is 
not. 
Comment 5 Jan Kundrát (RETIRED) gentoo-dev 2005-07-22 03:44:10 UTC
Additional point: 
 
d) Code Listing 3.6 suggests re-emerging slang and ncurses. Portage supports 
the --newuse flag which is ideal for such purposes: 
`emerge --update --newuse sys-libs/ncurses sys-libs/slang` 
 
e) "If an application has support for both a Qt and GTK+2 GUI, the GTK+2 GUI 
will generally give better results with Unicode." - any reasons? :-] 
 
f) "To build fonts (including the Bitstream Vera set) with support for East 
Asian letters with X, make sure you have the <c>cjk</c> USE flag set. Many 
other applications utilise this flag, so it may be worthwhile to add it as a 
*permanent* USE flag." - IMHO "global" is better. 
Comment 6 Shyam Mani (RETIRED) gentoo-dev 2005-07-22 06:22:37 UTC
(In reply to comment #5)

> d) Code Listing 3.6 suggests re-emerging slang and ncurses. Portage supports 
> the --newuse flag which is ideal for such purposes: 
> `emerge --update --newuse sys-libs/ncurses sys-libs/slang` 

I think neither of the two is required. Just plain emerge slang will 

a) pick up the new USE flag and
b) install the latest

  
> e) "If an application has support for both a Qt and GTK+2 GUI, the GTK+2 GUI 
> will generally give better results with Unicode." - any reasons? :-] 

Because they have better support? Ping slarti for more info, I'm cc'ing him on
this bug btw.
Comment 7 Jan Kundrát (RETIRED) gentoo-dev 2005-07-22 09:12:50 UTC
(In reply to comment #6) 
> > `emerge --update --newuse sys-libs/ncurses sys-libs/slang`  
>  
> I think neither of the two is required. Just plain emerge slang will  
>  
> a) pick up the new USE flag and 
> b) install the latest 
 
Users don't have to re-emerge; it is required only if the USE flag changed. 
Comment 8 Shyam Mani (RETIRED) gentoo-dev 2005-07-22 12:37:53 UTC
(In reply to comment #7)
> Users don't have to re-emerge; it is required only if the USE flag changed. 

That goes without saying. They wouldn't be reading the doc IMHO if they were
utf-8 compliant :) and if they are to make a system utf-8 compliant, 99% of them
would need to re-emerge.
Comment 9 Jan Kundrát (RETIRED) gentoo-dev 2005-07-22 13:00:38 UTC
(In reply to comment #8) 
> (In reply to comment #7) 
> > Users don't have to re-emerge; it is required only if the USE flag changed.  
>  
> That goes without saying. They wouldn't be reading the doc IMHO if they were 
> utf-8 compliant :) and if they are to make a system utf-8 compliant, 99% of 
them 
> would need to re-emerge. 
 
IMHO not - they are reading it just because they want to have it utf-8 
compliant, and maybe they will find out that they are completed from 73%. Why 
to force them to do useless compilation? 
Comment 10 Sven Vermeulen (RETIRED) gentoo-dev 2005-08-01 05:44:03 UTC
On the LC_ALL subject: for one, it is easier to set it completely, other LC_'s
who do benefit from it take it as well (I think LC_MESSAGES does) and others
ignore it. Using specific categories is only usefull when you want to deviate
from the default. In this case, we are setting the default.

The re-emerge stuff is indeed formulated differently, I'll rewrite to use the
more recommended method.
Comment 11 Sven Vermeulen (RETIRED) gentoo-dev 2005-08-01 05:54:56 UTC
I've fixed the command to update slang/blabla. I left the "permanent" USE flag
as-is, I don't think it causes confusion. I also left the GTK vs Qt as-is since
GTK just has a better international community working on it afaik.