Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!

Bug 263531

Summary: dev-texlive/texlive-langczechslovak - native UTF-8 support for CSLaTeX
Product: Gentoo Linux Reporter: Jiri Tyr <jiri.tyr>
Component: New packagesAssignee: TeX project <tex>
Status: RESOLVED UPSTREAM    
Severity: enhancement CC: petr.pisar
Priority: High    
Version: 2008.0   
Hardware: All   
OS: Linux   
URL: http://www.olsak.net/enctex.html
Whiteboard:
Package list:
Runtime testing required: ---
Attachments: texlive-langczechslovak-2008-r1.ebuild
files/format.cslatex.cnf
files/format.csplain.cnf
texlive-langczechslovak.diff
texlive-langczechslovak-2009-r1.ebuild
texlive-langczechslovak-2009.diff

Description Jiri Tyr 2009-03-23 19:02:27 UTC
Me and friend of mine, we have implemented a better solution for UTF-8 encoding in CSLaTeX. It is based on encTeX from Czech LaTeX guru Petr Olsak. EncTeX allows full UTF-8 processing in standard 8-bit TeX. Comparing to the usage of the "\usepackage[utf8]{inputenc}" in the preamble of a document, this solution do not need any additional package because for the character translation is using the TeX preprocessor.

Further we implemented some corrections which are needed for the fully operational CSLaTeX.
Comment 1 Jiri Tyr 2009-03-23 19:04:46 UTC
Created attachment 186024 [details]
texlive-langczechslovak-2008-r1.ebuild

Ebuild for native UTF-8 support in CSLaTeX.
Comment 2 Jiri Tyr 2009-03-23 19:05:48 UTC
Created attachment 186026 [details]
files/format.cslatex.cnf

format.cslatex.cnf with UTF-8 support.
Comment 3 Jiri Tyr 2009-03-23 19:06:37 UTC
Created attachment 186027 [details]
files/format.csplain.cnf

format.csplain.cnf with UTF-8 support.
Comment 4 Lars Wendler (Polynomial-C) (RETIRED) gentoo-dev 2009-03-23 22:12:23 UTC
Please attach diffs when you did changes to an ebuild. That would be much more
handy for our devs.
Comment 5 Jiri Tyr 2009-03-23 22:22:54 UTC
Created attachment 186054 [details]
texlive-langczechslovak.diff

Here is the diff as requested.
Comment 6 Alexis Ballier gentoo-dev 2009-03-24 09:18:29 UTC
(In reply to comment #0)
> Me and friend of mine, we have implemented a better solution for UTF-8 encoding
> in CSLaTeX. It is based on encTeX from Czech LaTeX guru Petr Olsak. EncTeX
> allows full UTF-8 processing in standard 8-bit TeX. Comparing to the usage of
> the "\usepackage[utf8]{inputenc}" in the preamble of a document, this solution
> do not need any additional package because for the character translation is
> using the TeX preprocessor.


IMHO we shouldn't differ from upstream there, have you tried to contact cslatex and texlive upstreams? As far as I know the standard TeX behavior is to use only ASCII characters unless you change the inputenc; you can modify locally your fmtutil configuration files because they are config protected in /etc/texmf/fmtutil.d so this is just a matter of configuration and I think the default should be kept to what the standard is and what upstream uses.
Comment 7 Jiri Tyr 2009-03-26 15:08:08 UTC
(In reply to comment #6)
> IMHO we shouldn't differ from upstream there, have you tried to contact cslatex
> and texlive upstreams? As far as I know the standard TeX behavior is to use
> only ASCII characters unless you change the inputenc; you can modify locally
> your fmtutil configuration files because they are config protected in
> /etc/texmf/fmtutil.d so this is just a matter of configuration and I think the
> default should be kept to what the standard is and what upstream uses.

You are wrong. It has nothing to do with the mainstream of the CSLaTeX or TeXLive. If you read the encTex documentation (http://www.olsak.net/enctex.html), you can find the following differences:

- inputenc translates UTF-8 character into active characters (ř -> \v{r})
- encTeX is working with real UTF-8 characters

Conclusion:
- inputenc can bring problems in some macros:
  \def\compareutf8str{\somevar\string \someothervar\string}
- inputenc doesn't show correct UTF-8 text in log files, encTeX does

From my point of view the proposed unicode USE flag is relevant to the texlive-langczechslovak package as well as to any other texlive-lang package which supports encTeX's way of UTF-8 handling.

Please consider this enhanclemnt one more time.
Comment 8 Jiri Tyr 2010-02-04 14:39:40 UTC
Created attachment 218415 [details]
texlive-langczechslovak-2009-r1.ebuild

Ebuild for the texlive-langczechslovak-2009 with a native unicode support.
Comment 9 Jiri Tyr 2010-02-04 14:41:44 UTC
Created attachment 218417 [details]
texlive-langczechslovak-2009.diff

Diff against the official texlive-langczechslovak-2009 ebuild.
Comment 10 Alexis Ballier gentoo-dev 2015-07-28 06:37:20 UTC
this is changed upstream these days i think