Bug 136355

Summary: ruby-1.8.4-r1 - REXML module corrupts UTF-8 data
Product: Gentoo Linux
Reporter: Eli Venter <eli_venter>
Component: [OLD] Development
Assignee: Gentoo Ruby Team <ruby>
Status: RESOLVED FIXED    
Severity: major    
Priority: High    
Version: unspecified   
Hardware: All   
OS: Linux   
Whiteboard:
Package list:
Runtime testing required: ---

Comment 1 Eli Venter 2006-06-10 20:53:15 UTC
As mentioned at the following link:

https://www.zarb.org/pipermail/booh-discuss/2006-February/000146.html

Using the REXML module with UTF-8 chars can corrupt data. Running the simple test file from the above link shows the current stable ruby has this bug.

$ epm -q ruby
ruby-1.8.4-r1
$ ruby t.rb
andré is nice
andré is nice
andr&#233; is nice

The workaround used to be to use ruby 1.8.2, but that version was just removed from Portage.
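
The t.rb file from the linked post is not reproduced in this report. As a rough illustration only, a comparable round-trip check might look like the sketch below. It is written for a current Ruby with the rexml library available, not for 1.8.4, and the helper name rexml_round_trip and the element name are made up for this example. It puts a UTF-8 string into an element, serializes the document, parses it again, and compares the text that comes back with the original.

```ruby
# encoding: utf-8
require 'rexml/document'

# Hypothetical helper (not from the linked post): store +text+ in an
# element, serialize the document to a string, re-parse that string,
# and return the element text that comes back.
def rexml_round_trip(text)
  doc = REXML::Document.new
  root = doc.add_element('name')   # 'name' is an arbitrary element name
  root.text = text

  serialized = doc.to_s            # XML as a Ruby string
  reparsed = REXML::Document.new(serialized)
  reparsed.root.text
end

original = "andré is nice"
result = rexml_round_trip(original)

puts result
puts(result == original ? 'round trip OK' : 'round trip CORRUPTED')
```

On an unaffected Ruby the returned text should match the original byte for byte; a mismatch would point at the kind of corruption described here.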
Comment 2 Caleb Tennis (RETIRED) gentoo-dev 2006-06-13 05:38:46 UTC
Is this fixed in a version more recent than 1.8.4-r3?

There have been security vulnerabilities in versions < 1.8.3, so we can't easily revert.
Comment 3 Eli Venter 2006-06-13 07:00:18 UTC
I hadn't tried it until now. It does appear to be fixed in ruby-1.8.4.20060529.