The Perl module HTML::Parser includes support for decoding Unicode entities in HTML (such as &bdquo;, which renders as „) into the corresponding characters. This should be enabled during installation; however, the current ebuild automatically answers 'no' to that question. Attached is a modified ebuild that answers 'yes' if the USE flags include unicode.

Reproducible: Always

Steps to Reproduce:
1. Install HTML::Parser.
2. Use it to parse a web page that contains Unicode entities (for example, http://www.witkacy.hg.pl/witkosmos/kosmopis.html).

Actual Results:
The text contains the original entities.

Expected Results:
(Optionally) translate them to Unicode characters.
Created attachment 39241: HTML::Parser ebuild with optional unicode entities support
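For readers without the attachment, the idea can be sketched roughly as follows. This is not the attached ebuild: `use_enabled`, `USE_FLAGS`, and `pick_answer` are hypothetical stand-ins for Portage's `use` helper so that the snippet runs outside an ebuild, and the exact reply the installer expects ("y"/"n" here) is an assumption.

```shell
# Hypothetical stand-in for Portage's `use` helper: succeeds when the
# flag appears as a whole word in the space-separated USE_FLAGS variable.
use_enabled() {
    case " ${USE_FLAGS} " in
        *" $1 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Choose the reply to the installer's question about Unicode entity
# support (assumed here to accept "y" or "n").
pick_answer() {
    if use_enabled unicode; then
        echo "y"
    else
        echo "n"
    fi
}

USE_FLAGS="perl unicode"
pick_answer
```

In a real ebuild the chosen reply would then be piped into the module's configuration step instead of being printed.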
That is an unfortunate oversight on my part (I could lay blame on the legacy of the ebuild, but that wouldn't be fair). Corrected and noted in 3.36-r1.
The $answer variable should not be quoted (at least not single-quoted) in the echo statement, because single quotes prevent the variable from being expanded. Since you seem to prefer curly braces around variables, it should probably be echo ${answer} ... or echo "${answer}" ... (Note that there are two such statements.)
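The difference is easy to reproduce in a plain shell. A small sketch, with `answer` set by hand and the names `single`, `bare`, and `double` chosen only for illustration:

```shell
answer="y"

# Single quotes suppress expansion, so the literal text is emitted.
single=$(echo '${answer}')

# The unquoted and double-quoted forms both expand the variable.
bare=$(echo ${answer})
double=$(echo "${answer}")

printf 'single: %s\nbare:   %s\ndouble: %s\n' "$single" "$bare" "$double"
```

The single-quoted form yields the literal string ${answer}, while the other two yield y. Of the two working forms, the double-quoted one is generally the safer habit, since it also protects the value from word splitting.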
Thanks for the catch; corrected and re-closing.