Summary: | sys-apps/less-394 cannot display some html | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Michal Suchanek <hramrach> |
Component: | Current packages | Assignee: | Gentoo's Team for Core System packages <base-system> |
Status: | RESOLVED WONTFIX | ||
Severity: | normal | ||
Priority: | High | ||
Version: | unspecified | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- |
Description
Michal Suchanek
2007-07-04 14:03:29 UTC
dont really know what you expect from less here ... get links or lynx to work with unicode I would expect it not to use broken software unless I explicitly set it up to. (In reply to comment #2) > I would expect it not to use broken software unless I explicitly set it up to. So install lynx instead; it handles unicode just fine. The script uses links before lynx. I have lynx 2.8.6-r2 installed. Not that it is much good at handling unicode either. If I go to http://www.ruby-lang.org/ja and save the page, I can at least tell it is Japanese by looking at it by "cat page.html | less". "lynx page.html" yields very little text, it does not even hint there is some content that could be seen if stuff worked correctly, seems like the page is broken rather than the viewer. This is much worse that links. links displays lots of garbage instead of the Japanese text. It is not possible to tell what it is but at least something is there. When I save http://seznam.cz and view it with lynx it again removes some characters. I do not see how this is better handling of unicode, or any handling of unicode at all. With links I see the page properly, apparently it has some conversion table for Latin characters with diacritics, and converts them to ascii. Not ideal as I could see all the characters but it picks them from the html for me. Generally picking the text from html would be nice but I do not see it working so I would rather have text with the tags than (almost) nothing. get links/lynx fixed oh, so you install a script that relies on a functionality that's never been implemented in either of the three packages it uses, and it is not a bug in the script but rather in the packages it misuses? |