Bug 349255 - Incomplete OpenSource MS Word spec (particularly broken codepage set)
Summary: Incomplete OpenSource MS Word spec (particularly broken codepage set)
Status: RESOLVED UPSTREAM
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages
Hardware: All Linux
Importance: High normal
Assignee: Gentoo Linux bug wranglers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2010-12-21 08:30 UTC by Sergey S. Starikoff
Modified: 2010-12-21 16:07 UTC (History)
CC List: 0 users

See Also:
Package list:
Runtime testing required: ---


Attachments

Description Sergey S. Starikoff 2010-12-21 08:30:14 UTC
Excuse me for posting this bug here. I'm not sure about the proper way to report it upstream.

The issue is that in most (perhaps almost all) MS Word files of a certain type
(created, I think, by opening a saved HTML page in MS Word and exporting it to
.doc format), some parts of the text are displayed in the wrong codepage.

Example file (one of many): http://flibusta.net/b/145462
(Note: registration is required. Should I attach the file to this bug instead?)

There are many affected passages.
For example, on page 8:
Сегодня, кажется, все согласны с тем, что
этика есть наука о морали (нравственности).
Но вопрос о том, что представляет собой
нравственность, остается невыясненным;
речь идет не о теоретическом определении
(хотя и здесь много спорных проблем), а об
установлении эмпирических границ,
фиксации качественного своеобразия
явления. Парадокс, обнаружившийся в
творчестве Оссовской, вообще свойствен
развитию этики, и он состоит в том, что
этика смело рассуждает о сущности морали,
но не умеет вычленить ее как эмпирическое
явление. Нравственность образует такую
область действительности — область
межчеловеческих отношений, — которую
нельзя идентифицировать без апелляции к
терминам морального сознания. Однако
моральное сознание не является надежным
путеводителем в мире ценностей, ибо оно не
только выражает, но весьма часто, а в
определенных социальных условиях как
правило — искажает действительный
ценностный смысл поступков, отношений; без
предварительной критики оно не может стать
эмпирическим основанием науки. Увы,
çåðêàëî ìîðàëüíîãî ñîçíàíèÿ — êðèâîå
çåðêàëî. È ýòî íå åäèíñòâåííàÿ òðóäíîñòü,
ïðåïÿòñòâóþùàÿ òîìó, ÷òîáû íðàâñòâåííîñòü
èç îáûäåííîãî ôàêòà ñòàëà ôàêòîì íàóêè.

Microsoft Word displays it correctly.
Discussion (in Russian): http://flibusta.net/node/92084 (with a screenshot of the
affected passage of the problem file in Microsoft Word 2010).
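
The garbled lines at the end of the page 8 sample are consistent with Windows-1251 (Cyrillic) bytes being decoded as Windows-1252 (Western European). That is an assumption based only on the characters visible above, not on the internals of the .doc file. A minimal Python sketch that reverses such a mis-decoding (the function name `repair_mojibake` and its parameter names are made up for illustration):

```python
def repair_mojibake(text: str, shown_as: str = "cp1252", real: str = "cp1251") -> str:
    """Undo a wrong decode.

    Re-encode the text with the codec that was wrongly applied (assumed
    here to be cp1252), then decode the resulting bytes with the codec
    they were actually written in (assumed here to be cp1251).
    """
    return text.encode(shown_as).decode(real)


if __name__ == "__main__":
    # Two words copied from the garbled part of the page 8 sample.
    for garbled in ("çåðêàëî", "êðèâîå"):
        print(garbled, "->", repair_mojibake(garbled))
```

If the assumption holds, the two words come out as readable Russian ("зеркало", "кривое"). That would suggest the word processor picks the wrong codepage for some text runs rather than losing any data.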

So far checked in app-office/abiword-2.8.6 and reported upstream: bug http://bugzilla.abisource.com/show_bug.cgi?id=12917
I have seen the same issue in app-office/openoffice-3.2.0.
Not yet checked in app-office/openoffice-bin.
Could anybody check whether the KOffice word processor has this issue?

Reproducible: Always

Steps to Reproduce:
Comment 1 Jeroen Roovers (RETIRED) gentoo-dev 2010-12-21 16:07:46 UTC
Please do not assume anyone speaks Russian, or for that matter is inclined to register at a website not under Gentoo's control, when reporting bugs.

Attach as a file anything that you found elsewhere; do not link to external discussions unless they are upstream discussions pertaining to packages that Gentoo distributes.

Also, you should paste your `emerge --info' output in a comment, and the Summary should preferably contain a category/package so we can figure out whom to assign this bug report to.

As it looks like this is a problem with documents exported by (some novel, probably incompletely documented, proprietary) MS Word, and subsequently opened by alternative word processor programs, there doesn't appear to be much that we can do about it right now.

A first step in the right direction would be to address the issue upstream, at Microsoft preferably, or at the support sites of the programs you are trying to open MS Word documents with.