Bug 132256 - myspell-hu dictionary files version update & url correction
Summary: myspell-hu dictionary files version update & url correction
Status: VERIFIED TEST-REQUEST
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: New packages
Hardware: All Linux
Importance: High enhancement
Assignee: Kevin F. Quinn (RETIRED)
URL: http://magyarispell.sourceforge.net/
Whiteboard:
Keywords: InVCS
Depends on:
Blocks:
 
Reported: 2006-05-04 07:02 UTC by András
Modified: 2006-06-17 04:52 UTC
Comment 1 András 2006-05-04 07:02:29 UTC
The newest versions of the Hungarian affix and spelling dictionary files are available on the official home page maintained by hunspell author László Németh:
http://magyarispell.sourceforge.net/
This is the latest version (1.0):
http://magyarispell.sourceforge.net/hu_HU-1.0.tar.gz

And the most up-to-date version of the Hungarian hyphenation dictionary (by Bence Nagy) can be found here:
http://www.tipogral.hu/huhyphn/
This is the latest version (20050329):
http://www.tipogral.hu/download/huhyphn-20050329.tar.gz

They're a lot newer than those that can be found on the OpenOffice site, which are rather old now (>2 years), so the source should be changed in favour of the official ones.
I've been using them since their release without any problems (with hunspell and in OpenOffice as well).

(The thesaurus dictionary is currently unmaintained, or at least I don't know of any newer versions...)

thx
Comment 2 Kevin F. Quinn (RETIRED) gentoo-dev 2006-05-04 08:22:08 UTC
Thanks for the info :)

About the thesaurus - is the thesaurus from the openoffice.org site still valid?  Would it make sense for me to install the spelling and hyphenation dictionaries from the sites you list, along with the old thesaurus?
Comment 3 András 2006-05-04 08:49:13 UTC
Thanks for the quick reply.
The included README_th_hu_HU.txt file says that it is only in an alpha state, which means that it contains very few words but is otherwise valid and usable.
It has nothing to do with spellchecking, so I think it wouldn't hurt to install it along with the others; maybe someone will find it useful.

There were plans to include the official Hungarian thesaurus dictionary, but maybe they had no time to complete it and/or some legal issues came up...
Comment 4 Kevin F. Quinn (RETIRED) gentoo-dev 2006-05-04 16:15:27 UTC
There seem to be several dictionaries in the files from magyarispell.sourceforge.net:

hu_HU.{aff,dic}
hu_HU_morph.{aff,dic}
hu_HU_u8.{aff,dic}

Are these three separate dictionaries?  What happens when you install them - what does the text in /usr/lib/openoffice/share/dict/ooo/dictionary.lst look like?

Can they all be installed for OpenOffice simultaneously or are they alternatives, i.e. can OpenOffice only refer to one of them?

I notice the hu_HU.{aff,dic} files are substantially smaller than the ones on the OpenOffice site.

Sorry to ask so many questions, but as I don't know any Hungarian I can't read what's on the site :}
Comment 5 András 2006-05-06 09:54:12 UTC
> hu_HU_morph.{aff,dic}
These resources are only for the morphological analyzer and thus should not be included in the dictionary.lst file.

> hu_HU.{aff,dic}
> hu_HU_u8.{aff,dic}
These are the resources for the spell checker in latin2 (ISO-8859-2) and in utf-8 format, respectively. They're almost the same, the only little difference is that now there are a couple of non-hungarian words included in the utf-8 dictionary which don't fit into latin2 (e.g 
Comment 6 András 2006-05-06 09:54:12 UTC
> hu_HU_morph.{aff,dic}
These resources are only for the morphological analyzer and thus should not be included in the dictionary.lst file.

> hu_HU.{aff,dic}
> hu_HU_u8.{aff,dic}
These are the resources for the spell checker in latin2 (ISO-8859-2) and in UTF-8 format, respectively. They're almost the same; the only small difference is that the UTF-8 dictionary now includes a couple of non-Hungarian words which don't fit into latin2 (e.g. Ångström).

I asked the author about this, but he hasn't answered yet. I also checked the OpenOffice version released by the Hungarian FSF, and they install the UTF-8 files. (I use this version from my portage overlay.)
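
The latin2 limitation mentioned above can be checked with a few lines of Python. This is only a rough sketch: the helper name `fits_latin2` and the sample word list are made up for illustration and are not taken from the dictionary files themselves.

```python
def fits_latin2(word: str) -> bool:
    """Return True if every character of `word` exists in ISO-8859-2 (latin2)."""
    try:
        word.encode("iso8859_2")
    except UnicodeEncodeError:
        return False
    return True


# Hypothetical sample: two ordinary Hungarian words plus the loanword
# mentioned in the comment above.
samples = ["árvíztűrő", "tükörfúrógép", "Ångström"]
latin2_report = {word: fits_latin2(word) for word in samples}

for word, ok in latin2_report.items():
    print(f"{word}: {'fits latin2' if ok else 'needs UTF-8'}")
```

The Hungarian letters ő and ű are part of ISO-8859-2, so native words encode fine; it is the Å in Ångström that has no latin2 code point.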
http://hu.openoffice.org/about-downloads.html (in hungarian)
http://www.fsf.hu/en/about_us.en.html

So I think all of them should be installed (or controlled by USE flags), but only the UTF-8 version should be listed in the OpenOffice config file.

I am glad I could help to improve the Hungarian language support in Gentoo.
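
As a rough illustration of that suggestion, the sketch below builds the lines that would go into dictionary.lst. It assumes the usual OpenOffice 2 layout of `TYPE language country basename` per line; the function name `dictionary_lst_entries` and the base names `hyph_hu_HU` and `th_hu_HU` are assumptions for the example, so check them against the files actually installed.

```python
def dictionary_lst_entries(lang, country, spell, hyph=None, thes=None):
    """Build dictionary.lst lines, assuming the 'TYPE lang country basename' layout."""
    entries = [f"DICT {lang} {country} {spell}"]
    if hyph:
        entries.append(f"HYPH {lang} {country} {hyph}")
    if thes:
        entries.append(f"THES {lang} {country} {thes}")
    return entries


# Only the UTF-8 spelling dictionary is listed; hu_HU (latin2) and
# hu_HU_morph (morphological analyzer) may be installed but are not registered.
entries = dictionary_lst_entries(
    "hu", "HU",
    spell="hu_HU_u8",
    hyph="hyph_hu_HU",  # assumed base name of the hyphenation dictionary
    thes="th_hu_HU",    # assumed base name of the thesaurus
)
print("\n".join(entries))
```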
Comment 7 András 2006-05-07 04:50:50 UTC
A little update:
I received the answer from the author and he also suggests using the utf-8 version for Openoffice2.
I was told that he's currently working on transforming some aspell dictionaries (of other languages) into hunspell format and after that he is going to ask the official Openoffice homepage maintainers to update the list with the new versions. So we can expect more to come... :)
Comment 8 Kevin F. Quinn (RETIRED) gentoo-dev 2006-05-09 11:47:51 UTC
> I am glad I could help to improve the hungarian language support in Gentoo.

Much appreciated, and thanks for taking the time to investigate for me.  We want to provide whatever is best for the users of the language; obviously I don't know what that is until a native speaker tells me :)

> So we can expect more to come... :)

That'll keep me busy :)

The new ebuild is in CVS, so should come through shortly.
Let me know here if it works for you (or if there are problems!).
Comment 9 András 2006-06-14 10:20:59 UTC
Everything seems to be ok here.

The only problem I had will hopefully be solved by the eselect-oodict program. I wanted to alert you earlier, but I noticed the new ebuild a little later and calmed down a bit.

The stable version of Huhyphn was updated two weeks ago:
http://www.tipogral.hu/download/huhyphn-20060531.tar.gz

Plus one minor error to correct - the url of the hyphenation dictionary is the following:
http://www.tipogral.hu/index.rbx/site/projects/huhyphn

Btw. the next version will be released this summer. I'll file a bug for it then. (Huhyphn3 is currently beta and can be used for testing only.)
Comment 10 Kevin F. Quinn (RETIRED) gentoo-dev 2006-06-17 04:52:50 UTC
(In reply to comment #7)
> The stable version of Huhyphn has been updated two weeks ago:
> http://www.tipogral.hu/download/huhyphn-20060531.tar.gz

I've bumped the package for this; thanks.  Any problems with it, raise a new bug.

> Plus one minor error to correct - the url of the hyphenation dictionary is the
> following:
> http://www.tipogral.hu/index.rbx/site/projects/huhyphn

Fixed, thanks.

> Btw. the next version will be released this summer. I'll file a bug for it
> then. (Huhyphn3 is currently beta and can be used for testing only.)

Please do, thanks.  I'm relying on users to notify me of new versions of the myspell dictionaries, rather than monitoring all 42 of them ;)