From Fedora Project Wiki
No edit summary
m (internal link cleaning)
 
(40 intermediate revisions by one other user not shown)
Line 7: Line 7:
| '''Language Code'''  || '''Language'''    || '''hunspell''' || '''hyphen''' || '''mythes''' || '''notes'''
| '''Language Code'''  || '''Language'''    || '''hunspell''' || '''hyphen''' || '''mythes''' || '''notes'''
|-
|-
| aa || Afar                ||              ||          ||          || [http://www.afarfriends.org/lang_eng.htm afarfriends.org] hosted [http://www.afarfriends.org/Dok%20t%20websida/Afar_lang/Afaraf%20&%20its%20dictionary%20preparation.pdf ALSEC report].
| aa || Afar                ||              ||          ||          || [http://www.afarfriends.org/lang_eng.htm afarfriends.org] hosted [http://www.afarfriends.org/Dok%20t%20websida/Afar_lang/Afaraf%20&%20its%20dictionary%20preparation.pdf ALSEC report].
|-
|-
| af || Afrikaans          || hunspell-af  || hyphen-af ||          ||
| af || Afrikaans          || hunspell-af  || hyphen-af ||          ||
|-f
|-f
| am || Amharic            || hunspell-am  || || ||  
| am || Amharic            || hunspell-am  || || ||  
|-
|-
| an || Aragonese          ||              ||          ||          || [http://www.iea.es/ www.iea.es], see [http://www.cs.brandeis.edu/~roser/pubs/ell2_05.pdf Spain: Lexicography In Iberian Languages]
| an || Aragonese          ||              ||          ||          || [http://www.iea.es/ www.iea.es], see [http://www.cs.brandeis.edu/~roser/pubs/ell2_05.pdf Spain: Lexicography In Iberian Languages]
 
|-
|-
| ar || Arabic              || hunspell-ar  ||          || [http://ayaspell.sourceforge.net/ experimental thesaurus]        ||
| ar || Arabic              || hunspell-ar  ||          || [http://ayaspell.sourceforge.net/ experimental thesaurus]        ||
|-
|-
| as || Assamese            || hunspell-as  || hyphen-as ||          || [http://www.xobdo.net/ xobdo] is another potential source, possibly even for a thesaurus, but this isn't an option apparently at the moment.
| as || Assamese            || hunspell-as  || hyphen-as ||          || [http://www.xobdo.net/ xobdo] is another potential source, possibly even for a thesaurus, but this isn't an option apparently at the moment.
|-
|-
| ast || Asturian          || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building            ||          ||          || [http://www.academiadelallingua.com/diccionariu/index.php www.academiadelallingua.com], see [http://www.cs.brandeis.edu/~roser/pubs/ell2_05.pdf Spain: Lexicography In Iberian Languages]. Asturian [http://wiki.softastur.org/Portada translation] team
| ast || Asturian          || hunspell-ast ||          ||          || [http://blogs.altuxa.com/softastur/el-diccionariu-de-softastur-pasa-a-beta-publica.html dictionary announcement]
|-
|-
| az || Azeri (Latin)      || hunspell-az  ||          ||          ||
| az || Azeri (Latin)      || hunspell-az  ||          ||          ||
|-
|-
| be || Belarusian          || hunspell-be  || hyphen-be ||          ||
| be || Belarusian          || hunspell-be  || hyphen-be ||          ||
|-
|-
| ber || Amazigh (Tifinagh) || hunspell-ber ||          ||          ||
| ber || Amazigh (Tifinagh) || hunspell-ber ||          ||          ||
Line 30: Line 29:
| ber || Amazigh (Latin)    ||              ||          ||          ||
| ber || Amazigh (Latin)    ||              ||          ||          ||
|-
|-
| bg || Bulgarian          || hunspell-bg  || hyphen-bg || mythes-bg ||
| bg || Bulgarian          || hunspell-bg  || hyphen-bg || mythes-bg ||
|-
|-
| bn || Bengali            || hunspell-bn  || hyphen-bn ||          ||
| bn || Bengali            || hunspell-bn  || hyphen-bn ||          ||
|-
|-
| bo || Tibetan            ||              ||          ||          || [http://bo.openoffice.org/ bo.openoffice.org]. Latest language support [http://marketing.openoffice.org/ooocon2007/programme/friday_90.pdf update].
| bo || Tibetan            ||              ||          ||          || [http://bo.openoffice.org/ bo.openoffice.org]. Latest language support [http://marketing.openoffice.org/ooocon2007/programme/friday_90.pdf update].
|-
|-
| br || Breton              || hunspell-br  ||          ||          ||
| br || Breton              || hunspell-br  ||          ||          ||
|-
|-
| bs || Bosnian            || hunspell-bs  || hyphen-bs ||          ||
| bs || Bosnian            || hunspell-bs  || hyphen-bs ||          ||
|-
|-
| byn || Blin              ||              ||          ||          || [http://www.lingref.com/cpp/acal/36/paper1411.pdf Blin Orthography: A History and an Assessment]
| byn || Blin              ||              ||          ||          || [http://www.lingref.com/cpp/acal/36/paper1411.pdf Blin Orthography: A History and an Assessment]
|-
|-
| ca || Catalan            || hunspell-ca  || hyphen-ca || mythes-ca ||
| ca || Catalan            || hunspell-ca  || hyphen-ca || mythes-ca ||
|-
|-
| crh || Crimean Tatar      || A [http://korpus.juls.savba.sk/QIRIM/#about-the-corpus corpus] || || || [http://translationproject.org/team/crh.html translation team]
| crh || Crimean Tatar      || A [http://korpus.juls.savba.sk/QIRIM/#about-the-corpus corpus] || || || [http://translationproject.org/team/crh.html translation team]
|-
|-
| cs || Czech              || hunspell-cs  || hyphen-cs || mythes-cs ||
| cs || Czech              || hunspell-cs  || hyphen-cs || mythes-cs ||
|-
|-
| csb || Kashubian          || hunspell-csb ||          ||          ||
| csb || Kashubian          || hunspell-csb ||          ||          ||
|-
|-
| cy || Welsh              || hunspell-cy  || hyphen-cy ||          ||
| cv  || Chuvash            || hunspell-cv  || || ||
|-
| cy || Welsh              || hunspell-cy  || hyphen-cy ||          ||
|-
|-
| da || Danish              || hunspell-da  || hyphen-da || mythes-da ||
| da || Danish              || hunspell-da  || hyphen-da || mythes-da ||
|-
|-
| de || German              || hunspell-de  || hyphen-de || mythes-de ||
| de || German              || hunspell-de  || hyphen-de || mythes-de ||
|-
|-
| dv  || Dhivehi            ||              ||          ||          ||  
| dv  || Dhivehi            ||              ||          ||          ||  
Line 60: Line 61:
[http://www.maldivesroyalfamily.com/pdf/maldives_dictionary_1.0.pdf English-Dhivehi dictionary]
[http://www.maldivesroyalfamily.com/pdf/maldives_dictionary_1.0.pdf English-Dhivehi dictionary]
|-
|-
| dz || Dzongkha            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || || Some [http://www.nabble.com/syllable-and-word.....-td23997292.html requests] for help/info.
| dz || Dzongkha            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || || Some [http://www.nabble.com/syllable-and-word.....-td23997292.html requests] for help/info.
|-
|-
| el || Greek              || hunspell-el  || hyphen-el || mythes-el ||
| el || Greek              || hunspell-el  || hyphen-el || mythes-el ||
|-
|-
| en || English            || hunspell-en  || hyphen-en || mythes-en ||
| en || English            || hunspell-en  || hyphen-en || mythes-en ||
|-
|-
| es || Spanish            || hunspell-es  || hyphen-es || mythes-es ||
| es || Spanish            || hunspell-es  || hyphen-es || mythes-es ||
|-
|-
| et || Estonian            || hunspell-ee  || hyphen-et ||          ||
| et || Estonian            || hunspell-ee  || hyphen-et ||          ||
|-
|-
| eu || Basque              || hunspell-eu  || hyphen-eu ||          ||
| eu || Basque              || hunspell-eu  || hyphen-eu ||          ||
|-
|-
| fa || Farsi              || hunspell-fa  || hyphen-fa ||          ||
| fa || Farsi              || hunspell-fa  || hyphen-fa ||          ||
|-
|-
| fi || Finnish            || Finnish Community has a parallel [http://voikko.sourceforge.net/ Voikko] solution. With an enchant backend, an OpenOffice.org extension, and a Firefox extension. || || ||
| fi || Finnish            || Finnish Community has a parallel [http://voikko.sourceforge.net/ Voikko] solution. With an enchant backend, an OpenOffice.org extension, and a Firefox extension. || || ||
|-
|-
| fil || Filipino          || hunspell-tl  ||          ||          || Filipino is effectively an official Tagalog-based language
| fil || Filipino          || hunspell-tl  ||          ||          || Filipino is effectively an official Tagalog-based language
|-
|-
| fo || Faeroese            || hunspell-fo  || hyphen-fo ||          ||
| fo || Faeroese            || hunspell-fo  || hyphen-fo ||          ||
|-
|-
| fr || French              || hunspell-fr  || hyphen-fr || mythes-fr ||  
| fr || French              || hunspell-fr  || hyphen-fr || mythes-fr ||  
|-
|-
| fur || Friulian          || hunspell-fur ||          ||          ||
| fur || Friulian          || hunspell-fur ||          ||          ||
|-
|-
| fy || Frisian            || hunspell-fy  ||          ||          ||
| fy || Frisian            || hunspell-fy  ||          ||          ||
|-
|-
| ga || Irish              || hunspell-ga  || hyphen-ga || mythes-ga ||
| ga || Irish              || hunspell-ga  || hyphen-ga || mythes-ga ||
|-
|-
| gd || Scots Gaelic        || hunspell-gd  ||          ||          ||
| gd || Scots Gaelic        || hunspell-gd  ||          ||          ||
|-
|-
| gez || Ge'ez              ||              ||          ||          || [http://addistribune.ethiopiaonline.net/ Ge'ez Frontier Foundation]
| gez || Ge'ez              ||              ||          ||          || [http://addistribune.ethiopiaonline.net/ Ge'ez Frontier Foundation]
|-
|-
| gl || Galician            || hunspell-gl  || hyphen-gl ||          ||
| gl || Galician            || hunspell-gl  || hyphen-gl ||          ||
|-
|-
| gu || Gujarati            || hunspell-gu  || hyphen-gu ||          ||
| gu || Gujarati            || hunspell-gu  || hyphen-gu ||          ||
|-
|-
| gv || Manx                || hunspell-gv  ||          ||          ||
| gv || Manx                || hunspell-gv  ||          ||          ||
|-
|-
| ha || Hausa              || [http://www.mail-archive.com/dev@lingucomponent.openoffice.org/msg01207.html crubadan possible wordlist] || || || [http://www.dictionary.kasahorow.com/en/all/ha www.dictionary.kasahorow.com]
| ha || Hausa              || [https://addons.mozilla.org/en-US/firefox/addon/85790 available] but no License mentioned. In private communication " We will specify licenses for the next release of the spell checkers. In the meantime, assume both Hausa and Eʋegbe have the GNU GPLv3 license as well." || || ||
|-
|-
| he || Hebrew              || hunspell-he  ||          ||          || [http://www-01.ibm.com/software/globalization/topics/bidi/hebrew.jsp info on hyphenation]
| he || Hebrew              || hunspell-he  ||          ||          || [http://www-01.ibm.com/software/globalization/topics/bidi/hebrew.jsp info on hyphenation]
|-
|-
| hi || Hindi              || hunspell-hi  || hyphen-hi || [http://www.cfilt.iitb.ac.in/wordnet/webhwn Hindi Wordnet] is likely convertible, claims to have similar format as English Wordnet, which is the basis of mythes-en        ||
| hi || Hindi              || hunspell-hi  || hyphen-hi || [http://www.cfilt.iitb.ac.in/wordnet/webhwn Hindi Wordnet] is likely convertible, claims to have similar format as English Wordnet, which is the basis of mythes-en        ||
|-
|-
| hne || Chhattisgarhi      || [http://borel.slu.edu/crubadan/apps.html corpus building] ||          ||          ||
| hne || Chhattisgarhi      || [http://borel.slu.edu/crubadan/apps.html corpus building] ||          ||          ||
|-
|-
| hr || Croatian            || hunspell-hr  || hyphen-hr ||          || This hasn't been updated in a number of years, on a purely orthographical basis I wonder if [http://extensions.services.openoffice.org/project/dict-sr dict-sr] would provide a better option
| hr || Croatian            || hunspell-hr  || hyphen-hr ||          || This hasn't been updated in a number of years, on a purely orthographical basis I wonder if [http://extensions.services.openoffice.org/project/dict-sr dict-sr] would provide a better option
|-
|-
| hsb || Upper Sorbian      || hunspell-hsb || hyphen-hsb ||          ||
| hsb || Upper Sorbian      || hunspell-hsb || hyphen-hsb ||          ||
|-
|-
| ht || Haitian Creole      || hunspell-ht  ||          ||          ||
| ht || Haitian Creole      || hunspell-ht  ||          ||          ||
|-
| hu  || Hungarian          || hunspell-hu  || hyphen-hu || mythes-hu ||
|-
|-
| hu || Hungarian          || hunspell-hu || hyphen-hu || mythes-hu ||
| hy  || Armenian            || hunspell-hy ||           ||           ||
|-
|-
| hy || Armenian            || hunspell-hy ||           ||          ||
| id  || Indonesian          || hunspell-id || hyphen-id ||          ||
|-
|-
| id || Indonesian          || hunspell-id  || hyphen-id ||           ||
| ig  || Igbo                || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || || [http://www.dictionary.kasahorow.com/en/all/ig www.dictionary.kasahorow.com]
|-
|-
| ig || Igbo                || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || || [http://www.dictionary.kasahorow.com/en/all/ig www.dictionary.kasahorow.com]
| ik  || Inupiaq            || Broken [http://www.alaskool.org/language/inupiaqpb/In_spellchecker.html download link] to MSWord dictionary || || || [http://giellatekno.uit.no/ipk.html Iñupiaq parser project],  [http://siuc01.si.ehu.es/~jipsagak/SALTMIL2010_Proceedings.pdf Finite-State Morphology for Iñupiaq]
|-
|-
| ik || Inupiaq            || Broken [http://www.alaskool.org/language/inupiaqpb/In_spellchecker.html download link] to MSWord dictionary || || || [http://giellatekno.uit.no/ipk.html Iñupiaq parser project]
| is  || Icelandic          || hunspell-is  || hyphen-is ||           ||
|-
|-
| is || Icelandic          || hunspell-is || hyphen-is ||           ||
| it  || Italian            || hunspell-it || hyphen-it || mythes-it ||
|-
|-
| it || Italian            || hunspell-it  || hyphen-it || mythes-it ||
| iu  || Inuktitut          ||             ||           ||           || [http://www.livingdictionary.com/backgroundandhistory.jsp www.livingdictionary.com]
|-
|-
| iu || Inuktitut          ||              ||          ||          || [http://www.livingdictionary.com/backgroundandhistory.jsp www.livingdictionary.com]
| ja  || Japanese            ||              ||          ||          ||
|-
|-
| ja || Japanese           ||             ||           ||           ||
| ka  || Georgian           || [http://borel.slu.edu/crubadan/stadas.html Crubadan] is aware of 29023 words || || || [http://ka.openoffice.org/ ka.openoffice.org] Some [http://www.illc.uva.nl/Borjomi/Proceedings/margvelani.doc info] on spellchecking the language.
|-
|-
| ka || Georgian            ||  [http://borel.slu.edu/crubadan/stadas.html Crubadan] is aware of 29023 words || || || [http://ka.openoffice.org/ ka.openoffice.org] Some [http://www.illc.uva.nl/Borjomi/Proceedings/margvelani.doc info] on spellchecking the language.
| kk  || Kazakh              || hunspell-kk || || ||
|-
|-
| kk || Kazakh              || hunspell-kk  || || ||
| kl  || Kalaallisut        ||             ||           ||           || [http://giellatekno.uit.no/kal.html Greenlandic parser project]. [http://www.oqaasileriffik.gl/content/us/spell_checker_for_greenlandic/get_it_here MSWord] checker.
|-
|-
| kl || Kalaallisut        ||             ||          ||          || [http://giellatekno.uit.no/kal.html Greenlandic parser project]. [http://www.oqaasileriffik.gl/content/us/spell_checker_for_greenlandic/get_it_here MSWord] checker.
| km  || Khmer              || hunspell-km  ||          ||          ||
|-
|-
| km || Khmer              || hunspell-km ||           ||          ||
| kn  || Kannada            || hunspell-kn || hyphen-kn ||          ||
|-
|-
| kn || Kannada            || hunspell-kn || hyphen-kn ||          ||
| ko  || Korean              || hunspell-ko ||           ||          ||
|-
|-
| ko || Korean              || hunspell-ko  ||           ||          ||
| kok || Konkani            ||             ||           ||          || [http://www.savemylanguage.org/ online dictionary
|-
|-
| ks || Kashmiri          ||              ||            ||          || [http://dsal.uchicago.edu/dictionaries/grierson/ online dictionary]
| ks || Kashmiri          ||              ||            ||          || [http://dsal.uchicago.edu/dictionaries/grierson/ online dictionary]
Line 148: Line 151:
| ku || Kurdish (Arabic)    ||              ||          ||          || [http://www.mail-archive.com/dev@native-lang.openoffice.org/msg02819.html some info]
| ku || Kurdish (Arabic)    ||              ||          ||          || [http://www.mail-archive.com/dev@native-lang.openoffice.org/msg02819.html some info]
|-
|-
| kw || Cornish            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || ||  
| kw || Cornish            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || || [[Cornish|Fedora Cornish Language Translation Project]]
|-
|-
| ky || Kirgyz              || hunspell-ky ||          ||          || [http://www.mail-archive.com/dev@l10n.openoffice.org/msg04360.html OOo localization beginnings]. Orthography [http://enews.ferghana.ru/article.php?id=168 news]
| ky || Kirgyz              || hunspell-ky ||          ||          || [http://www.mail-archive.com/dev@l10n.openoffice.org/msg04360.html OOo localization beginnings]. Orthography [http://enews.ferghana.ru/article.php?id=168 news]
Line 162: Line 165:
| lv || Latvian            || hunspell-lv  || hyphen-lv || mythes-lv ||
| lv || Latvian            || hunspell-lv  || hyphen-lv || mythes-lv ||
|-
|-
| mai || Maithili          ||             ||          ||          || [http://maithiliacademy.org/ maithiliacademy.org]
| mai || Maithili          || hunspell-mai ||          ||          || [http://maithiliacademy.org/ maithiliacademy.org]
|-
|-
| mg || Malagasy            || hunspell-mg  ||          ||          ||
| mg || Malagasy            || hunspell-mg  ||          ||          || mg is equivalent to mlg which is a macrolanguage, see plt for "Standard Malagasy
|-
|-
| mi || Maori              || hunspell-mi  || hyphen-mi || mythes-mi ||
| mi || Maori              || hunspell-mi  || hyphen-mi || mythes-mi ||
Line 179: Line 182:
|-
|-
| mt || Maltese            || hunspell-mt  ||          ||          ||
| mt || Maltese            || hunspell-mt  ||          ||          ||
|-
| my || Burmese            ||              ||          ||          || [http://www.burmese-dictionary.org/ online dictionary]
|-
|-
| nan || Min Nan            ||              ||          ||          || [http://203.64.42.21/iug/ungian/SoannTeng/chil/taihoa.asp online dictionary?]
| nan || Min Nan            ||              ||          ||          || [http://203.64.42.21/iug/ungian/SoannTeng/chil/taihoa.asp online dictionary?]
Line 187: Line 192:
| nds || Lowlands Saxon    || hunspell-nds ||          ||          ||
| nds || Lowlands Saxon    || hunspell-nds ||          ||          ||
|-
|-
| ne || Nepali              || hunspell-ne  ||          || [http://svn.services.openoffice.org/ooo/trunk/dictionaries/ne_NP/th_ne_NP_v2.zip available] ||
| ne || Nepali              || hunspell-ne  ||          || mythes-ne ||
|-
|-
| nl || Dutch              || hunspell-nl  || hyphen-nl || mythes-nl ||
| nl || Dutch              || hunspell-nl  || hyphen-nl || mythes-nl ||
Line 208: Line 213:
|-
|-
| pl || Polish              || hunspell-pl  || hyphen-pl || mythes-pl ||
| pl || Polish              || hunspell-pl  || hyphen-pl || mythes-pl ||
|-
| ps || Pashto              ||              ||          ||          || [http://www.mail-archive.com/aspell-user@gnu.org/msg01945.html possible contact]
|-
|-
| pt || Portuguese          || hunspell-pt  || hyphen-pt || mythes-pt ||
| pt || Portuguese          || hunspell-pt  || hyphen-pt || mythes-pt ||
Line 219: Line 226:
| sa || Sanskrit            || An apparent [http://www.nabble.com/Sanskrit-spellchecker-td14487815.html effort] to create a Sanskrit hunspell dictionary || hyphen-sa || ||
| sa || Sanskrit            || An apparent [http://www.nabble.com/Sanskrit-spellchecker-td14487815.html effort] to create a Sanskrit hunspell dictionary || hyphen-sa || ||
|-
|-
| sc || Sardinian          || hunspell-sc  ||          ||          ||
| sc || Sardinian          || hunspell-sc  ||          ||          || [http://qa.openoffice.org/issues/show_bug.cgi?id=107288 intended dictionaries] [https://launchpad.net/ditzionariusardu launchpad page
|-
|-
| sd  || Sindhi            ||              ||            ||          || [http://dsal.uchicago.edu/dictionaries/mewaram/ online dictionary]
| sd  || Sindhi            || [http://extensions.services.openoffice.org/de/project/sindhispellchecker available] ||            ||          ||
|-
|-
| se || Sammi, Northern      || hunspell-se || [http://www.divvun.no/doc/proof/hyph/OOo/index.html watch this space] || ||
| se || Sammi, Northern      || hunspell-se || [http://www.divvun.no/doc/proof/hyph/OOo/index.html watch this space] || ||
Line 259: Line 266:
| tig || Tigre              || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || ||
| tig || Tigre              || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || ||
|-
|-
| tk || Turkmen            || hunspell-tk  ||           ||          ||
| tk || Turkmen            || hunspell-tk  || hyphen-tk ||          ||
|-
|-
| tl || Tagalog            || hunspell-tl  ||          ||          ||
| tl || Tagalog            || hunspell-tl  ||          ||          ||
Line 271: Line 278:
| tt || Tatar              || [http://sisyphus.ru/srpm/Branch4/hunspell-tt/spec available] but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ? || [http://www.banners.tver.ru/4.0/branch/noarch/SRPMS.classic/hyphen-tt-20080619-alt1.src.rpm available] but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ? || ||
| tt || Tatar              || [http://sisyphus.ru/srpm/Branch4/hunspell-tt/spec available] but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ? || [http://www.banners.tver.ru/4.0/branch/noarch/SRPMS.classic/hyphen-tt-20080619-alt1.src.rpm available] but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ? || ||
|-
|-
| ug || Uyghur              ||              ||          ||          || [http://www.uyghurdictionary.org/ www.uyghurdictionary.org]
| ug || Uyghur              ||              ||          ||          || [http://www.uyghurdictionary.org/ www.uyghurdictionary.org] [http://www.uighur.jp/resource/dic/index-japanese/main.htm www.uighur.jp]
|-
|-
| uk || Ukrainian          || hunspell-uk  || hyphen-uk || mythes-uk ||
| uk || Ukrainian          || hunspell-uk  || hyphen-uk || mythes-uk ||
Line 289: Line 296:
| xh || Xhosa              || hunspell-xh  ||          ||          ||
| xh || Xhosa              || hunspell-xh  ||          ||          ||
|-
|-
| yi || Yiddish            || [http://qa.openoffice.org/issues/show_bug.cgi?id=97791 available] spell-checker || || ||
| yi || Yiddish            || hunspell-yi  || || ||
|-
|-
| yo || Yoruba              || An apparent [http://www.mail-archive.com/dev@lingucomponent.openoffice.org/msg01820.html effort] to create a Yoruba hunspell dictionary || || || [http://www.dictionary.kasahorow.com/en/all/yo www.dictionary.kasahorow.com]
| yo || Yoruba              || Some apparent [http://qa.openoffice.org/issues/show_bug.cgi?id=106822 efforts] [http://www.mail-archive.com/dev@lingucomponent.openoffice.org/msg01820.html older info] to create a Yoruba hunspell dictionary || || || [http://www.dictionary.kasahorow.com/en/all/yo www.dictionary.kasahorow.com]
|-
|-
| zh || Chinese            ||              || Would these (convertable) [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/hyph-zh-latn.tex TeX] rules be universally meaningful for Chinese text ||          ||
| zh || Chinese            ||              || Would these (convertable) [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/hyph-zh-latn.tex TeX] rules be universally meaningful for Chinese text ||          ||
Line 310: Line 317:
|-
|-
| bm  || Bambara            ||              ||            ||          || [http://www.bambara.org/en/index.htm Online Dictionary]
| bm  || Bambara            ||              ||            ||          || [http://www.bambara.org/en/index.htm Online Dictionary]
|-
| buc || Bushi              ||              ||            ||          ||
|-
|-
| brx || Bodo              ||              ||            ||          || [http://www.xobdo.net/ xobdo] is a potential source, but this isn't an option apparently at the moment. Another [http://bagurumba.elementfx.com/bagurumba/dictionary.php Online Dictionary]
| brx || Bodo              ||              ||            ||          || [http://www.xobdo.net/ xobdo] is a potential source, but this isn't an option apparently at the moment. Another [http://bagurumba.elementfx.com/bagurumba/dictionary.php Online Dictionary]
|-
|-
| cop || Coptic            || hunspell-cop || [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/ experimental] convertible TeX rules ||          ||
| cop || Coptic            || hunspell-cop || [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/ experimental] convertible TeX rules ||          ||
|-
| cv  || Chuvash            || hunspell-cv  || || ||
|-
|-
| dgo || Dogri              ||              ||            ||          || [http://www.ciil.org/ Central Institute for Indian Languages]
| dgo || Dogri              ||              ||            ||          || [http://www.ciil.org/ Central Institute for Indian Languages]
Line 321: Line 328:
| dsb || Lower Sorbian      || hunspell-dsb || || ||  
| dsb || Lower Sorbian      || hunspell-dsb || || ||  
|-
|-
| ee  || Ewe                ||             ||            ||          || [http://www.eweland.com/ online] dictionary
| ee  || Ewe                || [https://addons.mozilla.org/en-US/firefox/addon/14027 available] but no License mentioned. In private communication " We will specify licenses for the next release of the spell checkers. In the meantime, assume both Hausa and Eʋegbe have the GNU GPLv3 license as well." ||            ||          || [http://www.eweland.com/ online] dictionary
|-
|-
| eo  || Esperanto          || hunspell-eo  || [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/hyph-eo.tex?view=markup needs more love] to be convertible ||          ||
| eo  || Esperanto          || hunspell-eo  || [http://tug.org/svn/texhyphen/trunk/hyph-utf8/tex/generic/hyph-utf8/patterns/hyph-eo.tex?view=markup needs more love] to be convertible ||          ||
Line 332: Line 339:
|-
|-
| gug || Guarani            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || ||
| gug || Guarani            || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building || || ||
|-
| haw || Hawaiian          || hunspell-haw ||            ||          ||
|-
|-
| hil || Hiligaynon        || hunspell-hil ||            ||          ||
| hil || Hiligaynon        || hunspell-hil ||            ||          ||
Line 337: Line 346:
| ia  || Interlingua        || hunspell-ia  || hyphen-ia  ||          ||
| ia  || Interlingua        || hunspell-ia  || hyphen-ia  ||          ||
|-
|-
| kok || Konkani           ||              ||            ||          || [http://www.savemylanguage.org/ online dictionary
| ki  || Gikuyu            || [http://extensions.services.openoffice.org/en/project/Gikuyu available]            ||            ||          ||
|-
| ksf || Bafia              ||              ||            ||          || [http://www.mail-archive.com/dev@l10n.openoffice.org/msg07116.html work in progress] [http://extensions.services.openoffice.org/en/project/dictionary-ksf-CM empty dictionary page]
|-
|-
| la  || Latin              || hunspell-la  || hyphen-la  ||          ||
| la  || Latin              || hunspell-la  || hyphen-la  ||          ||
|-
|-
| lb  || Luxembourgish      || [http://downloads.spellchecker.lu/packages/OOo3/ available] but the EUPL v1.0 is on our [http://fedoraproject.org/wiki/Licensing licence] list as unacceptable for fedora. || || available but the EUPL v1.0 is on our licence list as unacceptable for fedora. ||
| lb  || Luxembourgish      || hunspell-lb  ||           || mythes-lb ||
|-
|-
| ln  || Lingala            || hunspell-ln  || || ||
| ln  || Lingala            || hunspell-ln  || || ||
|-
| ltg || Latgalian          || [http://dict.dv.lv/download.php?prj=la available] || || || [http://www.ante.lv/vuordneica/bin/view/Main/ Latgalian resources]
|-
|-
| mos || Mossi              || hunspell-mos || || || [http://www.panafril10n.org/wikidoc/pmwiki.php/PanAfrLoc/Moore info]. [http://markmail.org/message/wya3mihuqmmqjxle dictionary effort] (hunspell has no problem with utf-8 .dic files FWIW)
| mos || Mossi              || hunspell-mos || || || [http://www.panafril10n.org/wikidoc/pmwiki.php/PanAfrLoc/Moore info]. [http://markmail.org/message/wya3mihuqmmqjxle dictionary effort] (hunspell has no problem with utf-8 .dic files FWIW)
Line 349: Line 362:
| mni || Manipuri          ||              ||            ||          || [http://www.languageinindia.com/sep2007/manipuriphonological.html some info]
| mni || Manipuri          ||              ||            ||          || [http://www.languageinindia.com/sep2007/manipuriphonological.html some info]
|-
|-
| my || Burmese            ||             ||            ||          || [http://www.burmese-dictionary.org/ online dictionary]
| ny || Nyanja            || hunspell-ny  ||            ||          ||
|-
|-
| ny  || Nyanja            || hunspell-ny ||            ||          ||
| plt || Malagasy, Plateau  || hunspell-mg ||            ||          || Standard Malagasy
|-
|-
| qu  || Quechua Ecuador    || hunspell-qu  || || ||
| qu  || Quechua Ecuador    || hunspell-qu  || || ||
Line 360: Line 373:
|-
|-
| rm  || Raeto-Romance/Romansh ||              ||            ||          || Things are a bit messy as there's a group of R[h]aeto-Romance languages, but sil maps the [http://www.sil.org/iso639-3/codes.asp?order=639_2&letter=r ISO 639-1 rm to ISO 639-3 roh], and [http://www.ethnologue.com/show_language.asp?code=roh ethnologue] documents the Swizz Offical Orthography for roh as Rumantsch Grischun, so that's the probable [http://pne.livejournal.com/769521.html best-fit] for this. [http://www.drg.ch/main.php?a=ins&l=e Dicziunari Rumantsch Grischun]
| rm  || Raeto-Romance/Romansh ||              ||            ||          || Things are a bit messy as there's a group of R[h]aeto-Romance languages, but sil maps the [http://www.sil.org/iso639-3/codes.asp?order=639_2&letter=r ISO 639-1 rm to ISO 639-3 roh], and [http://www.ethnologue.com/show_language.asp?code=roh ethnologue] documents the Swizz Offical Orthography for roh as Rumantsch Grischun, so that's the probable [http://pne.livejournal.com/769521.html best-fit] for this. [http://www.drg.ch/main.php?a=ins&l=e Dicziunari Rumantsch Grischun]
|-
| rue  || Rusyn ||  || || ||
|-
|-
| sat || Santali            ||              ||            ||          || [http://wesanthals.tripod.com/id54.html English<->Santali dictionaries] [http://www.aa.tufs.ac.jp/~mmine/india/Bodding2k/index.html online dictionary]
| sat || Santali            ||              ||            ||          || [http://wesanthals.tripod.com/id54.html English<->Santali dictionaries] [http://www.aa.tufs.ac.jp/~mmine/india/Bodding2k/index.html online dictionary]
|-
|-
| sg  || Sango              ||              ||            ||          || [http://dictionary.kasahorow.com/all/sg/ www.dictionary.kasahorow.com]
| sdc || Sardinian, Sassarese  ||              ||            ||          || [http://qa.openoffice.org/issues/show_bug.cgi?id=107288 intended dictionaries] [https://launchpad.net/ditzionariusardu launchpad page
|-
|-
| sjd || Sammi, Kildin      ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
| sdn || Sardinian, Gallurese  ||              ||            ||          || [http://qa.openoffice.org/issues/show_bug.cgi?id=107288 intended dictionaries] [https://launchpad.net/ditzionariusardu launchpad page
|-
|-
| sma || Sammi, Southern     ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
| sg  || Sango                  ||              ||            ||          || [http://dictionary.kasahorow.com/all/sg/ www.dictionary.kasahorow.com]
|-
| sjd || Sammi, Kildin          ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
|-
| sma || Sammi, Southern       ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
|-amhar
|-amhar
| smj || Sammi, Lule         || hunspell-smj || [http://www.divvun.no/doc/proof/hyph/OOo/index.html watch this space] ||          || [http://www.divvun.no/ Northern Sammi]
| smj || Sammi, Lule           || hunspell-smj || [http://www.divvun.no/doc/proof/hyph/OOo/index.html watch this space] ||          || [http://www.divvun.no/ Northern Sammi]
|-
|-
| smn || Sammi, Inari       ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
| smn || Sammi, Inari           ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
|-
|-
| sms || Sammi, Skolt       ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
| sms || Sammi, Skolt           ||              ||            ||          || [http://www.divvun.no/ Northern Sammi]
|-
| src || Sardinian, Logudorese  ||              ||            ||          || [http://qa.openoffice.org/issues/show_bug.cgi?id=107288 intended dictionaries] [https://launchpad.net/ditzionariusardu launchpad page
|-
| sro || Sardinian, Campidanese ||              ||            ||          || [http://qa.openoffice.org/issues/show_bug.cgi?id=107288 intended dictionaries] [https://launchpad.net/ditzionariusardu launchpad page
|-
|-
| sw  || Swahili            || hunspell-sw  ||            ||          ||
| sw  || Swahili            || hunspell-sw  ||            ||          ||
|-
| swb || Maore              ||              ||            ||          || [http://www.ethnologue.com/show_language.asp?code=swb swb information]
|-
|-
| tet || Tetum              || hunspell-tet ||            ||          ||
| tet || Tetum              || hunspell-tet ||            ||          ||
|-
|-
| tpi || Tok Pisin          || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building ||            ||          ||
| tpi || Tok Pisin          || hunspell-tpi ||            ||          ||
|-
| ty  || Tahitian          || [http://borel.slu.edu/crubadan/apps.html crubadan] corpus building ||            ||          ||
|}
|}



Latest revision as of 21:13, 19 September 2016

Linguistic Components

1. Language Support Matrix (glibc upwards)

Language Code Language hunspell hyphen mythes notes
aa Afar afarfriends.org hosted ALSEC report.
af Afrikaans hunspell-af hyphen-af
am Amharic hunspell-am
an Aragonese www.iea.es, see Spain: Lexicography In Iberian Languages
ar Arabic hunspell-ar experimental thesaurus
as Assamese hunspell-as hyphen-as xobdo is another potential source, possibly even for a thesaurus, but this isn't an option apparently at the moment.
ast Asturian hunspell-ast dictionary announcement
az Azeri (Latin) hunspell-az
be Belarusian hunspell-be hyphen-be
ber Amazigh (Tifinagh) hunspell-ber
ber Amazigh (Latin)
bg Bulgarian hunspell-bg hyphen-bg mythes-bg
bn Bengali hunspell-bn hyphen-bn
bo Tibetan bo.openoffice.org. Latest language support update.
br Breton hunspell-br
bs Bosnian hunspell-bs hyphen-bs
byn Blin Blin Orthography: A History and an Assessment
ca Catalan hunspell-ca hyphen-ca mythes-ca
crh Crimean Tatar A corpus translation team
cs Czech hunspell-cs hyphen-cs mythes-cs
csb Kashubian hunspell-csb
cv Chuvash hunspell-cv
cy Welsh hunspell-cy hyphen-cy
da Danish hunspell-da hyphen-da mythes-da
de German hunspell-de hyphen-de mythes-de
dv Dhivehi

wordlist English-Dhivehi dictionary

dz Dzongkha crubadan corpus building Some requests for help/info.
el Greek hunspell-el hyphen-el mythes-el
en English hunspell-en hyphen-en mythes-en
es Spanish hunspell-es hyphen-es mythes-es
et Estonian hunspell-ee hyphen-et
eu Basque hunspell-eu hyphen-eu
fa Farsi hunspell-fa hyphen-fa
fi Finnish Finnish Community has a parallel Voikko solution. With an enchant backend, an OpenOffice.org extension, and a Firefox extension.
fil Filipino hunspell-tl Filipino is effectively an official Tagalog-based language
fo Faeroese hunspell-fo hyphen-fo
fr French hunspell-fr hyphen-fr mythes-fr
fur Friulian hunspell-fur
fy Frisian hunspell-fy
ga Irish hunspell-ga hyphen-ga mythes-ga
gd Scots Gaelic hunspell-gd
gez Ge'ez Ge'ez Frontier Foundation
gl Galician hunspell-gl hyphen-gl
gu Gujarati hunspell-gu hyphen-gu
gv Manx hunspell-gv
ha Hausa available but no License mentioned. In private communication " We will specify licenses for the next release of the spell checkers. In the meantime, assume both Hausa and Eʋegbe have the GNU GPLv3 license as well."
he Hebrew hunspell-he info on hyphenation
hi Hindi hunspell-hi hyphen-hi Hindi Wordnet is likely convertible, claims to have similar format as English Wordnet, which is the basis of mythes-en
hne Chhattisgarhi corpus building
hr Croatian hunspell-hr hyphen-hr This hasn't been updated in a number of years, on a purely orthographical basis I wonder if dict-sr would provide a better option
hsb Upper Sorbian hunspell-hsb hyphen-hsb
ht Haitian Creole hunspell-ht
hu Hungarian hunspell-hu hyphen-hu mythes-hu
hy Armenian hunspell-hy
id Indonesian hunspell-id hyphen-id
ig Igbo crubadan corpus building www.dictionary.kasahorow.com
ik Inupiaq Broken download link to MSWord dictionary Iñupiaq parser project, Finite-State Morphology for Iñupiaq
is Icelandic hunspell-is hyphen-is
it Italian hunspell-it hyphen-it mythes-it
iu Inuktitut www.livingdictionary.com
ja Japanese
ka Georgian Crubadan is aware of 29023 words ka.openoffice.org Some info on spellchecking the language.
kk Kazakh hunspell-kk
kl Kalaallisut Greenlandic parser project. MSWord checker.
km Khmer hunspell-km
kn Kannada hunspell-kn hyphen-kn
ko Korean hunspell-ko
kok Konkani [http://www.savemylanguage.org/ online dictionary
ks Kashmiri online dictionary
ku Kurdish (Latin) hunspell-ku hyphen-ku
ku Kurdish (Arabic) some info
kw Cornish crubadan corpus building Fedora Cornish Language Translation Project
ky Kirgyz hunspell-ky OOo localization beginnings. Orthography news
lg Luganda crubadan corpus building A general translation effort. An online dictionary
li Limburgish crubadan corpus building
lo Lao Lao OOo localization
lt Lithuanian hunspell-lt hyphen-lt
lv Latvian hunspell-lv hyphen-lv mythes-lv
mai Maithili hunspell-mai maithiliacademy.org
mg Malagasy hunspell-mg mg is equivalent to mlg which is a macrolanguage, see plt for "Standard Malagasy
mi Maori hunspell-mi hyphen-mi mythes-mi
mk Macedonian hunspell-mk convertible
ml Malayalam hunspell-ml hyphen-ml
mn Mongolian hunspell-mn hyphen-mn
mr Marathi hunspell-mr hyphen-mr
ms Malay hunspell-ms no content, but a project announcement for Malaysian thesaurus etc.
mt Maltese hunspell-mt
my Burmese online dictionary
nan Min Nan online dictionary?

Debian wiki notes

nb Bokmaal hunspell-nb hyphen-nb mythes-nb
nds Lowlands Saxon hunspell-nds
ne Nepali hunspell-ne mythes-ne
nl Dutch hunspell-nl hyphen-nl mythes-nl
nn Nynorsk hunspell-nn hyphen-nn mythes-nn
nr Ndebele (Southern) hunspell-nr
nso Sotho (Northern) hunspell-nso
oc Occitan hunspell-oc
om Oromo hunspell-om Oromo details
or Oriya hunspell-or hyphen-or
pa Punjabi hunspell-pa hyphen-pa
pap Papiamentu/Papiamento Papiamentu work in progress The supported glibc locale is pap_AN. Spelling rules differ between Papiamentu and Papiamento groupings. Papiamentu: Curaçao and Bonaire, current members of the Netherlands Antillies, territory code AN. Papiamento: Aruba, (former member of the Netherlands Antillies), territory code AW, crubadan Papiamento corpus building.
pl Polish hunspell-pl hyphen-pl mythes-pl
ps Pashto possible contact
pt Portuguese hunspell-pt hyphen-pt mythes-pt
ro Romanian hunspell-ro hyphen-ro mythes-ro
ru Russian hunspell-ru hyphen-ru mythes-ru
rw Kinyarwanda hunspell-rw
sa Sanskrit An apparent effort to create a Sanskrit hunspell dictionary hyphen-sa
sc Sardinian hunspell-sc intended dictionaries [https://launchpad.net/ditzionariusardu launchpad page
sd Sindhi available
se Sammi, Northern hunspell-se watch this space
shs Secwepemctsin hunspell-shs Secwepecmtsín word bank work in progress. Note it's trivial to create a simple wordlist-based hunspell dict. e.g. wordlist2hunspell
si Sinhala hunspell-si Another very small wordlist
sid Sidamo Some info
sk Slovak hunspell-sk hyphen-sk mythes-sk
sl Slovenian hunspell-sl hyphen-sl mythes-sl
so Somali hunspell-so
sq Albanian hunspell-sq
sr Serbian hunspell-sr hyphen-sr
ss Swati hunspell-ss
st Sotho (Southern) hunspell-st
sv Swedish hunspell-sv hyphen-sv mythes-sv
ta Tamil hunspell-ta hyphen-ta
te Telugu hunspell-te hyphen-te
tg Tajik An apparent effort to create a Tajik hunspell dictionary
th Thai hunspell-th
ti Tigrigna hunspell-ti
tig Tigre crubadan corpus building
tk Turkmen hunspell-tk hyphen-tk
tl Tagalog hunspell-tl
tn Tswana hunspell-tn
tr Turkish available, but like Finnish through voikko the typical solution for Turkish has been the Zemberek library, and to have an enchant backend, an Openoffice.org Extension, and a Firefox extension)
ts Tsonga hunspell-ts
tt Tatar available but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ? available but difficult to see where this came from originally, and what license it is exactly, GPLv2+ (?). Perhaps it is an original work of ALT Linux and that actually is the canonical upstream ?
ug Uyghur www.uyghurdictionary.org www.uighur.jp
uk Ukrainian hunspell-uk hyphen-uk mythes-uk
ur Urdu hunspell-ur
uz Uzbek hunspell-uz
ve Venda hunspell-ve
vi Vietnamese hunspell-vi
wa Walloon hunspell-wa
wo Wolof www.alfanet.anafa.org make Wolof localizations of Firefox and Abiword. www.dictionary.kasahorow.com
xh Xhosa hunspell-xh
yi Yiddish hunspell-yi
yo Yoruba Some apparent efforts older info to create a Yoruba hunspell dictionary www.dictionary.kasahorow.com
zh Chinese Would these (convertable) TeX rules be universally meaningful for Chinese text
zu Zulu hunspell-zu hyphen-zu


2. Language Support Matrix (extra OOo recognized not in glibc)

Language Code Language hunspell hyphen mythes notes
ak Akan hunspell-ak www.dictionary.kasahorow.com
az Azeri (Cyrillic) transliteration table
bm Bambara Online Dictionary
buc Bushi
brx Bodo xobdo is a potential source, but this isn't an option apparently at the moment. Another Online Dictionary
cop Coptic hunspell-cop experimental convertible TeX rules
dgo Dogri Central Institute for Indian Languages
dsb Lower Sorbian hunspell-dsb
ee Ewe available but no License mentioned. In private communication " We will specify licenses for the next release of the spell checkers. In the meantime, assume both Hausa and Eʋegbe have the GNU GPLv3 license as well." online dictionary
eo Esperanto hunspell-eo needs more love to be convertible
fj Fijian hunspell-fj
grc Ancient Greek hunspell-grc hyphen-grc
gsc Gascon Non-Commercial BY-NC-ND license
gug Guarani crubadan corpus building
haw Hawaiian hunspell-haw
hil Hiligaynon hunspell-hil
ia Interlingua hunspell-ia hyphen-ia
ki Gikuyu available
ksf Bafia work in progress empty dictionary page
la Latin hunspell-la hyphen-la
lb Luxembourgish hunspell-lb mythes-lb
ln Lingala hunspell-ln
ltg Latgalian available Latgalian resources
mos Mossi hunspell-mos info. dictionary effort (hunspell has no problem with utf-8 .dic files FWIW)
mni Manipuri some info
ny Nyanja hunspell-ny
plt Malagasy, Plateau hunspell-mg Standard Malagasy
qu Quechua Ecuador hunspell-qu
quh Quechua South Bolivia hunspell-quh
qul Quechua North Bolivia current effort
rm Raeto-Romance/Romansh Things are a bit messy as there's a group of R[h]aeto-Romance languages, but sil maps the ISO 639-1 rm to ISO 639-3 roh, and ethnologue documents the Swizz Offical Orthography for roh as Rumantsch Grischun, so that's the probable best-fit for this. Dicziunari Rumantsch Grischun
rue Rusyn
sat Santali English<->Santali dictionaries online dictionary
sdc Sardinian, Sassarese intended dictionaries [https://launchpad.net/ditzionariusardu launchpad page
sdn Sardinian, Gallurese intended dictionaries [https://launchpad.net/ditzionariusardu launchpad page
sg Sango www.dictionary.kasahorow.com
sjd Sammi, Kildin Northern Sammi
sma Sammi, Southern Northern Sammi
smj Sammi, Lule hunspell-smj watch this space Northern Sammi
smn Sammi, Inari Northern Sammi
sms Sammi, Skolt Northern Sammi
src Sardinian, Logudorese intended dictionaries [https://launchpad.net/ditzionariusardu launchpad page
sro Sardinian, Campidanese intended dictionaries [https://launchpad.net/ditzionariusardu launchpad page
sw Swahili hunspell-sw
swb Maore swb information
tet Tetum hunspell-tet
tpi Tok Pisin hunspell-tpi
ty Tahitian crubadan corpus building

3. Obsolete/Useless codes (glibc)

Language Code Language notes
iw Hebrew Obsoleted by he
no Norwegian Effectively obsoleted by nb