X hits on this document

PDF document

Languages and sizes of dictionaries - page 6 / 9

28 views

0 shares

0 downloads

0 comments

6 / 9

The Turkish language is written in the Latin alphabet, but a few characters were added, such as the dotless-i which is very different from the dotted-i. Therefore the letter i is not a lower case of the majuscule letter I, a major prob- lem to many systems.

Romanian (lexicon size 1,000,000, selection June 2009) The Romanian language belongs to the Roman languages. It includes a few additional characters such as the a- breve <ă>, i-circumflex <î>, the s-cedille <ş>, the t-sedille <ţ> , the s-comma below, the t-comma below.

Bulgarian (lexicon size 840,000, selection February 2008) The Bulgarian language is written in the Cyrillic alphabet.

Faeroese (lexicon size 517,000, selection November 2012) The Faeroese language is spoken by 50,000 inhabitants of the Faeroe Islands. It is based on the old Norse as is the Islandic language.

Bahasa Indonesia (lexicon size 76,000, selection July 2010) The Bahasa Indonesian language is the standard language written and spoken in the Republic of Indonesia. Many Austronesian languages are spoken in the Indonesian Archipelago, but Bahasa Indonesia is the lingua franca.

Slovene (lexicon size 425,000, selection October 2007) The Slovene language is spoken in the Republic of Slovenia, situated between Austria, Hungary, Croatia, and Italy. It is a south slavic language written in the Latin alphabet, including a few Slavic characters such as <č>, <š>, <ž> and the diagraphs Lj and Nj. Slovene is highly inflected and nearly every noun has an adjective form too.

Croatian (lexicon size 547,000, selection October2009) The Croatian language, formerly named Serbo-Croatic, is closely related to Serbian. The Croatian language is written in the Latin alphabet, including a few typical Slavic characters such as <č>, <ć>, <š>, <ž>, and digraphs Lj and Nj.

Bosnian (lexicon size 565,000, selection August 2009) The Bosnian language, formerly named Serbo-Croatic, is closely related to Serbian and Croatian. visit download page

Serbian Cyrillic (lexicon size 570,000, selection August 2009) The Serbian language is written in the Cyrillic alphabet, including typical Serbian characters Dž, Lj, Nj (Џ, Љ, Њ).

Byelorussian (lexicon size 1,500,000, selection February 2008) The Byelorussian language is written in the Cyrillic alphabet, like the Russian language, but the language was heavily influenced by Polish for centuries. Today, in the Byelorussian Republic, Byelorussian plays a lesser role compared to the Russian language.

Slovak (lexicon size 1 million, selection August 2009) The Slovak language is closely related to Czech, but a few characters differ.

Ukrainian (lexicon size over 1 million words, selection February 2008) The Ukrainian language is written in the Cyrillic alphabet, but for centuries the language was heavily influenced by Polish.

Swahili (lexicon size 75,000, selection February 2005)

Document info
Document views28
Page views28
Page last viewedThu Jan 19 07:50:22 UTC 2017
Pages9
Paragraphs112
Words3806

Comments