Jump to content

Cyrillic script

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 82.83.136.187 (talk) at 13:05, 1 June 2004. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Cyrillic alphabet is an alphabet used to write six natural Slavic languages (Russian, Ukrainian, Belarusian, Serbian, Macedonian, and Bulgarian), other languages of the former Soviet Union (Turkic languages Azeri (1939-91), Tatar, Turkmen (1940-94), Uzbek (1940-98), Kyrgyz, Kazakh and Gagauz (since 1957); Ossetic and Tajik (Indo-Iranian tongues); Romanian (a Romance language; up to 19th century and in Moldova between 1940-89); Udmurt, Kildin Saami and Mordvin (Finno-Ugric language); and Abaza, Abkhaz, Adygei, Aisor, Altai, Avar, Balkar, Bashkir, Buryat, Chechen (1940-1991), Chuvash, Chukchi, Dargwa, Dungan, Evenks, Kabardian, Kalmyk, Karachay, Karakalpak, Karelian, Khakas, Khants, Komi, Koryak, Kumyk, Kurdish (living in former USSR), Lak, Lezghian, Mansi, Mari, Mongolian, Nanai, Nenets, Nogai, Oriat, Romany (in Serbia and Montenegro and former USSR), Selkup, Tabasaran, Tat, Tuva, Udekhe (Udege) and Yakut languages), as well as the constructed languages Slovio and Lingua Franca Nova.

А
A
Б
Be
В
Ve
Г
Ge
Ѓ
Gje
Ґ
Ghe
Д
De
Ђ
Dje
Е
E
Є
E ukrainian
Ѐ
E with grave
Ё
Io
Ж
Zhe
Ѕ
Dze
З
Ze
И
I
Й
I short
Ѝ
I with grave
І
I ukrainian
Ї
Yi
Ј
Je
К
Ka
Ќ
Kje
Ћ
Tshe
Л
El
Љ
Lje
М
Em
Н
En
Њ
Nje
О
O
П
Pe
Р
Er
С
Es
Т
Te
Ѹ
Ou
У
U
Ў
U short
Ф
Ef
Х
Ha
Ѡ
Omega cyrillic
Ц
Tse
Ч
Che
Џ
Dzhe
Ш
Sha
Щ
Shcha
Ъ
Hard sign (yer)
Ы
Yery
Ь
Soft sign
Ѣ
Yat
Э
E reversed
Ю
Yu
Я
Ya
(not in Unicode)
A iotified
Ѥ
E iotified
Ѧ
Yus small
Ѫ
Yus big
Ѩ
Yus small iotified
Ѭ
Yus big iotified
Ѯ
Ksi cyrillic
Ѱ
Psi cyrillic
Ѳ
Fita
Ѵ
Izhitsa
Ѷ
Izhitsa with double grave

(Russian letters bolded; old letters slanted)

Cyrillic alphabet for Russian

The plan of the alphabet is derived from the Early Cyrillic alphabet, itself a derivative of the Glagolitic alphabet, a 9th century uncial cursive usually credited to two brothers, Saint Cyril and Saint Methodius. But the shapes of the glyphs in the Cyrillic alphabet are mainly Greek letters, although some letters retain their Glagolitic forms. Cyril's contributions to the Glagolitic alphabet and hence to the Cyrillic alphabet are still recognised, as the latter is named after him.

А Б В Г Д Е Ё Ж З И Й
К Л М Н О П Р С Т У Ф
Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я


As used in various languages

Sounds are indicated using SAMPA. These are only approximate indicators. While these languages by and large have a phonemic orthography, there are occasional exceptions -- most notably Russian ЕГО (meaning him/his), which is pronounced /jevO/ instead of /jegO/.

Note that spellings of names may vary, especially Y/J/I but also GH/G/H and ZH/J.)

Slavic languages

Capital Small NameSoundISO 9:1995 transliteration
АаA/a/a
БбBe/b/b
ВвVe/v/v
ГгGe/g/g
ДдDe/d/d
ЕеYe/jE/e
ЁёYo/jO/yo
ЖжZhe/Z/zh
ЗзZe/z/z
ИиI/i/i
ЙйShort I/j/j
КкKa/k/k
ЛлEl/l/l
МмEm/m/m
НнEn/n/n
ОоO/o/o
ПпPe/p/p
РрEr/r/r
СсEs/s/s
ТтTe/t/t
УуU/u/u
ФфEf/f/f
ХхHa/x/x
ЦцTse/ts/c
ЧчChe/tS/ch
ШшSha/S/sh
ЩщShcha/Sj/shh
ЪъHard Signno palatalization¹''
ЫыYery/1/y'
ЬьSoft Sign/j/ -- palatalization¹'
ЭэE/E/e'
ЮюYu/ju/yu
ЯяYa/ja/ya

Notes on the Hard Sign and Soft Sign:

  1. When a iotated vowel (vowel whose sound begins with /j/) follows a consonant, the consonant will become palatalised (the /j/ sound will mix with the consonant), and the vowel's /j/ sound will not be heard independently. The Hard Sign will indicate that this does not happen, and the /j/ sound will appear only in front of the vowel. The Soft Sign will indicate the consonant should be palatised, but the vowel's /j/ sound will not mix with the palatalization of the consonant. The Soft Sign will also indicate that a consonant before another consonant or at the end of a word is palatised. Examples: та - ta; тя - tja; тья - tjja; тъя - tja; т - t; ть - tj.
  2. In the pre-reform Russian orthography, in Old Russian language and in Old Church Slavonic the letter is called yer. Historically, the "hard sign" takes place of a disappeared vowel, preserved in Bulgarian language. See notes for Bulgarian language.

Historical letters:

Before 1918, there were four extra letters in use: Іі (replaced by Ии), Ѳѳ (Фита "Fita", replaced by Фф), Ѣѣ (Ять "Yat", replaced by Ее), and Ѵѵ (ижица "Izhitsa", replaced by Ии).

Like Russian except:

  • Г is a voiced fricative consonant and is called "Ge". Between Ge and De is the letter Ghe (Ґ, ґ), pronounced /g/, i.e., like a Russian Г. It looks like Ge but has an "upturn" pointing up from the right side of the top bar. (This letter was not officially used in the Soviet Union, so it doesn't appear in many Cyrillic fonts.)
  • Ye is pronounced /E/ and is called "E". Yo does not appear. Between E and Zhe is the letter Ye (Є, є), pronounced /jE/, which looks like the Russian letter E, only backwards. The Russian letter E does not appear.
  • I is pronounced /1/ and is called "Y". Accordingly, Short I is called "Short Y". Between Y and Short Y appear the letter I (І, і), pronounced /i/, which looks like the Latin letter I, and the letter Yi (Ї, ї), pronounced /ji/, which looks like I with a diaeresis (the same two dots that appear in the Russian letter Yo) above it.
  • Yery does not appear.
  • The Hard Sign is not used; instead, its purpose is served by an apostrophe.

Like Russian except:

  • I looks like the Latin letter I (І, і). (But Short I still looks the same as in Russian!)
  • Between U and Ef is the letter Short U (Ў, ў), which looks like U (У) with a breve and pronounced /w/, or like the "u" part in diphthongs in "now", "low".
  • Shcha does not appear.
  • The Hard Sign is not used. Its purpose (removing of palatalization) is served by an apostrophe.
  • Г represents a voiced fricative consonant

Like Russian except:

  • Ye is pronounced /E/ and is called "E".
  • Yo does not appear.
  • The Russian letter E does not appear.
  • Shcha is pronounced /St/ and is called "Shta".
  • The Hard Sign is used for a vowel, /@/ (Schwa).
  • Yery does not appear.

Like Russian except:

  • Ye is pronounced /E/ and is called "E". Yo does not appear. The Russian letter E does not appear.
  • Between De and E is the letter Gje (Ђ, ђ), which is pronounced /dj/, and looks like Tjerv, except that the loop of the H curls farther and dips downwards.
  • Short I does not appear. Between I and Ka is the letter Ej (Ј, ј), pronounced /j/, which looks like the Latin letter J.
  • Between El and Em is the letter Elj (Љ, љ), pronounced /lj/, which looks like El and the Soft Sign smashed together.
  • Between En and O is the letter Enj (Њ, њ), pronounced /nj/, which looks like En and the Soft Sign smashed together.
  • Between Te and U is the letter Tjerv (Ћ, ћ), which is pronounced /tj/ and looks like a lowercase Latin letter H with a bar. On the uppercase letter, the bar appears at the top; on the lowercase letter, the bar crosses the top half of the vertical line.
  • Between Che and Sha is the letter Dzhe (Џ, џ), pronounced /dZ/, which looks like Tse but with the downturn moved from the right side of the bottom bar to the middle of the bottom bar.
  • Sha is the last letter; the rest do not appear.

Like Serbian except:

  • Between Ze and I is the letter Dze (Ѕ, ѕ), pronounced /dz/, which looks like the Latin letter S.
  • Djerv is replaced by Gje (Ѓ, ѓ), pronounced /gj/, which looks like Ghe with an acute accent (').
  • Tjerv is replaced by Kja (Ќ, ќ), pronounced /kj/, which looks like Ka with an acute accent (').

Non-Slavic languages

These alphabets are generally modelled after Russian, but often bear striking differences, particularly when adapted for Caucasian languages.

Cyrillics Qaydar latin transliteration
Аa Aa
Әә Ää
Бб Bb
Вв Vv
Гг Gg
Ғғ Ğğ
Дд Dd
Ее Ee
Ёё Yo yo
Жж Jj
Зз Zz
Ии Iy iy
Йй Yy
Кк Kk
Ққ Qq
Лл Ll
Мм Mm
Нн Nn
Ңң Ññ
Оо Oo
Өө Öö
Пп Pp
Рр Rr
Сс Ss
Тт Tt
Уу Ww
Ұұ Uu
Үү Üü
Фф Ff
Хх Xx
Һһ Hh
Цц Cc
Чч Çç
Шш Şş
Щщ ??
Ъъ ??
Ыы
İi İi
Ьь ??
Ээ Ee
Юю Yu yu
Яя Ya ya

Cyrillic in Unicode

In Unicode, the Cyrillic block extends from U+0400 to U+052F. The characters in the range U+0400 — U+045F are basically the characters from ISO 8859-5 moved upward by 864 positions. The characters in the range U+0460 — U+0489 are historic letters, not used now. The characters in the range U+048A — U+052F are additional letters for various languages that are written with Cyrillic script.

Unicode does not include accented Cyrillic letters, but they can be composed by adding U+0301 ("combining acute accent") after the accented vowel (e.g. ы́ э́ ю́ я́). Some languages (e.g. modern Church Slavonic language) still are not fully supported.

    0 1 2 3 4 5 6 7 8 9 A B C D E F
400   Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
410   А Б В Г Д Е Ж З И Й К Л М Н О П
420   Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
430   а б в г д е ж з и й к л м н о п
440   р с т у ф х ц ч ш щ ъ ы ь э ю я
450   ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
460   Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
470   Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
480   Ҁ ҁ ҂ ҃ ҄ ҅ ҆ ҇ ҈ ҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
490   Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
4A0   Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
4B0   Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
4C0   Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
4D0   Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
4E0   Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
4F0   Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
500   Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
510   Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
520   Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ