This page lists every language name that can be used in a corpus registry file when creating a corpus in Sketch Engine. Not every language listed here already has a corpus in Sketch Engine, but users can use any of these names when creating their own corpora.
A
Afar, Abkhazian, Adyghe, Afrikaans, Aghem, Akan, Amharic, Arabic, Assamese, Asu, Asturian, Avaric, Aymara, Azerbaijani (Azeri)
B
Bashkir, Basaa, Belarusian, Bemba, Bena, Bulgarian, Bislama, Bambara, Bengali, Tibetan, Breton, Bodo, Bosnian, Blin
C
Catalan, Cawai (Atsam), Chechen, Cebuano, Chiga, Chamorro, Chuukese, Cherokee, Czech, Cymraeg (Welsh)
D
Danish, Dawida (Taita), Deutsch (German), Djerma (Zarma), Duala, Divehi, Diola (Jola-Fonyi), Dzongkha
E
Embu, Ewe, Efik, Greek, English, Spanish, Estonian, Basque, Ewondo
F
Farsi (Persian), Fulah, Finnish, Filipino, Fijian, Faroese, French, Friulian, Western Frisian
G
Gaeilge (Irish), Ga, Gagauz, Scottish Gaelic, Gilbertese, Galician, Guarani, Swiss German, Gujarati, Gusii, Gaelg/Gailck (Manx Gaelic)
H
Hausa, Hawaiian, Hebrew, Hindi, Hiligaynon, Hiri Motu, hrvatski (Croatian), Haitian, Hungarian, hayeren (Armenian)
I
Indonesian, Igbo, Sichuan Yi, Iloko, Ingush, Icelandic, Italian
J
Japanese, Machame, Javanese
K
Kartuli (Georgian), Kabyle, Kaje (Jju), Kamba, Kabardian, Katab (Tyap), Kimakonde (Makonde), Kabuverdianu, Kongo, Khasi, Koyra Chiini, Kikuyu, Kuanyama, Kazakh, Kalaallisut, Kalenjin, Khmer, Kannada, Korean, Komi-Permyak, Konkani, Kosraean, Kpelle, Komi-Zyrian, Karachay-Balkar, Kashmiri, Kishambaa (Shambala), Kpa (Bafia), Kölsch (Colognian), Kurdish, Kurdish, Kurdish, Kurdish, Kumyk, Kernowek (Cornish), Kyrgyz (Kirghiz)
L
Lahnda, Lak, Langi, Lao, Latin, Latvian, Lezghian, Lingala, Lithuanian, Luba-Katanga, Luba-Lulua, Luganda (Ganda), Luo, Luxembourgish, Luyia
M
Maithili, Masai, Moksha, Maguindanaon, Meru, Morisyen, Malagasy, Makhuwa-Meetto, Marshallese, Maori, Macedonian, Malayalam, Mongolian, Mongolian, Mongolian, Marathi, Malay, Maltese, Myanmar (Burmese), Mordvin (Erzya)
N
Nauru, Nama, Norwegian Bokmål, North Ndebele, Nedderdüütsch (Low German), Nepali, Niuean, Nederlands (Dutch), Ngumba (Kwasio), Norwegian Nynorsk, South Ndebele, Northern Sotho, Nuer, Nyanja, Nyankole
O
Occitan, Oromo, Oriya, Ossetic
P
Punjabi, Punjabi, Punjabi, Pangasinan, Papiamento, Palauan, Polish, Pohnpeian, Pashto, Portuguese
Q
Quechua
R
Romansh, Romanian, Rombo, Russian, Rwanda (Kinyarwanda), Rwa
S
Sanskrit, Sakha, Samburu, Santali, Sangu, Sindhi, Northern Sami, Sena, Koyraboro Senni, Sango, Shilha (Tachelhit), Sinhala, Sidamo, Slovak, Slovenian, Samoan, Shona, Somali, shqip (Albanian), Serbian, Serbian, Serbian, Setswana, Swati, Saho, Southern Sotho, Swedish, Swahili, Congo Swahili
T
Tamil, Telugu, Teso, Tetum, Tajik, Thai, Tigrinya, Tigre, Turkmen, Tokelau, Tswana, Tonga, Tok Pisin, Turkish, Taroko, Tsonga, Tausug, Tatar, Tuvalu, Tasawaq, Tahitian, Tuvinian, Central Morocco Tamazight
U
Udmurt, Uighur, Ukrainian, Ulithian, Urdu, Uzbek, Uzbek, Uzbek
V
Vai, Venda, Vietnamese, Vunjo
W
Walser, Walamo, Waray, Wolof
X
Xhosa, Soga
Y
Yapese, Yangben, Yoruba
Z
Zhuang, Zhōngwén (Chinese), jiǎnhuàzì (Chinese Simplified), Zhèngtǐzì (Chinese Traditional), Zulu
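As a rough illustration of how a name from this list might be checked before a corpus is compiled, the Python sketch below reads a language declaration out of registry text and compares it with a set of allowed names. It assumes the registry declares the language on its own line as `LANGUAGE "Name"`. The helper names `find_language` and `is_allowed`, the sample registry excerpt, and the four-name subset are hypothetical and serve only as an example. They are not part of Sketch Engine.

```python
import re

# Illustrative subset of the names listed above, not the full list.
ALLOWED_LANGUAGES = {"Afrikaans", "Czech", "English", "Zulu"}

# Assumption: the registry declares the language as LANGUAGE "Name" on its own line.
LANGUAGE_LINE = re.compile(r'^\s*LANGUAGE\s+"([^"]*)"\s*$', re.MULTILINE)


def find_language(registry_text):
    """Return the declared language name, or None if no declaration is found."""
    match = LANGUAGE_LINE.search(registry_text)
    return match.group(1) if match else None


def is_allowed(registry_text):
    """Return True when the declared language is in ALLOWED_LANGUAGES."""
    return find_language(registry_text) in ALLOWED_LANGUAGES


# Hypothetical registry excerpt, used only for illustration.
example = 'NAME "My corpus"\nLANGUAGE "Czech"\nENCODING "UTF-8"\n'
print(find_language(example), is_allowed(example))
```

A check like this is exact and case-sensitive, so the name in the registry file would need to match the listed spelling character for character.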