Mancko
Website language identifier
This tool analyzes the language used on a website. It uses the state of the art of Natural Language Processing to distinguish between all the supported languages.
This tool is useful to propose contextual advertising in the right language or to aggregate blogs depending on their language. Of course, we can tailor this to fit your needs and specifications.
Please note that the smaller the number of languages to distinguish, the better the results of this tool.
For any question about this product, please use the contact form.
The 220 currently supported languages are:
- Abkhaz, Achehnese, Achuar shiwiar, Afrikaans, Aguaruna, Akha, Albanian, Amahuaca, Amarakaeri, Amuesha-Yanesha, Arabela, Arabic, Arapaho, Armenian, Asháninka, Ashéninka Pajonal, Asturian, Aymara
- Basque, Belarusian (Cyrillic and Latin alphabets), Bemba, Berber (Tamazight), Bhojpuri, Bislama, Bora, Breton, Brithenig, Buginese, Bulgarian
- Candoshi-Shapra, Caquinte, Cashibo-Cacataibo, Catalan, Cebuano, Chamorro, Chayahuita, Chichewa, Chickasaw, Chinantec (Chiltepec and Ojitlán), Chin Falam, Chinese (Mandarin), Chokwe, Chuukese, Cornish, Croatian, Czech
- Danish, Dhivehi, Dinka Padang, Dutch
- Edo, English, Esperanto, Estonian
- Faroese, Fijian, Finnish, French, Frisian, Friulian
- Galician, Garifuna, German, Glosa, Greek, Guaraní, Gujarati
- Hani, Hausa, Hawaiian, Hebrew, Hiligaynon, Hindi, Hmong (Northern Qiandong Miao, Southern Qiandong Miao, Hmong Njua), Huastec of San Luís Potosí, Huitoto Murui, Hungarian
- Ibibio, Icelandic, Ido, Igbo, Ilokano, Indonesian, Innu-aimun, Interlingua, Inuktitut, Italian
- Japanese, Javanese, Jola-Fogny, Judaeo-Spanish
- Kanuri Yerwa, Kaonde, Kapampangan, Kaqchikel, Kashubian, Kimbundu, Klingon, Konjo, Kurdish
- Lamnso, Latvian, Limburgish, Lingala, Lithuanian, Lojban, Lozi, Luba-Kasai, Luganda, Luvale, Luxembourgish
- Macedonian, Madurese, Makonde, Malagasy, Malay, Malayalam, Maltese, Mam, Maori, Mapudungun, Marathi, Marshallese, Matsés, Maya Yucatec, Micmac, Minangkabau, Miskito, Mixtec Metlatónoc
- Nahuatl, Ndonga, Nepali, Nomatsiguenga, Novial, Norwegian (Bokmål and Nynorsk), Nyamwezi, Nyemba
- Oromo, Ossetian, Otomi
- Páez, Palauan, Persian, Picard, Pipil, Pohnpeian, Polish, Portuguese, Provençal, Purepecha
- Q’eqchi’, Quenya
- Romansh, Romany (Balkan and Vlax), Romanian, Rundi, Runyankole, Russian
- Sámi (North, South and Lule), Sango, Saterland Frisian, Scottish Gaelic, Serbian (Cyrillic and Gaj’s latin alphabet), Sharanahua, Shipibo, Sindarin, Sindhi (Arabic script), Slovak, Slovenian, Soninke, Sorbian, Sotho (Northern and Southern), Spanish, Sukuma, Sundanese, Swahili, Swati, Swedish
- Tagalog, Tahitian, Tamil, Tedim, Tetum, Thai, Tiv, Toba, Tojolabal, Tok Pisin, Tonga, Tswana, Tzotzil (Chamula), Turkish
- Ukrainian, Umbundu, Urarina, Urdu, Uzbek (latin script)
- Vietnamese, Volapük
- Walloon, Waray, Wayuu, Welsh, Wolof
- Xhosa
- Yagua, Yao, Yapese, Yiddish, Yoruba
- Zapotec (Miahuatlán and Güilá), Zhuang (Northern), Zulu