Страница публикации
Phonetic String Matching for Languages with Cyrillic Alphabet
Тип публикации: Статья в журнале
Тип материала: Текст
Авторы: Paramonov V., Shigarov A., Ruzhnikov G., Cherkashin E.
Журнал: Advances in Intelligent Systems and Computing: Proc. of the 39th Intern. Conf. on Information Systems Architecture and Technology (ISAT'2018)
Язык публикации: english
Серия книг: Advances in Intelligent Systems and Computing
Том: 852
Номера страниц: 301-311
Количество страниц: 11
Год публикации: 2019
Отчетный год: 2019
URL: https://link.springer.com/chapter/10.1007/978-3-319-99981-4_28
DOI: 10.1007/978-3-319-99981-4_28
Аннотация: The usage of phonetic similarity in comparison of textual strings and elimination of misprints is one of significant issues in philology. It is widely used in automatic text checking. Nowadays most of phonetic algorithms are designed for English language words processing. The quality of comparison may be decreased for non-English languages especially for languages, which have rich morphology and use non-Latin alphabet symbols, e.g. East Slavic languages with Cyrillic letters. We propose an approach to phonetic comparison of Russian language words. It is based on detection letters and letter sequences that have similar pronunciation according to rules of the language. The resultant phonetic representation of the words are coded by prime numbers. The efficiency of the reviewed algorithm is considered in the paper. The algorithm was adopted for Mongolian language phonetic processing.
Индексируется WOS: Q5
Индексируется Scopus: Нет
Индексируется УБС: Нет
Индексируется РИНЦ: Да
Индексируется ВАК: Нет
Индексируется CORE: Нет