COMPARATIVE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR IDENTIFYING UZBEK DIALECTS AT THE SENTENCE LEVEL
This study examines the automatic classification of Uzbek dialects. Although Natural Language Processing (NLP) resources for Uzbek are growing, the scarcity of dialectological corpora remains one of the primary challenges. In this work, two fundamental approaches, TF-IDF + Naive Bayes and BERT (bert-base-multilingual-cased), were tested on a small-scale, author-collected dataset of dialect sentences. The main conclusion of the research is that the primary obstacle to building high-accuracy models in Uzbek dialectology is not only the choice of the right algorithm but, above all, the absence of a high-quality, comprehensively annotated corpus.
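To make the first of the two approaches concrete, the following is a minimal from-scratch sketch of a TF-IDF + Naive Bayes sentence classifier. It is not the authors' implementation: the class name `TfidfNaiveBayes`, the helper `char_ngrams`, the use of character trigrams, the smoothing constant `alpha`, and the labels `dialect_a` and `dialect_b` are all assumptions made for illustration. The training strings are invented placeholders and are not Uzbek dialect data.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Split a sentence into overlapping character n-grams (assumed feature type)."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class TfidfNaiveBayes:
    """Hypothetical sketch: TF-IDF weights used as fractional counts in multinomial Naive Bayes."""

    def fit(self, sentences, labels, alpha=1.0):
        docs = [Counter(char_ngrams(s)) for s in sentences]
        n_docs = len(docs)
        # Smoothed inverse document frequency: ln((1 + N) / (1 + df)) + 1
        df = Counter(term for doc in docs for term in doc)
        self.idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in df.items()}
        # Accumulate the TF-IDF mass of every term per class
        weights = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            for term, tf in doc.items():
                weights[label][term] += tf * self.idf[term]
        class_counts = Counter(labels)
        self.log_prior = {c: math.log(k / n_docs) for c, k in class_counts.items()}
        # Laplace-smoothed log-likelihood of each term given the class
        self.log_likelihood = {}
        for c in class_counts:
            total = sum(weights[c].values()) + alpha * len(self.idf)
            self.log_likelihood[c] = {
                t: math.log((weights[c][t] + alpha) / total) for t in self.idf
            }
        return self

    def predict(self, sentence):
        doc = Counter(char_ngrams(sentence))
        scores = {}
        for c, prior in self.log_prior.items():
            score = prior
            for term, tf in doc.items():
                if term in self.idf:  # n-grams unseen in training are ignored
                    score += tf * self.idf[term] * self.log_likelihood[c][term]
            scores[c] = score
        return max(scores, key=scores.get)


# Invented placeholder strings standing in for labelled dialect sentences.
train_sentences = ["zaza zazu zaza", "zuza zaza", "koko kiko koko", "kiko koko"]
train_labels = ["dialect_a", "dialect_a", "dialect_b", "dialect_b"]

model = TfidfNaiveBayes().fit(train_sentences, train_labels)
print(model.predict("zaza zuza"))
print(model.predict("koko kiko"))
```

With only a handful of training sentences per class, most n-grams in a new sentence are unseen or shared across classes, and the smoothing term dominates the scores. This is one way to see why corpus size and annotation quality can matter more than the choice of classifier.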
Copyright (c) 2025 «ACTA NUUz»

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.











