O‘ZBEK TILIDAGI SHEVALARNI GAP DARAJASIDA ANIQLASHDA MASHINALI O‘QITISH ALGORITMLARINING QIYOSIY TAHLILI

Shahnoza POZILOVA; Madina RAXIMOVA

doi:10.69617/nuuz.v1i1.11.1.9880

Authors

Shahnoza POZILOVA Toshkent axborot texnologiyalari universiteti professori, DSc, Uzbekistan
Madina RAXIMOVA Toshkent axborot texnologiyalari universiteti magistranti, Uzbekistan

Vol. 1 No. 1.11.1 (2025): O'zMU xabarlari

Articles

Downloads

O‘ZBEK TILIDAGI SHEVALARNI GAP DARAJASIDA ANIQLASHDA MASHINALI O‘QITISH ALGORITMLARINING QIYOSIY TAHLILI (Uzbek)

Abstract
How to Cite
Metrics
References
License

This study examines the task of automatic classification of Uzbek language dialects. While resources of Natural Language Processing (NLP) are increasing, the scarcity of dialectological corpora remains one of the primary challenges. In this work, two fundamental approaches were tested on a small-scale, author-collected dataset comprising dialects on TF-IDF + Naive Bayes and BERT(bert-base-multilingual-cased) models. The main conclusion of the research is that the primary obstacle to creating high-accuracy models in Uzbek dialectology is not only the the right algorithm, but rather the absence of a high-quality, comprehensively annotated corpus.

1. M. K., S. A., T. O., & K. M. (2021). UzBERT: A New Uzbek Language Model and Its Application in Sentiment Analysis.

2. Mansurov B.., A. Mansurov. (2021). UzBERT: pretraining a BERT model for Uzbek. Copper City Labs

3. Pedregosa, F. et al. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. Article in Journal of Machine Learning Research

4. Wolf, T. et al. (2020). Transformers: State-of-the-Art Natural Language Processing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Hugging Face, Brooklyn, USA

5. Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics.

6. Reshetov V.V., Sh.Shoabdurahmonov muallifligida yaratilgan “O‘zbek dialektologiyasi” darsligining 60 yilligi munosabati bilan tashkil etilgan. (2022) “O‘zbek shevalari tadqiqotlari: amaliyot, metodologiya va yangicha yondashuv” mavzusidagi II Respublika ilmiy-nazariy konferensiyasi materiallari. Toshkent «Donishmand ziyosi»

7. Abdulla Qahhor. (1936). Anor, O‘g‘ri, Bemor, Daxshat hikoyalari. (adabiy sheva uchun)

8. Nazar Eshonqul. (2004).Maymun yetaklagan odam hikoyasi. (adabiy sheva uchun)

9. Ijtimoiy tarmoqlar: Instagram, Telegram (mahalliy aholining shevasi)

COMPARATIVE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR IDENTIFYING UZBEK DIALECTS AT THE SENTENCE LEVEL

Authors

Downloads

Language

ORCiD

submissions

SidebarMenu

thewur

QSrating

analytics

Analytics

editteam

Meet Our Editorial Team

google

Google scholar

crossref

Crossref DOIs member

orcidd

ORCiD

oac

Higher Attestation Commission of the Republic of Uzbekistan

issn

ISSN National Centre for Uzbekistan

Information

Address:

Contact Info: