N-GRAM YORDAMIDA TURG‘UN LISONIY BIRLIKLARNI ANIQLASH BOSQICHLARI
##submission.downloads##
Maqolada o‘zbek tilidagi turg‘un lisoniy birliklarni milliy matn korpusi asosida avtomatik aniqlashning ilmiy-metodik asoslari yoritiladi. Tadqiqotda 2–5 so‘zli N-gramlar statistik ko‘rsatkichlar orqali saralanib, lingvistik mezonlar va kontekstual modellar yordamida frazeologik va erkin birliklarga tasniflandi. Taklif etilgan yondashuv 90% aniqlik ko‘rsatkichiga erishdi hamda yuqori bog‘langan birliklarning muhim qismi frazeologik tabiatga egaligini tasdiqladi. Olingan natijalar avtomatik frazeologik lug‘atlar yaratish, korpus lingvistikasi amaliyoti va NLP tizimlarida ko‘p so‘zli birikmalarni qayta ishlash sifatini oshirishga xizmat qiladi.
1. Sag, I. A., Baldwin, T., Bond, F., Copestake, A., & Flickinger, D. Multiword expressions: A pain in the neck for NLP. In Computational Linguistics and Intelligent Text Processing: Third International Conference, 2002, Mexico, February 17–23.
2. https://blog.devgenius.io/ngram-collocation-analysis-for-hate-speech-detection-9de4330e410c
3. Mandravickaite, J., Krilavicius, T., & Man, K. L. A Combined approach for automatic identification of multi-word expressions for Latvian and Lithuanian. IAENG International Journal of Computer Science, 2017, 44(4), 598-606.
4. Manning, C., & Schutze, H. Foundations of Statistical Natural Language Processing. MIT Press, 1999.
5. Jurafsky, D., & Martin, J. Speech and Language Processing. Prentice Hall, 2023.
6. Ramshaw, L., & Marcus, M. “Text Chunking Using Transformation-Based Learning.” ACL Workshop, 1995.
7. Mikolov, T. et al. “Efficient Estimation of Word Representations in Vector Space.” ICLR, 2013.
8. Devlin, J. et al. “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.” ACL, 2019.
9. Rahmatullayev, Sh. O‘zbek tilining frazeologik lug‘ati. Toshkent: O‘qituvchi, 2010.
10. Uznatcorpora.uz – O‘zbek tilining milliy matn korpusi.
11. Kilgarriff, A. “Corpora and Collocations.” International Journal of Corpus Linguistics, 2006.
12. Baldwin, T., & Kim, S. N. “Multiword Expressions.” In: Handbook of NLP, 2010.
Mulkiiyat (c) 2025 «O‘zMU XABARLARI»

Ushbu ish quyidagi litsenziya asosida ruxsatlangan Kreativ Commons Attribution-NonCommercial-ShareAlike 4.0 International litsenziyasi asosida bu ish ruxsatlangan..






.jpg)

.png)





