SYNTACTIC PARSING IN THE UZBEK LANGUAGE: PROBLEMS AND PROPOSALS

Parsing, dependency parsing, treebanks, syntactic tag, corpus.

Authors

In natural language processing (NLP), parsing and treebank methods deal with the syntactic analysis of sentences, determining syntactic relationships between units in sentences, and identifying sentence structures. In the Uzbek language morphological analyzer, 35,000 sentences of varying lengths have been POS tagged. The next stage involves developing a syntactic tagging system for the Uzbek language and proposing appropriate tags. This article examines the issue of syntactic parsing in the Uzbek language, models for identifying sentence components, challenges in dependency parsing of Uzbek texts, and the matter of syntactic tags. The topic of syntactic parsing and treebanks in the Uzbek language is analyzed in comparison with other agglutinative languages.