
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

workshop paper
Enhancing Turkish Word Segmentation: A Focus on Borrowed Words and Invalid Morpheme
keywords:
rich morphological languages
morphology segmentation
turkish morphology
morfessor
This study addresses a challenge in Morphology segmentation: accurately segmenting words in languages with rich morphology. There needs to be more consistency between the outcomes of probabilistic approaches like Morfessor and words segmented by humans. Our study adds some steps to the Morfessor segmentation process to consider invalid morphemes and borrowed words from other languages to improve morphological segmentation significantly. Comparing our idea to the results obtained from Morfessor demonstrates its efficiency, leading to more accurate morphology segmentation; in the particular case of Turkish, and opening up new opportunities for progress in morpheme segmentation especially for rich morphologically language.