KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters

📰 ArXiv cs.AI

arXiv:2604.23948v1 Announce Type: cross Abstract: The Korean writing system, \textit{Hangeul}, has a unique character representation rigidly following the invention principles recorded in \textit{Hunminjeongeum}.\footnote{\textit{Hunminjeongeum} is a book published in 1446 that describes the principles of invention and usage of \textit{Hangeul}, devised by King Sejong \cite{Hunminjeongeum_Guide}.} However, existing pre-trained language models (PLMs) for Korean have overlooked these principles. I

Published 28 Apr 2026
Read full paper → ← Back to Reads