NTP-Based DPO Training for Suppressing Korean-Chinese Code-Switching 


Vol. 15,  No. 1, pp. 54-60, Jan.  2026
10.3745/TKIPS.2026.15.1.54


PDF
  Abstract

Large Language Models (LLMs) have demonstrated outstanding performance across various natural language processing (NLP) tasks. However, when Chinese-pretrained LLMs are applied in Korean environments, unintended code-switching between Korean and Chinese frequently occurs. This not only undermines user trust but also poses critical issues in domains requiring linguistic accuracy, such as translation, education, and official document writing.To address this problem, we propose a Direct Preference Optimization (DPO) training method based on Next Token Prediction (NTP). Specifically, we leverage NTP to detect confusion points—the moments when Chinese tokens first appear in Korean outputs. Using these points, we construct the NTP-CS dataset by pairing responses with code-switching (rejected) against responses without code-switching (chosen), and train the model accordingly. Experimental results show that our proposed NTP-CS approach consistently outperforms both LLM-CS, which induces code-switching via LLM prompts, and LLM-TX, which translates entire sentences. Notably, across all datasets, chosen responses exhibited positive log-likelihood values, while rejected responses showed negative values, forming an ideal probability pattern. This demonstrates that datasets constructed around local confusion points are more effective than simple translation datasets for mitigating code-switching. We expect that this research will improve the consistency of Korean-centric outputs in multilingual settings and help reduce the quality gap between Korean and Chinese responses.

  Statistics


  Cite this article

[IEEE Style]

S. Ko and Y. Shin, "NTP-Based DPO Training for Suppressing Korean-Chinese Code-Switching," The Transactions of the Korea Information Processing Society, vol. 15, no. 1, pp. 54-60, 2026. DOI: 10.3745/TKIPS.2026.15.1.54.

[ACM Style]

Sungmin Ko and Youhyun Shin. 2026. NTP-Based DPO Training for Suppressing Korean-Chinese Code-Switching. The Transactions of the Korea Information Processing Society, 15, 1, (2026), 54-60. DOI: 10.3745/TKIPS.2026.15.1.54.