Next Article in Journal
Neuro-Fuzzy Model Evaluation for Enhanced Prediction of Mechanical Properties in AM Specimens
Previous Article in Journal
Fine-Grained Tasks for Crowdsourced Entity Resolution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

CGM: Copy Mechanism GPT with Mask for Ellipsis and Anaphora Resolution in Dialogue

Department of Computer Engineering, Changwon National University, Changwon 51140, Republic of Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(1), 5; https://doi.org/10.3390/app15010005
Submission received: 25 October 2024 / Revised: 16 December 2024 / Accepted: 20 December 2024 / Published: 24 December 2024

Abstract

GPT (Generative Pre-trained Transformer) is a generative language model that demonstrates outstanding performance in the field of text generation. Generally, the attention mechanism of the transformer model behaves similarly to a copy distribution. However, due to the absence of a dedicated encoder, it is challenging to ensure that the input is retained for generation. We propose a model that emphasizes the copy mechanism in GPT. We generate masks for the input words to initialize the distribution and explicitly encourage copying through training. To demonstrate the effectiveness of our approach, we conducted experiments to restore ellipsis and anaphora in dialogue. In a single domain, we achieved 0.4319 (BLEU), 0.6408 (Rouge-L), 0.9040 (simCSE), and 0.9070 (BERTScore), while in multi-domain settings we obtained 0.4611 (BLEU), 0.6379 (Rouge-L), 0.8902 (simCSE), and 0.8999 (BERTScore). Additionally, we evaluated the operation of the copy mechanism on out-of-domain data, yielding excellent results. We anticipate that applying the copy mechanism to GPT will be useful for utilizing language models in constrained situations.
Keywords: copy mechanism; curriculum learning; pre-trained models copy mechanism; curriculum learning; pre-trained models

Share and Cite

MDPI and ACS Style

Cho, J.-W.; Oh, J.; Cha, J.-W. CGM: Copy Mechanism GPT with Mask for Ellipsis and Anaphora Resolution in Dialogue. Appl. Sci. 2025, 15, 5. https://doi.org/10.3390/app15010005

AMA Style

Cho J-W, Oh J, Cha J-W. CGM: Copy Mechanism GPT with Mask for Ellipsis and Anaphora Resolution in Dialogue. Applied Sciences. 2025; 15(1):5. https://doi.org/10.3390/app15010005

Chicago/Turabian Style

Cho, Ji-Won, Jinyoung Oh, and Jeong-Won Cha. 2025. "CGM: Copy Mechanism GPT with Mask for Ellipsis and Anaphora Resolution in Dialogue" Applied Sciences 15, no. 1: 5. https://doi.org/10.3390/app15010005

APA Style

Cho, J.-W., Oh, J., & Cha, J.-W. (2025). CGM: Copy Mechanism GPT with Mask for Ellipsis and Anaphora Resolution in Dialogue. Applied Sciences, 15(1), 5. https://doi.org/10.3390/app15010005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop