Language models pre-trained on large unlabeled corpora have proven highly effective at improving many downstream NLP tasks. However, existing language models are designed primarily for English, and little consideration has been given to the rich semantic information carried by Chinese characters. The semantically related stroke sequences and the polyphony unique to Chinese offer opportunities to enhance Chinese language representation models. Masked language models such as BERT are also plagued by inefficient use of training data, requiring more iterations to complete training. In light of these shortcomings, we propose an improved, customized Chinese pre-trained language model based on the Transformer, called SPCLM (Stroke-encoding and Pinyin-learning enhanced Chinese pre-trained Language representation Model). SPCLM incorporates stroke encoders and an auxiliary pronunciation prediction task. Moreover, an autoregressive objective and masked prediction jointly drive model training. Experimental results demonstrate that SPCLM outperforms baseline methods, achieving competitive results even with insufficient pre-training on five Chinese NLP tasks: natural language inference, semantic similarity, named entity recognition, sentiment analysis, and question answering.
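To make the joint multi-task formulation concrete, the following is a minimal sketch of how the three training signals named above (masked prediction, the autoregressive objective, and pinyin prediction) could be combined into one loss. This is not the authors' implementation; the function name, loss weights, and label conventions are illustrative assumptions.

    import torch
    import torch.nn.functional as F

    def spclm_joint_loss(mlm_logits, mlm_labels, ar_logits, ar_labels,
                         pinyin_logits, pinyin_labels,
                         w_mlm=1.0, w_ar=1.0, w_pinyin=1.0):
        # Hypothetical sketch: each *_logits tensor is (batch, seq_len,
        # num_classes); each *_labels tensor is (batch, seq_len), with -100
        # marking positions excluded from that loss (PyTorch's default
        # ignore_index for cross_entropy).
        loss_mlm = F.cross_entropy(mlm_logits.transpose(1, 2), mlm_labels)        # masked prediction
        loss_ar = F.cross_entropy(ar_logits.transpose(1, 2), ar_labels)           # autoregressive objective
        loss_py = F.cross_entropy(pinyin_logits.transpose(1, 2), pinyin_labels)   # pinyin prediction
        # Weighted sum of the three signals; equal weights are an assumption.
        return w_mlm * loss_mlm + w_ar * loss_ar + w_pinyin * loss_py

    # Usage with random tensors (token vocabulary of 50, 40 pinyin classes):
    B, T = 2, 8
    loss = spclm_joint_loss(
        torch.randn(B, T, 50), torch.randint(0, 50, (B, T)),
        torch.randn(B, T, 50), torch.randint(0, 50, (B, T)),
        torch.randn(B, T, 40), torch.randint(0, 40, (B, T)),
    )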
Keywords: Software Engineering; Language Model; Multi-Task Learning