Improving the Performance of Vietnamese?Korean Neural Machine Translation with Contextual Embedding
- Abstract
- With the recent evolution of deep learning, machine translation (MT) models and systems are being steadily improved. However, research on MT in low-resource languages such as Vietnamese and Korean is still very limited. In recent years, a state-of-the-art context-based embedding model introduced by Google, bidirectional encoder representations for transformers (BERT), has begun to appear in the neural MT (NMT) models in different ways to enhance the accuracy of MT systems.
The BERT model for Vietnamese has been developed and significantly improved in natural language processing (NLP) tasks, such as part-of-speech (POS), named-entity recognition, dependency parsing, and natural language inference. Our research experimented with applying the Vietnamese BERT model to provide POS tagging and morphological analysis (MA) for Vietnamese sentences, and applying word-sense disambiguation (WSD) for Korean sentences in our Vietnamese?Korean bilingual corpus. In the Vietnamese?Korean NMT system, with contextual embedding, the BERT model for Vietnamese is concurrently connected to both encoder layers and decoder layers in the NMT model. Experimental results assessed through BLEU, METEOR, and TER metrics show that contextual embedding significantly improves the quality of Vietnamese?Korean NMT.
- Author(s)
- 부 반 하이; 웬 콴 푸억; 빅토리야 에비파테이 투니얀; 옥철영
- Issued Date
- 2021
- Type
- Article
- Keyword
- neural machine translation; part-of-speech; word-sense disambiguation; Korean-Vietnamese machine translation; bidirectional encoder representations from transformers
- DOI
- 10.3390/app112311119
- URI
- https://oak.ulsan.ac.kr/handle/2021.oak/9178
https://ulsan-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=TN_cdi_doaj_primary_oai_doaj_org_article_504330e2603c43058f5e15a0cf6531b5&context=PC&vid=ULSAN&lang=ko_KR&search_scope=default_scope&adaptor=primo_central_multiple_fe&tab=default_tab&query=any,contains,Improving%20the%20Performance%20of%20Vietnamese%3FKorean%20Neural%20Machine%20Translation%20with%20Contextual%20Embedding&offset=0&pcAvailability=true
- Publisher
- APPLIED SCIENCES-BASEL
- Location
- 스위스
- Language
- 영어
- ISSN
- 2076-3417
- Citation Volume
- 11
- Citation Number
- 23
- Citation Start Page
- 11119
- Citation End Page
- 11119
-
Appears in Collections:
- Engineering > IT Convergence
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.