TY - JOUR
T1 - Multi task learning with general vector space for cross-lingual semantic relation detection
AU - Sholikah, Rizka W.
AU - Arifin, Agus Z.
AU - Fatichah, Chastine
AU - Purwarianti, Ayu
N1 - Publisher Copyright:
© 2022
PY - 2022/5
Y1 - 2022/5
N2 - Semantic relation detection has an important role in natural language processing. In a supervised approach, the training process requires a sufficient amount of labeled data. However, in low-resource languages, labeled data are limited, whereas in rich-resource languages, labeled data are available in large quantities. In addition, various studies tend to model the single-task problem without considering the generalization with other tasks. Hence, a strategy that can utilize the availability of labeled data in rich-resource languages and generalize models to improve the identification of relations in a cross-lingual manner is needed. In this paper, we propose a framework to identify cross-lingual semantic relation using multi-task learning with a general vector space. The proposed method was designed to construct a general vector space and semantic relation identification. The experiments were conducted over three datasets: Indonesian–Arabic, English–Arabic, and English–Indonesia. The results show that the use of multi-task learning with a general vector space can overcome the problem of cross-lingual semantic relation identification. This is shown by the accuracy of the synonym and hypernym tasks that reached 84.9% and 84.8%, respectively.
AB - Semantic relation detection has an important role in natural language processing. In a supervised approach, the training process requires a sufficient amount of labeled data. However, in low-resource languages, labeled data are limited, whereas in rich-resource languages, labeled data are available in large quantities. In addition, various studies tend to model the single-task problem without considering the generalization with other tasks. Hence, a strategy that can utilize the availability of labeled data in rich-resource languages and generalize models to improve the identification of relations in a cross-lingual manner is needed. In this paper, we propose a framework to identify cross-lingual semantic relation using multi-task learning with a general vector space. The proposed method was designed to construct a general vector space and semantic relation identification. The experiments were conducted over three datasets: Indonesian–Arabic, English–Arabic, and English–Indonesia. The results show that the use of multi-task learning with a general vector space can overcome the problem of cross-lingual semantic relation identification. This is shown by the accuracy of the synonym and hypernym tasks that reached 84.9% and 84.8%, respectively.
KW - Cross-lingual semantic relation
KW - General vector space
KW - Hypernym
KW - Multi-task learning
KW - Synonym
UR - http://www.scopus.com/inward/record.url?scp=85089579838&partnerID=8YFLogxK
U2 - 10.1016/j.jksuci.2020.08.002
DO - 10.1016/j.jksuci.2020.08.002
M3 - Article
AN - SCOPUS:85089579838
SN - 1319-1578
VL - 34
SP - 2161
EP - 2169
JO - Journal of King Saud University - Computer and Information Sciences
JF - Journal of King Saud University - Computer and Information Sciences
IS - 5
ER -