TY - GEN
T1 - Label Distribution Representation Learning in Document-Level Sentiment Analysis
AU - Zhu, Wenhao
AU - Yu, Ziyue
AU - Chio, Kakit
AU - Luo, Wuman
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Document-Level Sentiment Analysis (DSA) aims at predicting the user rating for a product based on the user text comments. One major challenge of DSA is the inconsistency between user comments and their ratings. So far, various methods have been proposed to address this challenge, and significant progress has been made. However, existing works ignore that user behaviors may be inconsistent even for the same product. For example, a user may leave similar reviews for the same product at different times, but his/her ratings may be completely different. This will make user labels in training data no longer reliable. To address this issue, we propose a method called Label Distribution Representation Learning (LDRL) for DSA. LDRL assumes the presence of noise in ground truth data and proposes a probability distribution scheme to denote data labels. During the training process, we assign different distribution standard deviations to the users with different reliability levels, thereby reducing the impact of noise. We conduct comprehensive experiments in three real world datasets to evaluate the proposed LDRL. The experimental results show that LDRL outperforms all the baselines in terms of Accuracy and RMSE.
AB - Document-Level Sentiment Analysis (DSA) aims at predicting the user rating for a product based on the user text comments. One major challenge of DSA is the inconsistency between user comments and their ratings. So far, various methods have been proposed to address this challenge, and significant progress has been made. However, existing works ignore that user behaviors may be inconsistent even for the same product. For example, a user may leave similar reviews for the same product at different times, but his/her ratings may be completely different. This will make user labels in training data no longer reliable. To address this issue, we propose a method called Label Distribution Representation Learning (LDRL) for DSA. LDRL assumes the presence of noise in ground truth data and proposes a probability distribution scheme to denote data labels. During the training process, we assign different distribution standard deviations to the users with different reliability levels, thereby reducing the impact of noise. We conduct comprehensive experiments in three real world datasets to evaluate the proposed LDRL. The experimental results show that LDRL outperforms all the baselines in terms of Accuracy and RMSE.
KW - Document-level
KW - Label distribution learning
KW - Sentiment analysis
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85208447425&partnerID=8YFLogxK
U2 - 10.1109/ICCIA62557.2024.10719196
DO - 10.1109/ICCIA62557.2024.10719196
M3 - Conference contribution
AN - SCOPUS:85208447425
T3 - 2024 IEEE 9th International Conference on Computational Intelligence and Applications, ICCIA 2024
SP - 79
EP - 83
BT - 2024 IEEE 9th International Conference on Computational Intelligence and Applications, ICCIA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th IEEE International Conference on Computational Intelligence and Applications, ICCIA 2024
Y2 - 9 August 2024 through 11 August 2024
ER -