Cross-domain Sentiment Encoding through Stochastic Word Embedding

Research output: Contribution to journalArticle

  • External authors:
  • Yanbin Hao
  • Richang Hong
  • Meng Wang
  • Xueliang Liu
  • John Goulermas

Abstract

Sentiment analysis is an important topic concerning identification of feelings, attitudes, emotions and opinions from text. To automate such analysis, a large amount of example text needs to be manually annotated for model training. This is laborious and expensive, but the cross-domain technique is a key solution to reducing the cost by reusing annotated reviews across domains. However, its success largely relies on the learning of a robust common representation space across domains. In the recent years, significant effort has been invested to improve the cross-domain representation learning by designing increasingly more complex and elaborate model inputs and architectures. We support that it is not necessary to increase design complexity as this inevitably consumes more time in model training. Instead, we propose to explore the word polarity and occurrence information through a simple mapping and encode such information more accurately whilst managing lower computational costs. The proposed approach is unique and takes advantage of the stochastic embedding technique to tackle cross-domain sentiment alignment. Its effectiveness is benchmarked with over ten data tasks constructed from two review corpora and it is compared against ten classical and state-of-the-art methods.

Bibliographical metadata

Original languageEnglish
JournalIEEE Transactions on Knowledge and Data Engineering
Publication statusAccepted/In press - 22 Apr 2019