ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2019, Vol. 56 ›› Issue (2): 293-305.doi: 10.7544/issn1000-1239.2019.20170752

Previous Articles     Next Articles

Deep Semantic Representation of Time-Sync Comments for Videos

Wu Famin1, Lü Guangyi2, Liu Qi2, He Ming2, Chang Biao2, He Weidong2, Zhong Hui2, Zhang Le2   

  1. 1(School of Software Engineering, University of Science and Technology of China, Hefei 230051); 2(Anhui Province Key Laboratory of Big Data Analysis and Application,School of Computer Science, University of Science and Technology of China, Hefei 230027)
  • Online:2019-02-01

Abstract: With the development of Internet, crowdsourcing short texts such as time-sync comments for videos are of significant importance for online media sharing platforms and leisure industry. It also provides a new research opportunity for the evolution of recommender system, artificial intelligence and so on, which have tremendous values for every walk of life. At the same time, there are many challenges for crowdsourcing short text analysis, because of its high noise, non-standard expressions and latent semantic implication. These have limited the application of traditional natural language processing (NLP) techniques, thus it needs a novel short text understanding method which is of high fault tolerance, and can capture the deep semantics. To this end, this paper proposes a deep semantic representation model based on recurrent neural network (RNN). It can avoid the effect of noise on text segmentation by exploiting the character-based RNN. To achieve the semantic representation, we apply the neural network to represent the latent semantics such that the outputted semantic vectors can deeply reflect the time-sync comments. Then we further design a time-sync comment explanation framework based on semantic retrieval, used for the validation of semantic representation. Finally, we compare them with others baselines, and apply many measures to validate the proposed model. The experimental results show that model can capture the semantics in these short texts more precisely, and the assumptions related to time-sync comments are reasonable.

Key words: time-sync comment for videos, bullet-screen, deep semantic representation, semantic retrieval, character-based recurrent neural network

CLC Number: