ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2020, Vol. 57 ›› Issue (3): 474-486. doi: 10.7544/issn1000-1239.2020.20190625

Special Issue: 2020 Special Topic on Service-Oriented, Crowd-Intelligence-Based and Ecosystem-Based Software Development Methods

Status Prediction for Questions Posted on Technical Forums

Shen Mingzhu, Liu Hui   

  1. (School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081)
  • Online: 2020-03-01
  • Supported by: 
    This work was supported by the Major Program of the National Natural Science Foundation of China (61690205).

Abstract: When developers encounter technical problems, they often post questions on technical forums such as Stack Overflow and wait for satisfying answers. QA forums are also an important manifestation of Internet-based group intelligence software development. However, questions posted on these forums may not receive satisfying answers, so asking a question and passively waiting for a solution is not always the best strategy. To this end, we propose a deep neural network based approach that automatically predicts whether a question will obtain a satisfying answer. Knowing in advance whether a question is likely to be answered effectively, developers can choose the best strategy for their technical problems. The approach not only makes full use of the text of the question itself, but also exploits relevant information about the asker. Using the latest deep learning technologies, it exploits the intrinsic relationship between the input features and the solving status of the questions. Experimental results on the dataset provided by Stack Overflow suggest that the proposed approach can accurately predict the solving status of questions. The precision of predicting well-answered questions is 58.87%, and the recall is 46.68% (in contrast, random guessing yields a precision of 38.77% and a recall of 35.26%). The approach also outperforms KNN and FastText.
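
As a reading aid for the precision and recall figures quoted in the abstract, the following is a minimal sketch of how these two metrics are computed for a single "well-answered" class. The function name `precision_recall` and the toy labels are hypothetical illustrations; they are not the authors' code or data.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for the given positive class."""
    pairs = list(zip(y_true, y_pred))
    # True positives: predicted positive and actually positive.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: predicted positive but actually negative.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: actually positive but predicted negative.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy labels (1 = well answered, 0 = not); not data from the paper.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f}")  # precision=0.75 recall=0.75
```

Precision is the share of questions predicted as well answered that really are, while recall is the share of truly well-answered questions that the predictor finds. For a guesser whose predictions are independent of the true labels, expected precision is roughly the share of positive examples in the data, which is why a random baseline is a useful reference point for a classifier.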

Key words: group intelligence software, QA forum, status prediction, deep learning, text classification
