ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2021, Vol. 58 ›› Issue (2): 253-263.doi: 10.7544/issn1000-1239.2021.20200727

Special Issue: 2021 Special Topic on Data Governance and Data Transparency


Ethical Behavior Discrimination Based on Social News Dataset

Gu Tianlong1, Feng Xuan1, Li Long1,2, Bao Xuguang1, Li Yunhui1   

  1. Guangxi Key Laboratory of Trusted Software (Guilin University of Electronic Technology), Guilin, Guangxi 541004
  2. College of Information Science and Technology/College of Cyber Security, Jinan University, Guangzhou 510632
  • Online:2021-02-01
  • Supported by: 
    This work was supported by the National Natural Science Foundation of China (U1711263, U1811264, 61966009, 61961007, 61862016, 62006057), the Natural Science Foundation of Guangxi Province (2019GXNSFBA245049, 2019GXNSFBA245059, 2018GXNSFDA281045), and the Science and Technology Base and Talents Program of Guangxi Province (AD19245011).

Abstract: As artificial intelligence (AI) is applied more broadly, its ethical and moral issues have attracted growing concern. How to develop, from the perspective of technical realization, an AI system that complies with human values and ethical norms, namely ethically aligned AI design, is an important problem that urgently needs to be solved. Ethical and moral discrimination based on machine learning is a useful exploration in this direction. Social news data is rich in ethical and moral content and knowledge, which makes it a possible source of training data for machine learning. This paper therefore constructs a social news dataset covering the ethics and morality of human behavior, supplemented by a dataset of laws and codes of conduct, for machine learning training and testing. An ethical behavior discrimination model, ERNIE-CNN, based on enhanced language representation with informative entities (ERNIE) and a convolutional neural network (CNN), is developed to make ethical discriminations about behavior by calculating semantic similarity over the vector representations of words. The experimental results show that the proposed model outperforms the baseline models.

Key words: social news dataset, ethically aligned design, deep learning, ERNIE, CNN
