Cite this article (Chinese): | 古天龙, 高慧, 李龙, 包旭光, 李云辉. 基于强化学习的伦理智能体训练方法[J]. 计算机研究与发展, 2022, 59(9): 2039-2050. doi: 10.7544/issn1000-1239.20210474 |
Citation: | Gu Tianlong, Gao Hui, Li Long, Bao Xuguang, Li Yunhui. An Approach for Training Moral Agents via Reinforcement Learning[J]. Journal of Computer Research and Development, 2022, 59(9): 2039-2050. doi: 10.7544/issn1000-1239.20210474 |