一种权重平均值的深度双Q网络方法
- 吴金金1,
- 刘全1,2,3,4,
- 陈松1,
- 闫岩1
-
1(苏州大学计算机科学与技术学院 江苏苏州 215006)
-
2(符号计算与知识工程教育部重点实验室(吉林大学) 长春 130012)
-
3(江苏省计算机信息处理技术重点实验室(苏州大学) 江苏苏州 215006)
-
4(软件新技术与产业化协同创新中心(南京大学) 南京 210023) (20174227020@stu.suda.edu.cn)
基金项目: 国家自然科学基金项目(61772355,61702055,61502323,61502329);江苏省高等学校自然科学研究重大项目(18KJA520011,17KJA520004);吉林大学符号计算与知识工程教育部重点实验室项目(93K172014K04,93K172017K18);苏州市应用基础研究计划工业项目(SYG201422);江苏高校优势学科建设工程资助项目
详细信息
-
中图分类号: TP183
-
-
计量
-
文章访问数:
982
-
HTML全文浏览量:
0
-
PDF下载量:
265
Averaged Weighted Double Deep Q-Network
-
1(School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006)
-
2(Key Laboratory of Symbolic Computation and Knowledge Engineering (Jilin University), Ministry of Education, Changchun 130012)
-
3(Jiangsu Key Laboratory of Computer Information Processing Technology (Soochow University), Suzhou, Jiangsu 215006)
-
4(Collaborative Innovation Center of Novel Software Technology and Industrialization (Nanjing University), Nanjing 210023)
Funds: This work was supported by the National Natural Science Foundation of China (61772355, 61702055, 61502323, 61502329), the Jiangsu Provincial Natural Science Research University Major Projects (18KJA520011, 17KJA520004), the Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education (Jilin University) (93K172014K04, 93K172017K18), the Suzhou Industrial Application of Basic Research Program (SYG201422), and the Priority Academic Program Development of Jiangsu Higher Education Institutions.