Sang J, Wang Y, Khan Z A, et al. Reward Shaping Based on Optimal-Policy-Free[J]. IEEE Transactions on Big Data, 2024, 11(4): 1787-1798.(SCI中科院分区二区, CCF C)
[3]Sang J, Wang Y. Graph convolution with topology refinement for Automatic Reinforcement Learning[J]. Neurocomputing, 2023, 554: 126621. (SCI中科院分区二区, CCF C)
[4]Sang J, Wang Y, Yuan L, et al. Multi-label transfer learning via latent graph alignment[J]. World Wide Web, 2022, 25(2): 879-898. (SCI中科院分区三区, CCF B)
[5]Sang J, KHAN Z, Yin H, et al. Reward Shaping Using Directed Graph Convolution Neural Networks for Reinforcement Learning and Game[J]. Frontiers in Physics, 11: 1310467. (SCI中科院分区三区)
[6]Huang A, Wang Y, Sang J, et al. DVF: Multi-agent Q-learning with difference value factorization[J]. Knowledge-Based Systems, 2024, 286: 111422.(SCI中科院分区一区, CCF C)
[7]桑江徽,姜海燕.基于联合分布的多标记迁移学习[J].计算机工程与应用,2021,57(09):154-161.(中文核心期刊)