Deep Q-Learning Paper Series, Part 2: Starting with Target Net and Experience Replay

Posted on 2020-01-17 Edited on 2020-06-08 In Deep Q-Learning

Target Net and Experience Replay are the most basic improvements to DQN, but what theory lies behind them? Are there other, deeper reasons why these measures work?

Deep Q-Learning Paper Series, Part 1: From Q-Learning to DQN

Posted on 2020-01-04 Edited on 2020-06-08 In Deep Q-Learning

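Let's explore the secrets of the DQN paper series together. This is meant as a starting point for discussion: no complicated formulas, only intuitive explanations.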