ChainerRL is a deep reinforcement learning library built upon the Chainer deep learning library. Part of my work at Preferred Networks is devoted to developing algorithms and infrastructure for ChainerRL.
A high-quality implementation of Human-level control through deep reinforcement learning. The is a deterministic implementation, aimed at reproducibility. This was built with PyTorch.