2019

  • Swarm-inspired Reinforcement Learning via Collaborative Inter-agent Knowledge Distillation
    Zhang-Wei Hong, Prabhat Nagarajan, Guilherme Maeda
    Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019) December 2019.

  • Learning Latent State Spaces for Planning through Reward Prediction
    Aaron Havens, Yi Ouyang, Prabhat Nagarajan, Yasuhiro Fujita
    Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019) December 2019.

  • ChainerRL: A Deep Reinforcement Learning Library
    Yasuhiro Fujita, Toshiki Kataoka, Prabhat Nagarajan, Takahiro Ishikawa
    Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019) December 2019.

  • Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
    Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, Scott Niekum
    International Conference on Machine Learning (ICML) June 2019.

  • Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
    Prabhat Nagarajan, Garrett Warnell, Peter Stone
    AAAI 2019 Workshop on Reproducible AI, Honolulu, Hawaii. January 2019.


2018

  • Nondeterminism as a Reproducibility Challenge for Deep Reinforcement Learning
    Prabhat Nagarajan
    Master's Thesis, The University of Texas at Austin, August 2018
    Committee: Peter Stone (Supervisor), Scott Niekum

  • The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning
    Prabhat Nagarajan, Garrett Warnell, Peter Stone
    2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden.