Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning (bibtex)
by David Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, Jose Miguel Hernandez-Lobato, Sebastian Tschiatschek
Reference:
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference LearningDavid Janz, Jiri Hron, Przemysław Mazur, Katja Hofmann, Jose Miguel Hernandez-Lobato, Sebastian Tschiatschek In Neural Information Processing Systems (NeurIPS) 2019.
Bibtex Entry:
@inproceedings{janz2019successor,
  title={Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning},
  author={David Janz and Jiri Hron and Przemysław Mazur and Katja Hofmann and Jose Miguel Hernandez-Lobato and Sebastian Tschiatschek},
  booktitle={Neural Information Processing Systems (NeurIPS)},
  url={https://arxiv.org/pdf/1810.06530.pdf},
  year={2019}
}
Powered by bibtexbrowser