Will Dabney
Happy to have worked with on research combining behavioural mutual information and successor features, which has been accepted for oral presentation at ICLR. Favorite part: clean answer to where to get the “features” for successor features.
It has been established that diverse behaviors spanning the controllable subspace of a Markov decision process can be trained by rewarding a policy for being distinguishable from other policies....
Open Review @openreviewnet
Will Dabney Dec 21
Replying to @dwf @VladMnih
Work is also with Andre Barreto, , Tom Van de Wiele, and !