Joint Inference of Reward Machines and Policies for Reinforcement Learning Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu Published: 2019-09-12 14:00:00 -0400 Venue: ICAPS 2020 View Paper Learning