News

Philip Cross seems to have the wrong idea about why Canadians are choosing to forgo travelling to the U.S. or buying American ...
This letter proposes a model-free safe RL algorithm that achieves near-zero constraint violations with high rewards. Our key idea is to jointly learn a policy and a neural barrier certificate under ...