How undesired goals can arise with correct rewards

Images altered to trick machine vision can influence humans too
Research Published 7 October 2022 Authors Rohin Shah, Victoria Krakovna, Vikrant Varma, Zachary Kenton Exploring examples of goal misgeneralisation – ...
Read more