Unsolved problems in ML safety

Posted on
deep-learning ethics
thumbnail

This was a paper we presented about in Irina Rish’s neural scaling laws course (IFT6760A) in winter 2023. You can view the slides we used here, and the recording here (or my backup here).