Unsolved problems in ML safety

Posted on 2023-02-06 at 11:39:33 UTC-0500

This was a paper we presented in Irina Rish’s neural scaling laws course (IFT6760A) in winter 2023. You can view the slides we used here, and the recording here (or my backup here).