The effect of model size on worst-group generalization

Posted on
deep-learning generalization

This was a paper we presented about in Irina Rish’s neural scaling laws course (IFT6167) in winter 2022. You can view the slides we used here, and the recording here.