It's not just size that matters: small language models are also few-shot learners

Posted on
deep-learning nlp neural-scaling

We presented this paper as a mini-lecture in Bang Liu’s IFT6289 course in winter 2022. You can view the slides we used here.