ICLR 2023: Is GPT the Wrong Architecture?
Machine Learning · December 21, 2022
Our paper at ICLR 2023, “Bidirectional Language Models Are Also Few-shot Learners” surprisingly discovers that older models like T5, that predate GPT-3, were promptable and could perform in-context learning.