Chenhao Tan, University of Chicago, @ChenhaoTan, @chenhaotan.bsky.social, chenhao@uchicago.edu
Decoder-Only Transformers
Vaswani et al. (2017)
Su et al. (2021), RoFormer: Enhanced Transformer with Rotary Position Embedding
Shazeer (2020)
He et al. (2016)
Zhang & Sennrich (2019)