This lecture (by Graham Neubig) for CMU CS 11-711, Advanced NLP (Fall 2024) covers: * Transformer compute/memory complexity * Extrapolation of trained models * Alternative transformer architectures * Non-attentional models * Evaluation of long-context models Class Site: https://phontron.com/class/anlp-fall2024/