MENU

Fun & Interesting

CMU Advanced NLP Fall 2024 (13): Long Sequence Models

Graham Neubig 630 5 months ago
Video Not Working? Fix It Now

This lecture (by Graham Neubig) for CMU CS 11-711, Advanced NLP (Fall 2024) covers: * Transformer compute/memory complexity * Extrapolation of trained models * Alternative transformer architectures * Non-attentional models * Evaluation of long-context models Class Site: https://phontron.com/class/anlp-fall2024/

Comment