Paper Reading: Overtrained Language Models Are Harder to Fine-Tune
I read through/highlight the paper "Overtrained Language Models Are Harder to Fine-Tune" (by Spring et al) using Zotero while I look things up online and ask Claude project many clarifying questions.
Paper: https://arxiv.org/abs/2503.19206