1. Transformers March 2026

    Attention, long-range dependencies, and next-token prediction in transformers.