May 30, 2025
2025
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models. UCLA NLP Seminar. Video 🎥