jason d lee Archives | NEC Labs America

Quantitative Bounds for Length Generalization in Transformers

July 19, 2025/in Publications/by NEC Labs America

We provide quantitative bounds on the length of sequences that must be observed during training for a transformer to length generalize, i.e., to continue to perform well on sequences longer than those seen during training. Our results improve on Huang et al. [8], who show that there is a finite training length beyond which length generalization is guaranteed but do not provide quantitative bounds on that length.
