Talks at AI2 + UW titled “Understanding and Improving Generalization in Transformers”