Talk at MIT BCS titled “Transformers, Tree Structures and Generalization”