Probability Distributions Computed by Hard-Attention Transformers
Andy Yang, Anej Svete, Jiaoda Li, Anthony Widjaja Lin, Jonathan Rawski, Ryan Cotterell, David Chiang
January 2025