Home
People
Publications
Teaching
Thesis Projects
Reading Group
Talks
Joining
Brian DuSell
Latest
Information Locality as an Inductive Bias for Neural Language Models
Language Models over Canonical Byte-Pair Encodings
The Foundations of Tokenization: Statistical and Computational Concerns
Training Neural Networks as Recognizers of Formal Languages
PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns
The Foundations of Tokenization: Statistical and Computational Concerns
Algorithms for Weighted Pushdown Automata
Cite
×