Brian DuSell

Information Locality as an Inductive Bias for Neural Language Models
Language Models over Canonical Byte-Pair Encodings
The Foundations of Tokenization: Statistical and Computational Concerns
Training Neural Networks as Recognizers of Formal Languages
On the Proper Treatment of the Word in Computational Psycholinguistics
PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns
Algorithms for Weighted Pushdown Automata