Publications | Rycolab

Ivan Baburin, Ryan Cotterell (2025). A Close Analysis of the Subset Construction. Conference on Foundations of Software Technology and Theoretical Computer Science.

URL

Ionut Constantinescu, Tiago Pimentel, Ryan Cotterell, Alex Warstadt (2025). Investigating Critical Period Effects in Language Acquisition through Neural Language Models. Transactions of the Association for Computational Linguistics.

URL

Afra Amini, Tim Vieira, Elliott Ash, Ryan Cotterell (2025). Variational Best-of-$N$ Alignment.

URL

Selim Jerad, Anej Svete, Jiaoda Li, Ryan Cotterell (2025). Unique Hard Attention: A Tale of Two Sides.

Alexandra Butoi, Ghazal Khalighinejad, Anej Svete, Josef Valvoda, Ryan Cotterell, Brian DuSell (2025). Training Neural Networks as Recognizers of Formal Languages.

URL

Eleftheria Tsipidi, Samuel Kiegeland, Franz Nowak, Tianyang Xu, Ethan Wilcox, Alex Warstadt, Ryan Cotterell, Mario Giulianelli (2025). The Harmonic Structure of Information Contours.

Juan Luis Gastaldi, John Terilla, Luca Malagutti, Brian DuSell, Tim Vieira, Ryan Cotterell (2025). The Foundations of Tokenization: Statistical and Computational Concerns.

Vicky Xefteri, Afra Amini, Tim Vieira, Ryan Cotterell (2025). Syntactic Control of Language Models by Posterior Inference.

João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O'Donnell (2025). Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo.

Tianyu Liu*, Jirui Qi*, Paul He, Arianna Bisazza, Mrinmaya Sachan, Ryan Cotterell (2025). Pointwise Mutual Information as a Performance Gauge for Retrieval-Augmented Generation.

Laura Manduchi, Kushagra Pandey, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt, Vincent Fortuin (2025). On the challenges and opportunities in generative AI. arXiv.

Tim Vieira, Tianyu Liu, Clemente Pasti, Yahya Emara, Brian DuSell, Benjamin LeBrun, Mario Giulianelli, Juan Luis Gastaldi, John Terilla, Timothy J. O'Donnell, Ryan Cotterell (2025). Language Models over Canonical Byte-Pair Encodings.

Taiga Someya, Anej Svete, Brian DuSell, Timothy J. O'Donnell, Mario Giulianelli, Ryan Cotterell (2025). Information Locality as an Inductive Bias for Neural Language Models.

Mario Giulianelli, Sarenne Wallbridge, Ryan Cotterell, Raquel Fernández (2025). Incremental Alternative Sampling as a Lens into the Temporal and Representational Resolution of Linguistic Prediction. PsyArXiv.

URL

Shauli Ravfogel, Anej Svete, Vésteinn Snæbjarnarson, Ryan Cotterell (2025). Gumbel Counterfactual Generation from Language Models.

Julian Minder*, Kevin Du*, Niklas Stoehr, Giovanni Monea, Chris Wendler, Robert West, Ryan Cotterell (2025). Controllable Context Sensitivity and the Knob Behind It.

Tianyang Xu, Tatsuki Kuribayashi, Yohei Oseki, Ryan Cotterell, Alex Warstadt (2025). Can Language Models Learn Typologically Implausible Languages?.

URL

Ethan Wilcox, Michael Y. Hu, Aaron Mueller, Alex Warstadt, Leshen Choshen, Chengxu Zhuang, Adina Williams, Ryan Cotterell, Tal Linzen (2025). Bigger is not always better: The importance of human-scale language modeling for psycholinguistics.

Francesco Ignazio Re, Andreas Opedal, Glib Manaiev, Mario Giulianelli, Ryan Cotterell (2025). A Spatio-Temporal Point Process for Fine-Grained Modeling of Reading Behavior.

Matan Avitan, Ryan Cotterell, Yoav Goldberg, Shauli Ravfogel (2025). A Practical Method for Generating String Counterfactuals.

Filippo Ficarra, Ryan Cotterell, Alex Warstadt (2025). A Distributional Perspective on Word Learning in Neural Language Models.

Anej Svete, Robin Shing Moon Chan, Ryan Cotterell (2024). On Efficiently Representing Regular Languages as RNNs. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan (2024). Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?. Proceedings of the 41st International Conference on Machine Learning.

URL

Anej Svete, Ryan Cotterell (2024). Transformers Can Represent n-gram Language Models. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).

URL

Brian DuSell, David Chiang (2024). Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns. The Twelfth International Conference on Learning Representations.

URL

Stephen Bothwell, Brian DuSell, David Chiang, Brian Krostenko (2024). PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).

URL

Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell (2024). What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Jiaoda Li, Yifan Hou, Mrinmaya Sachan, Ryan Cotterell (2024). What Do Language Models Learn in Context? The Structured Task Hypothesis.. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Josef Valvoda, Ryan Cotterell (2024). Towards Explainability in Legal Outcome Prediction Models. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).

URL

Juan Luis Gastaldi, John Terilla, Luca Malagutti, Brian DuSell, Tim Vieira, Ryan Cotterell (2024). The Foundations of Tokenization: Statistical and Computational Concerns.

Franz Nowak$^*$, Anej Svete$^*$, Alexandra Butoi, Ryan Cotterell (2024). On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Anej Svete$^*$, Franz Nowak$^*$, Anisha Mohamed Sahabdeen, Ryan Cotterell (2024). Lower Bounds on the Expressivity of Recurrent Neural Language Models. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).

URL

Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell (2024). Context versus Prior Knowledge in Language Models. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Jiaoda Li, Jennifer C. White, Mrinmaya Sachan, Ryan Cotterell (2024). A Transformer with Stack Attention. Findings of the Association for Computational Linguistics: NAACL 2024.

URL

Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel (2023). The Ethics of Automating Legal Actors. Association for Computational Linguistics.

PDF

Afra Amini, Li Du, Ryan Cotterell (2023). Structured Voronoi Sampling.

URL

Tiago Pimentel, Clara Meister, Ethan Wilcox, Kyle Mahowald, Ryan Cotterell (2023). Revisiting the Optimality of Word Lengths. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

URL

Anej Svete, Ryan Cotterell (2023). Recurrent Neural Language Models as Probabilistic Finite-state Automata. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

URL

Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev (2023). Quantifying the redundancy between prosody and text. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

PDF

Anej Svete, Ryan Cotterell (2023). On the Representational Capacity of Recurrent Neural Language Models. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

URL

Tiago Pimentel, Clara Meister, Ethan Wilcox, Kyle Mahowald, Ryan Cotterell (2023). On the Optimality of Word Lengths. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

Tianyu Liu, Afra Amini, Mrinmaya Sachan, Ryan Cotterell (2023). Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

PDF

Ethan Wilcox, Clara Meister, Ryan Cotterell, Tiago Pimentel (2023). Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell (2023). Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora. Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning.

Alexandra Butoi, Tim Vieira, Ryan Cotterell, David Chiang (2023). Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

URL

Andreas Opedal, Eleftheria Tsipidi, Tiago Pimentel, Ryan Cotterell, Tim Vieira (2023). An Exploration of Left-Corner Transformations. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

URL

Vilém Zouhar, Clara Meister, Juan Gastaldi, Li Du, Mrinmaya Sachan, Ryan Cotterell (2023). Tokenization and the Noiseless Channel. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Ethan G. Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell (2023). Testing the Predictions of Surprisal Theory in 11 Languages. Transactions of the Association for Computational Linguistics.

URL

Clara Meister, Tiago Pimentel, Luca Malagutti, Ryan Cotterell (2023). On the Efficacy of Sampling Adapters. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Tiago Pimentel, Clara Meister, Ethan G. Wilcox, Roger Levy, Ryan Cotterell (2023). On the Effect of Anticipation on Reading Times. Transactions of the Association for Computational Linguistics.

URL

Afra Amini, Tiago Pimentel, Clara Meister, Ryan Cotterell (2023). Naturalistic Causal Probing for Morpho-Syntax. Transactions of the Association for Computational Linguistics.

URL

Shauli Ravfogel, Yoav Goldberg, Ryan Cotterell (2023). Log-Linear Guardedness and Its Implications. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Clara Meister, Tiago Pimentel, Gian Wiher, Ryan Cotterell (2023). Locally Typical Sampling. Transactions of the Association for Computational Linguistics.

URL

Afra Amini$^*$, Tianyu Liu$^*$, Ryan Cotterell (2023). Hexatagging: Projective Dependency Parsing as Tagging. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).

URL

Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alex Warstadt, Ryan Cotterell (2023). Generalizing Backpropagation for Gradient-Based Interpretability. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alex Warstadt, Ryan Cotterell (2023). Generalizing Backpropagation for Gradient-Based Interpretability. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Andreas Opedal, Ran Zmigrod, Tim Vieira, Ryan Cotterell, Jason Eisner (2023). Efficient Semiring-Weighted Earley Parsing. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Andreas Opedal, Ran Zmigrod, Tim Vieira, Ryan Cotterell, Jason Eisner (2023). Efficient Semiring-Weighted Earley Parsing. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Ryan Cotterell, Mrinmaya Sachan (2023). Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Ryan Cotterell, Mrinmaya Sachan (2023). Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Alexandra Butoi, Ryan Cotterell, David Chiang (2023). Convergence and Diversity in the Control Hierarchy. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell, Aaron Schein (2023). An Ordinal Latent Variable Model of Conflict Intensity. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF

Li Du, Lucas Torroba Hennigen, Tiago Pimentel, Clara Meister, Jason Eisner, Ryan Cotterell (2023). A Measure-theoretic Characterization of Tight Language Model. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

URL

Vilém Zouhar, Clara Meister, Juan Gastaldi, Li Du, Tim Vieira, Mrinmaya Sachan, Ryan Cotterell (2023). A Formal Perspective on Byte-Pair Encoding. Findings of the Association for Computational Linguistics: ACL 2023.

URL

Franz Nowak, Ryan Cotterell (2023). A Fast Algorithm for Computing Prefix Probabilities. Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).

URL

Niklas Stoehr, Ryan Cotterell, Aaron Schein (2023). Sentiment as an Ordinal Latent Variable. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics.

URL

Tiago Pimentel$^*$, Clara Meister$^*$, Ryan Cotterell (2023). On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation. Proceedings of the 11th International Conference on Learning Representations.

URL

Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner, Ryan Cotterell (2023). On the Intersection of Context-Free and Regular Languages. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics.

URL

Niklas Stoehr, Benjamin J. Radford, Ryan Cotterell, Aaron Schein (2023). The Ordered Matrix Dirichlet for State-Space Models. Proceedings of the 26th International Conference on Artificial Intelligence and Statistics.

URL

Karolina Stańczak, Sagnik Ray Choudhury, Tiago Pimentel, Ryan Cotterell, Isabelle Augenstein (2023). Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models.

URL

Wangchunshu Zhou, Yuchen Jiang, Ethan Wilcox, Ryan Cotterell, Mrinmaya Sachan (2023). Controlled Text Generation with Natural Language Instructions. Proceedings of the 39th International Conference on Machine Learning.

URL

Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein (2023). A Latent-Variable Model for Intrinsic Probing. Proceedings of the 37th AAAI Conference on Artificial Intelligence.

URL

Thomas Clark, Clara Meister, Tiago Pimentel, Michael Hahn, Richard Futrell, Ryan Cotterell, Roger Levy (2023). A Cross-Linguistic Pressure for Uniform Information Density in Word Order. Transactions of the Association for Computational Linguistics.

URL

Tiago Pimentel*, Josef Valvoda*, Niklas Stoehr, Ryan Cotterell (2022). The Architectural Bottleneck Principle. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF URL

Afra Amini, Ryan Cotterell (2022). On Parsing as Tagging. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF URL

Liam van der Poel, Ryan Cotterell, Clara Meister (2022). Mutual Information and Hallucinations in Abstractive Summarization. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF URL

Shauli Ravfogel, Francisco Vargas, Yoav Goldberg, Ryan Cotterell (2022). Kernelized Concept Erasure. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF URL

Tianyu Liu, Yuchen Jiang, Nicholas Monath, Ryan Cotterell, Mrinmaya Sachan (2022). Autoregressive Structure Prediction with Language Models. Findings of the Association for Computational Linguistics: EMNL 2022.

PDF URL

Alexandra Butoi, Brian DuSell, Tim Vieira, Ryan Cotterell, David Chiang (2022). Algorithms for Weighted Pushdown Automata. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

PDF URL

Anej Svete, Benjamin Dayan, Tim Vieira, Ryan Cotterell, Jason Eisner (2022). Algorithms for Weighted Finite-State Automata with Failure Arcs. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.

URL

Jennifer White, Ryan Cotterell (2022). Equivariant Transduction through Invariant Alignment. Proceedings of the 29th International Conference on Computational Linguistics.

URL

Josef Valvoda, Naomi Saphra, Jon Rawski, Adina Williams, Ryan Cotterell (2022). Benchmarking Compositionality with Formal Languages. Proceedings of the 29th International Conference on Computational Linguistics.

PDF URL

Johann-Mattis List, Ekaterina Vylomova, Robert Forkel, Nathan Hill, Ryan Cotterell (2022). The SIGTYP 2022 Shared Task on the Prediction of Cognate Reflexes. Proceedings of the 4th Workshop on Research in Computational Linguistic Typology and Multilingual NLP.

Khuyagbaatar Batsuren, Gábor Bella, Aryaman Arora, Viktor Martinovic, Kyle Gorman, Zdeněk Žabokrtský, Amarsanaa Ganbold, Šárka Dohnalová, Magda Ševčíková, Kateřina Pelegrinová, Fausto Giunchiglia, Ryan Cotterell, Ekaterina Vylomova (2022). The SIGMORPHON 2022 Shared Task on Morpheme Segmentation. Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

Jordan Kodner, Salam Khalifa, Khuyagbaatar Batsuren, Hossep Dolatian, Ryan Cotterell, Faruk Akkus, Antonios Anastasopoulos, Taras Andrushko, Aryaman Arora, Nona Atanalov, Gábor Bella, Elena Budianskaya, Yustinus Ghanggo Ate, Omer Goldman, David Guriel, Simon Guriel, Silvia Guriel-Agiashvili, Witold Kieraś, Andrew Krizhanovsky, Natalia Krizhanovsky, Igor Marchenko, Magdalena Markowska, Polina Mashkovtseva, Maria Nepomniashchaya, Daria Rodionova, Karina Scheifer, Alexandra Sorova, Anastasia Yemelina, Jeremiah Young, Ekaterina Vylomova (2022). SIGMORPHON--UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection. Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

Karolina Stańczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein (2022). Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan (2022). Probing via Prompting. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams (2022). On the Machine Learning of Ethical Judgments from Natural Language. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Shauli Ravfogel, Michael Twiton, Yoav Goldberg, Ryan Cotterell (2022). Linear Adversarial Concept Erasure. Proceedings of the 39th International Conference on Machine Learning.

PDF URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2022). Exact Paired-Permutation Testing for Structured Test Statistics. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou (2022). BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Tianyu Liu, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan (2022). A Structured Span Selector. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF URL

Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell (2022). Probing for the Usage of Grammatical Number. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF URL

Alexander Immer, Lucas Torroba Hennigen, Vincent Fortuin, Ryan Cotterell (2022). Probing as Quantifying the Inductive Bias of Pre-trained Representations. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF URL

On the probability-quality paradox in language generation (2022). On the probability-quality paradox in language generation. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF URL

Aryaman Arora, Clara Meister, Ryan Cotterell (2022). Estimating the Entropy of Linguistic Distributions. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF URL

Clara Meister, Tiago Pimentel, Thomas Hikaru Clark, Ryan Cotterell, Roger P. Levy (2022). Analyzing Wrap-Up Effects through an Information-Theoretic Lens. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

PDF URL

Rita Sevastjanova, Eren Cakmak, Shauli Ravfogel, Ryan Cotterell, Mennatallah El-Assady (2022). Visual Comparison of Language Model Adaptation. IEEE Visualization.

Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin (2022). State-of-the-art generalisation research in NLP: a taxonomy and review. arXiv.

URL

Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner, Ryan Cotterell (2022). On the Intersection of Context-Free and Regular Languages. arXiv.

PDF URL

Gian Wiher, Clara Meister, Ryan Cotterell (2022). On Decoding Strategies for Neural Text Generators. Transactions of the Association for Computational Linguistics.

PDF URL

Tiago Pimentel, Clara Meister, Ryan Cotterell (2022). Cluster-based Evaluation of Automatically Generated Text. arXiv.

PDF URL

Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell, Aaron Schein (2022). An Ordinal Latent Variable Model of Conflict Intensity. arXiv.

PDF URL

Niklas Stoehr, Lucas Torroba Hennigen, Samin Ahbab, Robert West, Ryan Cotterell (2021). Text or Topology? Classifying Ally-Enemy Pairs in Militarised Conflict. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Tim Vieira, Ryan Cotterell, Jason Eisner (2021). Searching for More Efficient Dynamic Programs. Findings of EMNLP.

PDF URL

Tim Vieira, Ryan Cotterell, Jason Eisner (2021). Searching for More Efficient Dynamic Programs. Findings of the Association for Computational Linguistics: EMNLP 2021.

Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy (2021). Revisiting the Uniform Information Density Hypothesis. EMNLP.

PDF Code URL

Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy (2021). Revisiting the Uniform Information Density Hypothesis. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell (2021). Phone-level Uniform Information Density across and within Languages. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Tiago Pimentel, Clara Meister, Simone Teufel, Ryan Cotterell (2021). On Homophony and Rényi Entropy. EMNLP.

PDF Code URL

Tiago Pimentel, Clara Meister, Simone Teufel, Ryan Cotterell (2021). On Homophony and Rényi Entropy. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Damian Pascual, Beni Egressy, Clara Meister, Ryan Cotterell, Roger Wattenhofer (2021). Keyword2Text: A Plug-and-Play Method for Controlled Text Generation. Findings of the Association for Computational Linguistics: EMNLP 2021.

Jennifer C. White, Ryan Cotterell (2021). Equivariant Transduction through Invariant Alignment. Findings of the Association for Computational Linguistics: EMNLP 2021.

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). Efficient Sampling of Dependency Structure. EMNLP.

PDF Code URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). Efficient Sampling of Dependency Structure. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Clara Meister, Afra Amini, Tim Vieira, Ryan Cotterell (2021). Conditional Poisson Stochastic Beams. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Clara Meister, Afra Amini, Tim Vieira, Ryan Cotterell (2021). Conditional Poisson Stochastic Beam Search. EMNLP.

PDF Code URL

Niklas Stoehr, Lucas Torroba Hennigen, Samin Ahbab, Robert West, Ryan Cotterell (2021). Classifying Dyads for Militarized Conflict Analysis. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

URL

Niklas Stoehr, Josef Valvoda, Lucas Torroba Hennigen, Giuseppe Russo, Robert West, Ryan Cotterell (2021). Adjusting the Conflict-Cooperation Scale for Armed Conflict Assessment. Findings of the Association for Computational Linguistics: EMNLP 2021.

Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell (2021). A surprisal--duration trade-off across and within the world’s languages. EMNLP.

PDF Code URL

Damian Pascual, Beni Egressy, Clara Meister, Ryan Cotterell, Roger Wattenhofer (2021). A Plug-and-Play Method for Controlled Text Generation. Findings of EMNLP.

PDF Code URL

Tiago Pimentel, Ryan Cotterell (2021). A Bayesian Framework for Information-Theoretic Probing. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.

Tiago Pimentel, Ryan Cotterell (2021). A Bayesian Framework for Information-Theoretic Probing. EMNLP.

PDF Code URL

Tiago Pimentel, Maria Ryskina, Sabrina J. Mielke, Shijie Wu, Eleanor Chodroff, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Salam Khalifa, Nizar Habash, Charbel El-Khaissi, Omer Goldman, Michael Gasser, William Lane, Matt Coler, Arturo Oncevay, Jaime Rafael Montoya Samame, Gema Celeste Silva Villegas, Adam Ek, Jean-Philippe Bernardy, Andrey Shcherbakov, Aziyana Bayyr-ool, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Andrew Krizhanovsky, Natalia Krizhanovsky, Clara Vania, Sardana Ivanova, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Duygu Ataman, Witold Kieraś, Marcin Woliński, Totok Suhardijanto, Niklas Stoehr, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Richard J. Hatcher, Emily Prud'hommeaux, Ritesh Kumar, Mans Hulden, Botond Barta, Dorina Lakatos, Gábor Szolnok, Judit Ács, Mohit Raj, David Yarowsky, Ryan Cotterell, Ben Ambridge, Ekaterina Vylomova (2021). SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages. Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). On Finding the $K$-best Non-projective Dependency Trees. ACL.

PDF Code URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). On Finding the $K$-best Non-projective Dependency Trees. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).

PDF

Irene Nikkarinen*, Tiago Pimentel*, Damián Blasi, Ryan Cotterell (2021). Modelling the Unigram Distribution. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

URL

Irene Nikkarinen$^*$, Tiago Pimentel$^*$, Damián Blasi, Ryan Cotterell (2021). Modeling the Unigram Distribution. Findings of ACL.

PDF Code URL

Clara Meister, Ryan Cotterell (2021). Language Model Evaluation Beyond Perplexity. ACL.

PDF URL

Clara Meister, Ryan Cotterell (2021). Language Model Evaluation Beyond Perplexity. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).

PDF

Clara Meister, Stefan Lazov, Isabelle Augenstein, Ryan Cotterell (2021). Is Sparse Attention more Interpretable?. ACL.

PDF URL

Clara Meister, Stefan Lazov, Isabelle Augenstein, Ryan Cotterell (2021). Is Sparse Attention more Interpretable?. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 2: Short Papers).

PDF

Higher-order Derivatives of Weighted Finite-state Machines (2021). Higher-order Derivatives of Weighted Finite-state Machines. ACL.

PDF Code URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). Higher-order Derivatives of Weighted Finite-state Machines. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 2: Short Papers).

PDF

Jennifer C. White, Ryan Cotterell (2021). Examining the Inductive Bias of Neural Language Models with Artificial Languages. ACL.

PDF Code URL

Jennifer C. White, Ryan Cotterell (2021). Examining the Inductive Bias of Neural Language Models with Artificial Languages. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).

URL

Clara Meister, Martina Forster, Ryan Cotterell (2021). Determinantal Beam Search. ACL.

PDF Code URL

Clara Meister, Martina Forster, Ryan Cotterell (2021). Determinantal Beam Search. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).

PDF

Jason Wei, Clara Meister, Ryan Cotterell (2021). A cognitive regularizer for language modeling. ACL.

PDF URL

Jason Wei, Clara Meister, Ryan Cotterell (2021). A cognitive regularizer for language modeling. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).

PDF

Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell, Simone Teufel (2021). What About the Precedent: An Information-Theoretic Analysis of Common Law. NAACL.

PDF Code URL

Josef Valvoda, Tiago Pimentel, Niklas Stoehr, Ryan Cotterell, Simone Teufel (2021). What About the Precedent: An Information-Theoretic Analysis of Common Law. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF

Elizabeth Salesky, Badr M. Abdullah, Sabrina Mielke, Elena Klyachko, Oleg Serikov, Edoardo Maria Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova (2021). SIGTYP 2021 Shared Task: Robust Spoken Language Identification. Proceedings of the Third Workshop on Computational Typology and Multilingual NLP.

Tiago Pimentel$^*$, Irene Nikkarinen$^*$, Kyle Mahowald, Ryan Cotterell, Damián Blasi (2021). How (Non-)Optimal is the Lexicon?. NAACL.

PDF URL

Tiago Pimentel*, Irene Nikkarinen*, Kyle Mahowald, Ryan Cotterell, Damián Blasi (2021). How (Non-)Optimal is the Lexicon?. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

URL

Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi (2021). Finding Concept-specific Biases in Form--Meaning Associations. NAACL.

PDF Code URL

Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi (2021). Finding Concept-specific Biases in Form--Meaning Associations. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

URL

Rowan Hall Mauslay, Ryan Cotterell (2021). Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing. NAACL.

PDF Code URL

Rowan Hall Mauslay, Ryan Cotterell (2021). Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

PDF

Jennifer C. White, Tiago Pimentel, Naomi Saphra, Ryan Cotterell (2021). A Non-Linear Structural Probe. NAACL.

PDF Anthology arXiv

Jennifer C. White, Tiago Pimentel, Naomi Saphra, Ryan Cotterell (2021). A Non-Linear Structural Probe. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

URL

Martina Forster, Clara Meister, Ryan Cotterell (2021). Searching for Search Errors in Neural Morphological Inflection. EACL.

PDF Code URL

Martina Forster, Clara Meister, Ryan Cotterell (2021). Searching for Search Errors in Neural Morphological Inflection. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics.

URL

Tiago Pimentel, Ryan Cotterell, Brian Roark (2021). Disambiguatory signals are stronger in word initial positions. EACL.

PDF Code URL

Shijie Wu, Mans Hulden, Ryan Cotterell (2021). Applying the Transformer to Character-level Transduction. EACL.

PDF Code URL

Shijie Wu, Mans Hulden, Ryan Cotterell (2021). Applying the Transformer to Character-level Transduction. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics.

PDF

Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parović, Roi Reichart, Anna Korhonen (2021). Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages. TACL.

PDF Code URL

Adina Williams, Ryan Cotterell, Lawrence Wolf-Sonkin, Damián Blasi, Hanna Wallach (2021). On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs. TACL.

PDF URL

Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, Desmond Elliott (2021). Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs. TACL.

PDF Code URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2021). Efficient Computation of Expectations under Spanning Tree Distributions. TACL.

PDF Code URL

Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan (2021). Differentiable Subset Pruning of Transformer Heads. TACL.

PDF Code URL

Zeerak Talat$^*$, Hagen Blix$^*$, Adina Williams, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell (2021). A Word on Machine Ethics: A Response to Jiang et al. (2021).

PDF

Paula Czarnowska, Sebastian Ruder, Ryan Cotterell, Ann Copestake (2020). Morphologically Aware Word-Level Translation. COLING.

PDF URL

Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein (2020). SIGTYP 2020 Shared Task: Prediction of Typological Features. SIGTYP.

PDF Code URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2020). Please Mind the Root: Decoding Arborescences for Dependency Parsing. EMNLP.

PDF Code URL

Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell (2020). Pareto Probing: Trading Off Accuracy for Simplicity. EMNLP.

PDF Code URL

Arya D. McCarthy, Adina Williams, Shijia Liu, David Yarowsky, Ryan Cotterell (2020). Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions. EMNLP.

PDF URL

Jun Yen Leung, Guy Emerson, Ryan Cotterell (2020). Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model. EMNLP.

PDF URL

Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell (2020). Intrinsic Probing through Dimension Selection. EMNLP.

PDF Code URL

Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell (2020). Intrinsic Probing through Dimension Selection. EMNLP.

PDF Code URL

Clara Meister, Tim Vieira, Ryan Cotterell (2020). If Beam Search is the Answer, What was the Question?. EMNLP.

PDF Code URL

Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi (2020). Finding Concept-specific Biases in Form–Meaning Associations. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing.

Francisco Vargas, Ryan Cotterell (2020). Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation. EMNLP.

PDF Code URL

Francisco Vargas Palomo, Ryan Cotterell (2020). Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing.

Alexander Erdmann, Micha Elsner, Shijie Wu, Ryan Cotterell, Nizar Habash (2020). The Paradigm Discovery Problem. ACL.

PDF Code URL

Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Maria Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff, Ryan Cotterell, Miikka Silfverberg, Mans Hulden (2020). SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection. SIGMORPHON.

PDF Code URL

Adina Williams, Tiago Pimentel, Arya McCarthy, Hagen Blix, Eleanor Chodroff, Ryan Cotterell (2020). Predicting Declension Class from Form and Meaning. ACL.

PDF Code URL

Rowan Hall Maudslay, Tiago Pimentel, Ryan Cotterell, Simone Teufel (2020). Metaphor Detection Using Context and Concreteness. Second Workshop on Figurative Language Processing.

PDF URL

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, Naoaki Okazaki (2020). It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information. ACL.

PDF Code URL

Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell (2020). Information-Theoretic Probing for Linguistic Structure. ACL.

PDF Code URL

Clara Meister, Elizabeth Salesky, Ryan Cotterell (2020). Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing. ACL.

PDF Code URL

Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell (2020). A Tale of a Probe and a Parser. ACL.

PDF URL

Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner (2020). A Corpus for Large-Scale Phonetic Typology. ACL.

PDF Code URL

Arya D. McCarthy, Christo Kirov, Matteo Grella, Amrit Nidhi, Patrick Xia, Kyle Gorman, Ekaterina Vylomova, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, Timofey Arkhangelskiy, Nataly Krizhanovsky, Andrew Krizhanovsky, Elena Klyachko, Alexey Sorokin, John Mansfield, Valts Ernštreits, Yuval Pinter, Cassandra L. Jacobs, Ryan Cotterell, Mans Hulden, David Yarowsky (2020). UniMorph 3.0: Universal Morphology. LREC.

PDF URL

Martina Forster, Clara Meister (2020). SIGMORPHON 2020 Task 0 System Description: ETH Zürich Team. SIGMORPHON.

PDF URL

Tiago Pimentel, Brian Roark, Ryan Cotterell (2020). Phonotactic Complexity and its Trade-offs. TACL.

PDF Code URL

Edoardo M Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parovic, Roi Reichart, Anna Korhonen (2020). Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages. Transactions of the Association for Computational Linguistics.

Adina Williams, Ryan Cotterell, Lawrence Wolf-Sonkin, Damian Blasi, Hanna Wallach (2020). On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs. Transactions of the Association for Computational Linguistics.

URL

Ran Zmigrod, Tim Vieira, Ryan Cotterell (2020). Efficient Computation of Expectations under Spanning Tree Distributions. Transactions of the Association for Computational Linguistics.

URL

Clara Meister, Tim Vieira, Ryan Cotterell (2020). Best-First Beam Search. TACL.

PDF Code URL

Edoardo Maria Ponti, Ivan Vulić, Ryan Cotterell, Roi Reichart, Anna Korhonen (2019). Towards Zero-Shot Language Modeling. EMNLP.

PDF Anthology arXiv

Adina Williams, Ryan Cotterell, Lawrence Wolf-Sonkin, Damian Blasi, Hanna Wallach (2019). Quantifying the Semantic Core of Gender Systems. EMNLP.

PDF Anthology arXiv

Rowan Hall Maudslay, Hila Gonen, Ryan Cotterell, Simone Teufel (2019). It’s All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution. EMNLP.

PDF Anthology arXiv

Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang (2019). Examining Gender Bias in Languages with Grammatical Gender. EMNLP.

PDF Anthology arXiv

Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell, Ann Copestake (2019). Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction. EMNLP.

PDF Anthology arXiv

Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden (2019). The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection. SIGMORPHON.

PDF Anthology arXiv

Sabrina Mielke, Ryan Cotterell, Kyle Gorman, Brian Roark, Jason Eisner (2019). What Kind of Language Is Hard to Language-Model?. ACL.

PDF Anthology arXiv

Alexander M. Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Isabelle Augenstein, Ryan Cotterell (2019). Unsupervised Discovery of Gendered Language through Latent-Variable Modeling. ACL.

PDF Anthology arXiv

Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein (2019). Uncovering Typological Implications with Belief Nets. ACL.

PDF Anthology arXiv

Damian Blasi, Ryan Cotterell, Lawrence Wolf-Sonkin, Sabine Stoll, Balthasar Bickel, Marco Baroni (2019). On the distribution of deep clausal embeddings: A large cross-linguistic study. ACL.

PDF Anthology

Shijie Wu, Ryan Cotterell, Timothy J. O'Donnell (2019). Measuring Morphological Irregularity. ACL.

PDF Anthology arXiv

Tiago Pimentel, Arya McCarthy, Damian Blasi, Brian Roark, Ryan Cotterell (2019). Meaning to Form: Measuring Systematicity as Information. ACL.

PDF Anthology arXiv

Ran Zmigrod, Sabrina Mielke, Hanna Wallach, Ryan Cotterell (2019). Counterfactual Data Augmentation for Mitigating Gender Bias in Languages with Rich Morphology. ACL.

PDF Anthology arXiv

Shijia Liu, Adina Williams, Hongyuan Mei, Ryan Cotterell (2019). On the Idiosyncrasies of the Mandarin Chinese Classifier System. NAACL.

PDF Anthology arXiv

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordonez, Kai-Wei Chang (2019). Gender Bias in Contextualized Word Embeddings. NAACL.

PDF Anthology arXiv

Ekaterina Vylomova, Ryan Cotterell, Timothy Baldwin, Trevor Cohn, Jason Eisner (2019). Contextualization of Morphological Inflection. NAACL.

PDF Anthology arXiv

Alexander Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Ryan Cotterell, Isabelle Augenstein (2019). Combining Sentiment Lexica with a Multi-View Variational Autoencoder. NAACL.

PDF Anthology arXiv

Chaitanya Malaviya, Shijie Wu, Ryan Cotterell (2019). A Simple Joint Model for Improved Contextual Neural Lemmatization. NAACL.

PDF Anthology arXiv

Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein (2019). A Probabilistic Generative Model of Linguistic Typology. NAACL.

PDF Anthology arXiv

Ryan Cotterell, Christo Kirov, Mans Hulden, Jason Eisner (2019). On the Complexity and Typology of Inflectional Morphological Systems. TACL.

PDF Anthology arXiv

Christo Kirov, Ryan Cotterell (2019). Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate. Transactions of the Association for Computational Linguistics.

PDF

Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sabrina Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner, Mans Hulden (2018). The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. CoNLL.

PDF Anthology arXiv

Arya D. McCarthy, Miikka Silfverberg, Ryan Cotterell, Mans Hulden, David Yarowsky (2018). Marrying Universal Dependencies and Universal Morphology. UDW.

PDF Anthology arXiv

Shijie Wu, Pamela Shapiro, Ryan Cotterell (2018). Hard Non-Monotonic Attention for Character-Level Transduction. EMNLP.

PDF URL

Yova Kementchedjhieva, Sebastian Ruder, Ryan Cotterell, Anders Søgaard (2018). Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction. CoNLL.

PDF Anthology arXiv

Sebastian Ruder$^*$, Ryan Cotterell$^*$, Yova Kementchedjhieva, Anders Søgaard (2018). A Discriminative Latent-Variable Model for Bilingual Lexicon Induction. EMNLP.

PDF Anthology arXiv

Lawrence Wolf-Sonkin$^*$, Jason Naradowsky$^*$, Sabrina J. Mielke$^*$, Ryan Cotterell$^*$ (2018). A Structured Variational Autoencoder for Contextual Morphological Inflection. ACL.

PDF Anthology arXiv

Ryan Cotterell, Christo Kirov, Sabrina J. Mielke, Jason Eisner (2018). Unsupervised Disambiguation of Syncretism in Inflected Lexicons. NAACL.

PDF Anthology arXiv

Ryan Cotterell, Sabrina J. Mielke, Jason Eisner, Brian Roark (2018). Are All Languages Equally Hard to Language-Model?. NAACL.

PDF Anthology arXiv

Ryan Cotterell, Jason Eisner (2018). A Deep Generative Model of Vowel Formant Typology. NAACL.

PDF Anthology arXiv

Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina Mielke, Arya McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden (2018). UniMorph 2.0: Universal Morphology. LREC.

PDF Anthology arXiv

Christo Kirov, Ryan Cotterell (2018). Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate. TACL.

PDF Anthology arXiv

Ryan Cotterell, Christo Kirov, Mans Hulden, Jason Eisner (2018). On the Diachronic Stability of Irregularity in Inflectional Morphology. arXiv.

PDF URL

Ryan Cotterell, Hinrich Schütze (2018). Joint Semantic Synthesis and Morphological Analysis of the Derived Word. TACL.

PDF Anthology arXiv

Ryan Cotterell, Julia Kreutzer (2018). Explaining and Generalizing Back-Translation through Wake-Sleep. arXiv.

PDF URL

Ryan Cotterell, Kevin Duh. (2017). Low-Resource Named Entity Recognition with Cross-lingual, Character-Level Neural Conditional Random Fields. IJCNLP.

PDF Anthology

Ryan Cotterell, Ekaterina Vylomova, Huda Khayrallah, Christo Kirov, David Yarowsky (2017). Paradigm Completion for Derivational Morphology. EMNLP.

PDF Anthology

Ryan Cotterell, Georg Heigold (2017). Cross-lingual, Character-Level Neural Morphological Tagging. EMNLP.

PDF Anthology arXiv

Ryan Cotterell, Jason Eisner (2017). Probabilistic Typology: Deep Generative Models of Vowel Inventories. ACL.

PDF Anthology arXiv

Katharina Kann, Ryan Cotterell, Hinrich Schütze (2017). One-Shot Neural Cross-Lingual Transfer for Paradigm Completion. ACL.

PDF Anthology arXiv

Francis Ferraro, Adam Poliak, Ryan Cotterell, Benjamin Van Durme (2017). Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles. *SEM.

PDF Anthology

Francis Ferraro, Adam Poliak, Ryan Cotterell, Benjamin Van Durme (2017). Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles. Proceedings of the 6th Joint Conference on Lexical and Computational Semantics.

PDF

Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden (2017). CoNLL--SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages. CoNLL.

PDF Anthology

Katharina Kann, Ryan Cotterell, Hinrich Schütze (2017). Neural Multi-Source Morphological Reinflection. EACL.

PDF Anthology arXiv

Ryan Cotterell, John Sylak-Glassman, Christo Kirov (2017). Neural Graphical Models over Strings for Principal Parts Morphological Paradigm Completion. EACL.

PDF Anthology

Arun Kumar, Ryan Cotterell, Lluís Padró, Antoni Oliver (2017). Morphological Analysis of the Dravidian Language Family. EACL.

PDF Anthology

Ryan Cotterell, Adam Poliak, Ben Van Durme, Jason Eisner (2017). Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis. EACL.

PDF Anthology

Ekaterina Vylomova, Ryan Cotterell, Timothy Baldwin, Trevor Cohn (2017). Context-Aware Prediction of Derivational Word-forms. EACL.

PDF Anthology

Christo Kirov, John Sylak-Glassman, Rebecca Knowles, Ryan Cotterell, Matt Post (2017). A Rich Morphological Tagger for English: Exploring the Cross-Linguistic Tradeoff Between Morphology and Syntax. EACL.

PDF Anthology

Tim Vieira$^*$, Ryan Cotterell$^*$, Jason Eisner (2016). Speed-Accuracy Tradeoffs in Tagging with Variable-Order CRFs and Structured Sparsity. EMNLP.

PDF Anthology

Katharina Kann, Ryan Cotterell, Hinrich Schütze (2016). Neural Morphological Analysis: Encoding-Decoding Canonical Segments. EMNLP.

PDF URL

Ryan Cotterell, Arun Kumar, Hinrich Schütze (2016). Morphological Segmentation Inside-Out. EMNLP.

PDF Anthology arXiv

Ryan Cotterell, Christo Kirov, John Sylak-Glassman, David Yarowsky, Jason Eisner, Mans Hulden (2016). The SIGMORPHON 2016 Shared Task—Morphological Reinflection. SIGMORPHON.

PDF URL

Ryan Cotterell, Hinrich Schütze, Jason Eisner (2016). Morphological Smoothing and Extrapolation of Word Embeddings. ACL.

PDF URL

Pushpendre Rastogi, Ryan Cotterell, Jason Eisner (2016). Weighting Finite-State Transductions With Neural Context. NAACL.

PDF Anthology

Ryan Cotterell, Tim Vieira, Hinrich Schütze (2016). A Joint Model of Orthography and Morphological Segmentation. NAACL.

PDF Anthology

John Sylak-Glassman, Ryan Cotterell (2016). Contrastive Morphological Typology and Logical Hierarchies. Chicago Linguistic Society.

PDF

Chandler May, Ryan Cotterell, Benjamin Van Durme (2016). Analysis of Morphology in Topic Modeling. arXiv.

PDF URL

Thomas Müller, Ryan Cotterell, Alexander Fraser, Hinrich Schütze (2015). Joint Lemmatization and Morphological Tagging with Lemming. EMNLP.

PDF Anthology

Nanyun Peng, Ryan Cotterell, Jason Eisner (2015). Dual Decomposition Inference for Graphical Models over Strings. EMNLP.

PDF Anthology

Ryan Cotterell, Jason Eisner (2015). Penalized Expectation Propagation for Graphical Models over Strings. NAACL.

PDF Anthology

Ryan Cotterell, Hinrich Schütze (2015). Morphological Word Embeddings. NAACL.

PDF Anthology arXiv

Ryan Cotterell, Thomas Müller, Alexander Fraser, Hinrich Schütze (2015). Labeled Morphological Segmentation with Semi-Markov Models. CoNLL.

PDF Anthology

Ryan Cotterell, Nanyun Peng, Jason Eisner (2015). Modeling Word Forms Using Latent Underlying Morphs and Phonology. TACL.

PDF Anthology

Ryan Cotterell, Nanyun Peng, Jason Eisner (2014). Stochastic Contextual Edit Distance and Probabilistic FSTs. ACL.

PDF Anthology

Gaurav Kumar, Yuan Cao, Ryan Cotterell, Chris Callison-Burch, Daniel Povey, Sanjeev Khudanpur (2014). Translation of the CALLHOME Egyptian Arabic Corpus For Conversational Speech Translation. IWSLT.

PDF Anthology

Ryan Cotterell, Adithya Renduchintala, Naomi Saphra, Chris Callison-Burch (2014). An Algerian Arabic-French Code-Switched Corpus. Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tool 2014.

PDF Anthology

Ryan Cotterell, Chris Callison-Burch (2014). A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic. LREC.

PDF URL