Benchmarking Compositionality with Formal Languages

Josef Valvoda, Naomi Saphra, Jon Rawski, Adina Williams, Ryan Cotterell

October 2022

Abstract

Recombining known primitive concepts into larger novel combinations is a quintessentially human cognitive capability. Whether large neural models in NLP acquire this ability while learning from data is an open question. In this paper, we look at this problem from the perspective of formal languages. We use deterministic finite-state transducers to make an unbounded number of datasets with controllable properties governing compositionality. By randomly sampling over many transducers, we explore which of their properties (number of states, alphabet size, number of transitions, etc.) contribute to learnability of a compositional relation by a neural network. In general, we find that the models either learn the relations completely or not at all. The key is transition coverage, setting a soft learnability limit at 400 examples per transition.

Type

Conference paper

Publication

Proceedings of the 29th International Conference on Computational Linguistics

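To make the setup in the abstract concrete, the following is a minimal sketch of how a random deterministic finite-state transducer could be sampled, used to generate input/output string pairs, and checked for transition coverage. It is an illustration under stated assumptions, not the paper's procedure: the function names (`sample_dfst`, `transduce`, `make_dataset`), the uniform sampling scheme, the fixed start state, and the reading of "examples per transition" as the number of examples that exercise a transition at least once are all hypothetical choices made here.

```python
import random
from collections import Counter


def sample_dfst(num_states, in_alphabet, out_alphabet, seed=0):
    """Sample a random deterministic FST with a transition for every (state, symbol).

    Returns a dict mapping (state, input_symbol) -> (next_state, output_symbol).
    Uniform sampling is an assumption of this sketch.
    """
    rng = random.Random(seed)
    return {
        (q, a): (rng.randrange(num_states), rng.choice(out_alphabet))
        for q in range(num_states)
        for a in in_alphabet
    }


def transduce(fst, inputs, start=0):
    """Run the transducer on an input string; return the output and the transitions used."""
    state, outputs, used = start, [], []
    for a in inputs:
        used.append((state, a))
        state, b = fst[(state, a)]
        outputs.append(b)
    return "".join(outputs), used


def make_dataset(fst, in_alphabet, n_examples, max_len=10, seed=0):
    """Generate (input, output) pairs and count, per transition, the examples that use it."""
    rng = random.Random(seed)
    pairs, coverage = [], Counter()
    for _ in range(n_examples):
        length = rng.randint(1, max_len)
        x = "".join(rng.choice(in_alphabet) for _ in range(length))
        y, used = transduce(fst, x)
        pairs.append((x, y))
        coverage.update(set(used))  # each example counts once per transition
    return pairs, coverage


if __name__ == "__main__":
    fst = sample_dfst(num_states=3, in_alphabet="ab", out_alphabet="xy", seed=1)
    pairs, coverage = make_dataset(fst, "ab", n_examples=2000, seed=2)
    # Transitions seen in fewer than 400 examples fall below the soft limit
    # mentioned in the abstract (under this sketch's reading of coverage).
    low = {t: c for t, c in coverage.items() if c < 400}
    print(f"{len(pairs)} examples, {len(low)} under-covered transitions")
```

Varying `num_states`, the alphabets, and `n_examples` in a sketch like this gives a rough feel for how dataset size interacts with per-transition coverage; transitions that are unreachable from the start state never appear in `coverage` at all.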