*Article* **Productivity and Predictability for Measuring Morphological Complexity**

#### **Ximena Gutierrez-Vasques 1,\*,† and Victor Mijangos 2,†**


Received: 31 October 2019; Accepted: 23 December 2019; Published: 30 December 2019

**Abstract:** We propose a quantitative approach for quantifying morphological complexity of a language based on text. Several corpus-based methods have focused on measuring the different word forms that a language can produce. We take into account not only the productivity of morphological processes but also the predictability of those morphological processes. We use a language model that predicts the probability of sub-word sequences within a word; we calculate the entropy rate of this model and use it as a measure of predictability of the internal structure of words. Our results show that it is important to integrate these two dimensions when measuring morphological complexity, since languages can be complex under one measure but simpler under another one. We calculated the complexity measures in two different parallel corpora for a typologically diverse set of languages. Our approach is corpus-based and it does not require the use of linguistic annotated data.

**Keywords:** language complexity; morphology; TTR; language model; entropy rate
