Prior studies focus on investigating how to affect the compositionality of the emergent language. Researchers have found that various environmental pressures would affect compositionality, e.g., small vocabulary sizes[3], memoryless[4], carefully constructed rewards[5] and ease-of-teaching[6]. However, these works only consider \emph{nurture} [7] (i.e., environmental factors), rather than \emph{nature} (i.e., hereditary factors from agents), when inducing or exploring the emergent language without exception. Moreover, some environmental pressures, like regrading the entropy as an item of additional rewards, may be too ideal to exist in the real world.
In contrast to prior work, we investigate the compositionality of emergent language from a new perspective, i.e., the agent capacity. Different from previous work that only considers external environmental factors, we study the impact of agent internal capacity on the compositionality of emergent language. Specifically, we first analyze the correlation between agent capacity and compositionality theoretically, and propose a novel metric to evaluate compostionality quantitatively. Then, on the basis of the theoretical analysis and the metric proposed, we verify the relationship between agent capacity and compostionality experimentally.
Theoretically, on the basis of mutual information theory[8], we analyse the correlation between compostionality of the emergent language and complexity of the semantic information carried by a symbol. Such semantic information can be characterized in neural network-based agents and requires the certain capacity (i.e., the count of neural nodes in the hidden layer). Specifically, we use the MSC (Markov Series Channel)[9] to model the language transmission process and use the probability distribution of symbols and concepts to model policies of agents. After modelling, we use the mutual information matrix MRI^B to quantitatively represent the semantic information, and each column of MRI^B correspond to information carried by one symbol. We find that each column of the matrix should be an one-hot vector for a perfectly compositional language, cause a symbol only transmit information of a certain concept exclusively. Therefore, the average similarity between the columns of MRI^B and a one-hot vector is higher, indicating that the emergent language is more compostional (i.e., the compostionality is higher). We propose the metric \emph{MIS} to measure compositionality by calculating such average similarity quantitatively. Different from other metrics, such as \emph{topographic similarity}[10] and \emph{posdis}[11], \emph{MIS} is a bilateral metric because it takes both listener and speaker's understanding of semantics into account. Moreover, \emph{MIS} comes lower indicates that the emergent language tends to delivery semantic information about more concepts in each symbol, so that the complexity of semantic information carried by one symbol tend to be higher. As a result, higher agent capacity is required to characterize the more complex semantic information when \emph{MIS} (i.e., compositionality) is lower.
Experimentally, we verify the relationship between agent capacity and compostionality. We build a listener-speaker referential game as experimental framework, and train agents of Stochastic Policy Gradient Algorithm[12] with the correctness of forecast output from the listener as the criterion (i.e., reward). The criterion does not imply any environmental pressures on the agents. Therefore, we can study the impact of capacity on the compositionality without any environmental pressures’ affection. Moreover, to study the impact of capacity on the compositionality under a more ‘natural’ environment, the speaker and listener are disconnected models without sharing parameters. Our first experiment is to verify that agent need higher capacity to master an artificial language with lower compositionality under a scenario of language teaching. Specifically, we fabricate the speaker to output preassigned languages with different compostionality respectively, and train the listener to interpret the preassigned language. For all artificial language, we compare the accuracy curve during training process of the listener with different capacity, and show how capacity affect learning languages with different compostionality. Our second experiment is to verify that lower agent capacity would facilitate higher compostionality of the emergent language under a scenario of language inducing. Specifically, we training a speaker and a listener to create a communication protocol (i.e., emergent language), so that the listener can select the same object which is received by the speaker. By adjusting capacity and comparing the compositionality of emergent language, we show that the emergent language attend to have higher compositionality when agent capacity is restricted more stringently. As a result, these two experiments verify the negative correlation between agent capacity and compostionality both in language teaching and language inducing.
Prior studies focus on investigating how to affect the compositionality of the
emergent language. Researchers have found that various environmental pressures
would affect compositionality, e.g., small vocabulary sizes[3], memoryless[4],
carefully constructed rewards[5] and ease-of-teaching[6]. However, these works
only consider \emph{nurture} [7] (i.e., environmental factors), rather than
\emph{nature} (i.e., hereditary factors from agents), when inducing or exploring
the emergent language without exception. Moreover, some environmental pressures,
like regrading the entropy as an item of additional rewards, may be too ideal to
exist in the real world.
In contrast to prior work, we investigate the compositionality of emergent
language from a new perspective, i.e., the agent capacity. Different from
previous work that only considers external environmental factors, we study the
impact of agent internal capacity on the compositionality of emergent
language. Specifically, we first analyze the correlation between agent capacity
and compositionality theoretically, and propose a novel metric to evaluate
compostionality quantitatively. Then, on the basis of the theoretical analysis
and the metric proposed, we verify the relationship between agent capacity and
compostionality experimentally.
Theoretically, on the basis of mutual information theory[8], we analyse the
correlation between compostionality of the emergent language and complexity of
the semantic information carried by a symbol. Such semantic information can be
characterized in neural network-based agents and requires the certain capacity
(i.e., the count of neural nodes in the hidden layer). Specifically, we use the
MSC (Markov Series Channel)[9] to model the language transmission process and
use the probability distribution of symbols and concepts to model policies of
agents. After modelling, we use the mutual information matrix $MRI^B$ to
quantitatively represent the semantic information, and each column of $MRI^B$
correspond to information carried by one symbol. We find that each column of the
matrix should be an one-hot vector for a perfectly compositional language, cause
a symbol only transmit information of a certain concept exclusively. Therefore,
the average similarity between the columns of $MRI^B$ and a one-hot vector is
higher, indicating that the emergent language is more compostional (i.e., the
compostionality is higher). We propose the metric \emph{MIS} to measure
compositionality by calculating such average similarity
quantitatively. Different from other metrics, such as \emph{topographic
similarity}[10] and \emph{posdis}[11], \emph{MIS} is a bilateral metric
because it takes both listener and speaker's understanding of semantics into
account. Moreover, \emph{MIS} comes lower indicates that the emergent language
tends to delivery semantic information about more concepts in each symbol, so
that the complexity of semantic information carried by one symbol tend to be
higher. As a result, higher agent capacity is required to characterize the more
complex semantic information when \emph{MIS} (i.e., compositionality) is lower.
Experimentally, we verify the relationship between agent capacity and
compostionality. We build a listener-speaker referential game as experimental
framework, and train agents of Stochastic Policy Gradient Algorithm[12] with the
correctness of forecast output from the listener as the criterion (i.e.,
reward). The criterion does not imply any environmental pressures on the
agents. Therefore, we can study the impact of capacity on the compositionality
without any environmental pressures’ affection. Moreover, to study the impact of
capacity on the compositionality under a more ‘natural’ environment, the speaker
and listener are disconnected models without sharing parameters. Our first
experiment is to verify that agent need higher capacity to master an artificial
language with lower compositionality under a scenario of language
teaching. Specifically, we fabricate the speaker to output preassigned languages
with different compostionality respectively, and train the listener to interpret
the preassigned language. For all artificial language, we compare the accuracy
curve during training process of the listener with different capacity, and show
how capacity affect learning languages with different compostionality. Our
second experiment is to verify that lower agent capacity would facilitate higher
compostionality of the emergent language under a scenario of language
inducing. Specifically, we training a speaker and a listener to create a
communication protocol (i.e., emergent language), so that the listener can
select the same object which is received by the speaker. By adjusting capacity
and comparing the compositionality of emergent language, we show that the
emergent language attend to have higher compositionality when agent capacity is
restricted more stringently. As a result, these two experiments verify the
negative correlation between agent capacity and compostionality both in language
teaching and language inducing.
...
...
@@ -57,4 +121,4 @@ We propose a ‘bilateral’ metric \emph{MIS}, which takes both listener and sp
We analyse the relationship between compostionality and agent capacity theoretically.
We verify the negative correlation between agent capacity and compostionality both in language teaching and language inducing.