Commit aee4d501 by Zidong Du

add Makefile

parent 6b1f7234
\section{Introduction}
\label{sec:introduction}
The emergence of human language has always been an important and controversial
issue. This problem attracts attentions from a broad range of communities,
including philology, biology and computer science. In computer science,
researchers induce and analyze the emergent language in multi-agent systems by
setting up communication scenarios, such as referential games and
communication-action policies.
Compositionality is a widely used metric to evaluate the emergent language. It
is a concept in the philosophy of language [1], which describes and quantifies
how complex expressions can be assembled out of simpler parts [2]. For example,
Figure1(a) shows a perfect compositional language (with maximum
compostionality). In this example, each shape is represented by a unique value
of symbol $s_0$ and each color is represented by symbol $s_1$. Figure1(b) shows a
language with low compostionality. Colors and shapes are ambiguous if only we
extract information from a single symbol.
\begin{figure}[t]
\centering
\includegraphics[width=0.9\columnwidth]{fig/occupy}
\caption{(a): The correspondence between symbol sequences ($s_0$, $s_1$) and (shape,
color) pairs in a perfectly compostional language. $s_0$, $s_1$ in {a, b, c}, shape
in {circle, square} and color in {red, blue, green}; (b): The correspondence
between symbol sequences ($s_0$, $s_1$) and (shape, color) pairs in a language with
low compostionality.}
\label{fig:symbols}
\end{figure}
Prior studies focus on investigating how to affect the compositionality of the
emergent language. Researchers have found that various environmental pressures
would affect compositionality, e.g., small vocabulary sizes[3], memoryless[4],
carefully constructed rewards[5] and ease-of-teaching[6]. However, these works
only consider \emph{nurture} [7] (i.e., environmental factors), rather than
\emph{nature} (i.e., hereditary factors from agents), when inducing or exploring
the emergent language without exception. Moreover, some environmental pressures,
like regrading the entropy as an item of additional rewards, may be too ideal to
exist in the real world.
In contrast to prior work, we investigate the compositionality of emergent
language from a new perspective, i.e., the agent capacity. Different from
previous work that only considers external environmental factors, we study the
impact of agent internal capacity on the compositionality of emergent
language. Specifically, we first analyze the correlation between agent capacity
and compositionality theoretically, and propose a novel metric to evaluate
compostionality quantitatively. Then, on the basis of the theoretical analysis
and the metric proposed, we verify the relationship between agent capacity and
compostionality experimentally.
Theoretically, on the basis of mutual information theory[8], we analyse the
correlation between compostionality of the emergent language and complexity of
the semantic information carried by a symbol. Such semantic information can be
characterized in neural network-based agents and requires the certain capacity
(i.e., the count of neural nodes in the hidden layer). Specifically, we use the
MSC (Markov Series Channel)[9] to model the language transmission process and
use the probability distribution of symbols and concepts to model policies of
agents. After modelling, we use the mutual information matrix $MRI^B$ to
quantitatively represent the semantic information, and each column of $MRI^B$
correspond to information carried by one symbol. We find that each column of the
matrix should be an one-hot vector for a perfectly compositional language, cause
a symbol only transmit information of a certain concept exclusively. Therefore,
the average similarity between the columns of $MRI^B$ and a one-hot vector is
higher, indicating that the emergent language is more compostional (i.e., the
compostionality is higher). We propose the metric \emph{MIS} to measure
compositionality by calculating such average similarity
quantitatively. Moreover, MIS comes lower indicates that the emergent language
tends to delivery semantic information about more concepts in each symbol, so
that the complexity of semantic information carried by one symbol tend to be
higher. As a result, higher agent capacity is required to characterize the more
complex semantic information when MIS (i.e., compositionality) is lower.
Experimentally, we verify the relationship between agent capacity and
compostionality in a
Then with experiments we show that a low-bilateral (i.e. low-compositionality)
language needs higher capacity of the model to emerge. We build a
listener-speaker referential game as experimental framework, and train agents
with the correctness of forecast output from the listener as the only
criterion. The criterion does not imply any environmental pressures on the
agents. Therefore, we can study the impact of capacity on the compositionality
without any environmental pressures’ affection. Moreover, to study the impact of
capacity on the compositionality under a more ‘natural’ environment, the speaker
and listener are individual agents, i.e. disconnected models without sharing
parameters. The conclusion suggests that by restricting the number of neurons in
a model the emerging languages attend to have higher bilaterality, thus higher
compositionality.
This paper makes the following contributions:
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment